AI Company Konan Technology

Why D-Cheeps is
your best choice?

As the need for fast and accurate speech-to-text conversion grows across
various environments — such as 24/7 customer service, online lectures, and
meeting transcription — D-Cheeps delivers a high-quality, Korean-optimized
speech recognition AI that automates real-time speech-to-text conversion to
maximize operational efficiency.

D‑Cheeps is an STT software powered by the latest end-to-end (E2E) deep neural network
– based Korean speech recognition technology.
It supports real-time speech recognition as well as large-scale audio file processing
in various formats, and can be flexibly deployed to fit any customer environment.

BENEFITS

Benefit 01.

Automating Repetitive Tasks

By applying STT technology, tasks such as transcription, meeting documentation, and data entry can be automated. This reduces the time spent on manual work, allowing employees to focus on more strategic activities and improving overall operational efficiency.

Benefit 02.

Reducing Errors & Improving Accuracy

STT technology minimizes omissions and transcription errors that can occur during manual input, ensuring data consistency and accuracy. Users can rely on accurate results throughout every stage of data entry and processing.

Benefit 03.

Enhancing Productivity & Saving Time

STT technology instantly converts speech into text, accelerating information utilization and significantly reducing the time required for documentation. As a result, overall processing speed improves, boosting organizational productivity and decision-making efficiency.

Benefit 04.

Improving User Experience

A voice-command-based interface enhances accessibility and convenience, allowing users to interact with devices and services more intuitively. Even in hands-busy environments, voice input enables seamless operation, maximizing usability across diverse work settings.

FEATURE

D-Cheeps delivers high-accuracy Korean speech recognition results
powered by advanced speech recognition algorithms.

High-Quality, Reliable STT

Provides highly accurate speech recognition results
powered by deep learning and LLM-based algorithms
Offers a stable, in-house developed solution with
flexible technology support tailored to customer needs

Scalable & Stable Operation

Supports up to 50 full-duplex channels per server, with flexible scalability based on processing volume
Operates on an L4 switch-based distributed
architecture for stable and reliable service delivery

Flexible Integration & Utilization

Provides data APIs delivering word- and sentence-level confidence scores, timestamps, and metadata
Seamlessly integrates with add-on services such as
meeting minutes, captions, and call analysis

TECHNOLOGY

Real-time Speech Recognition

Real-time streaming and batch
speech-to-text conversion
End-of-sentence (EOS) detection
metadata

Batch Speech Recognition

Bulk transcription by uploading audio files
Supports multiple audio formats(WAV,
PCM, MP3)
Supports multiple sampling rates
(8k, 16k, 44.1k, 48kHz)

Accuracy Enhancement

Hotword boosting and LM-assisted
inference
Support for new vocabulary and
domain adaptation training

Speaker Diarization

Speaker-wise segmentation of input
audio

System Monitoring

Monitor the STT system via a Health Check API
Provide status per server component

Server Redundancy &
Scale-Up

Active-Active redundancy configuration
Docker Compose–based deployment,
scaling, and failover

User Interface & Settings

Real-time result viewing/saving and
confidence analysis
Automatic extraction of the corresponding
audio waveform; recording settings
Configurable sampling rate and WAV file
saving options

Automatic Accuracy Calculation

Recognition accuracy computed automatically
using Character Error Rate(CER), with result
export and analytics

ARCHITECTURE

Engine
API

API Parser

Worker
Request
DB Query
Engine
Manager
Request

Engine Manager

Session Management
User Management
Database Management

DB

Worker
Request
Post-processing Dictionary
DB Query
Recognition
Accuracy
Results
Engine Manager Request
Server Status

Scheduler

Job Queue Management
Worker Mapping

Engine Worker

Worker 1

Worker N

D-Cheeps Library

Voice Activity
Detection
Speech Recognition
Speech Segment Detection
Post-processing Algorithm
Automatic Gain
Control
Hotword Boosting
Feature Extraction
Forced Alignment
Config DB
Integration
Word Position
Tracking

USE CASES

Across domains where information is delivered via voice conversations or spoken commands,
D-Cheeps — our Korean-optimized speech recognition AI — powers voice-driven workflows.

Contact Center
AI Transcription

Target Users

Customers of the National Police Agency’s call
center handling telecom and financial fraud reports

Service Overview

Automatically generates call transcripts by
converting voice phishing reports and civil
complaint calls into text in real time

Implementation Effects

Enables 24/7/365 automatic call recording and
database construction for all consultation
services
Enhances citizen service quality and fraud
response capability through data-driven analysis
of consultation content

AI Secretary for
Aircraft Design Experts

Target Users

Engineers and staff of Korea Aerospace Industries
(KAI)

Service Overview

Allows AI assistants to receive and process voice or
text queries related to aircraft design in offices,
meeting rooms, and manufacturing sites

Implementation Effects

Improves operational efficiency by providing
instant access to information anytime, anywhere
Enhances decision-making speed with 24-hour
AI-assisted service availability

Subtitles for
Broadcasts & Lectures

Target Users

Students enrolled in online university courses

Service Overview

Generates subtitles for lectures and broadcast
content in real time

Implementation Effects

Provides accessible learning materials for not only
regular students but also hearing-impaired and
international learners
Improves learning effectiveness with real-time
subtitle support during lecture playback

Why D-Cheeps is your best choice?

Automating Repetitive Tasks

Reducing Errors & Improving Accuracy

Enhancing Productivity & Saving Time

Improving User Experience

Real-time Speech Recognition

Batch Speech Recognition

Accuracy Enhancement

Speaker Diarization

System Monitoring

Server Redundancy & Scale-Up

User Interface & Settings

Automatic Accuracy Calculation

API Parser

Engine Manager

DB

Scheduler

Engine Worker

D-Cheeps Library

Talk to KONAN

Why D-Cheeps is
your best choice?

Server Redundancy &
Scale-Up