Overview
The WT2003HX is a high-performance, high-quality audio recording chip that employs a high-performance 32-bit processor with a maximum frequency of 120MHz. It features low cost, low power consumption, high reliability, and strong versatility. The maximum recording duration is up to 70 seconds at a sampling rate of 8K. Currently, The Voice Chip are three package types available: WT2003HX-16S, WT2003HX-24SS, and WT2003HP8-32N (with a small size of 4x4mm). The control method is flexible, supporting one-wire serial port, two-wire serial port communication, and button control; it offers 8 levels of volume adjustment. It supports SPI-Flash as storage, capable of supporting up to 128Mbit external Flash.
Functional Overview
• Maximum support for an external 128Mbit Flash;
• Control method: one-wire serial port, customizable UART or button control;
• Power-on default does not play; has a BUSY status indicator, high level during recording or playback, low level when recording stops or not playing;
• Supports high-quality recording, sampling rates support 8K/12K/16K/20K/24K;
• Supports high-quality audio decoding playback in voice formats (8kbps~320kbps), producing beautiful sound quality;
• Operating voltage: 2.4-5.2V;
• Built-in 0.5W Class D amplifier, volume adjustable, with 8 levels of volume;
• Enters deep sleep mode by default after 5 seconds of power-on, needs to be woken up before sending code, otherwise the first code command is invalid, only acts as a wake-up command, sending a second command within 5 seconds is effective, refer to the provided code example from our company;
• Single voice IC in deep sleep consumes less than 5uA, current consumption of the recording circuit using internal LDO 3.3V power supply is generally between 30uA-450uA, to keep it within 5uA, other IO ports need to be used for power supply, custom engineering (communicate with our sales representative);
• Chip defaults to PWM (SPK) output before leaving the factory, external amplifier output requires connection to DAC pin, send audio switching command F4 00, details can be previewed in the function introduction section;
• Two 16-bit asynchronous divider timers;
• Digital audio stream, IIS supports master and slave modes;
• One IIC controller, one infrared remote control decoder;
• 16-bit high precision ADC, 16-bit high precision DAC;
• High-power IO drive capability, maximum direct drive of 64mA;
• Chip power-on initialization time is 200-300ms, typically the chip completes power-on initialization in 100ms, the remaining 200ms is due to our company adding voice replacement functionality, after power-on initialization completion, handshake determines if there is a need for voice update, therefore it is recommended to wait 200-300ms after power-on before sending code control;
• When using a single chip (using built-in capacity), the built-in voice needs to be written before leaving the factory;
• Supports UART for program and voice updates, recommend reserving a UART serial port on the board, refer to the serial port upgrade document for upgrades;
• Important note: If the voice chip needs to be connected to Flash, it is recommended to use Flash from ‘Waytronic’, Flash from other manufacturers cannot guarantee normal operation. (Recommend drawing 150mil and 208mil size compatible extensions for easier inventory).
Reference for recording storage duration
| Chip type | Sampling rate | Maximum capacity KByte(±50K) | Duration S(±30s,Seconds) | |
| Audio version | WT2003H4 | 8K | 350k | 77 |
| 12K | 51 | |||
| 16K | 40 | |||
| 24K | 26 | |||
| WT2003Hp8-32N | 8K | 800K | 176 | |
| 12K | 117 | |||
| 16K | 87 | |||
| 24K | 58 | |||
| WT25Q80B-8S | 8K | 800K | 176 | |
| 12K | 117 | |||
| 16K | 87 | |||
| 24K | 58 | |||
| WT25Q16B-8S | 8K | 1900K | 418 | |
| 12K | 278 | |||
| 16K | 208 | |||
| 24K | 138 | |||
| WT25Q32B-8S | 8K | 3900K | 858 | |
| 12K | 572 | |||
| 16K | 429 | |||
| 24K | 286 | |||
| WT25Q64B-8S | 8K | 8000K | 1760 | |
| 12K | 1173 | |||
| 16K | 879 | |||
| 24K | 586 | |||
| WT25Q128B-8S | 8K | 16200K | 3564 | |
| 12K | 2376 | |||
| 16K | 1782 | |||
| 24K | 1188 |
Note:
- 1. There are differences in capacity calculation methods and the characteristics of different chips, so a rough ± value range is provided for reference.
- 2. The default sampling rate for our company’s standard audio samples is generally set at 24K sampling. For more specific needs, please contact our business department.
- 3. The positive and negative deviations of ±30 seconds mentioned above mainly apply to external Flash memory.
Pin Description
The packaging of the WT2003H series voice recorder chip includes SOP16, TSSOP24, and QFN32 types, suitable for various applications. The pin diagrams and pin definitions are as follows:
1)SOP16 package pin description
| Pin | Name | Type | Explain |
| 1 | COM0/KEY1/DAT/CS | I/O | Position 0/Button 1/SD_DAT/SPI Flash Chip Select |
| 2 | COM1/KEY2/CMD/DO | I/O | Position 1/Button/2SD_CMD/SPI Flash Data |
| 3 | COM2/KEY3/CLK | I/O | Position 2/Button/3SD_CLK/SPI Flash Clock |
| 4 | ICEDAT/KEY4/D-/IO1 | I/O | Download port/key 4/D-/I/O port |
| 5 | ICECLK/KEY5/D+/IO2 | I/O | Download port/key 5/D+/I/O port |
| 6 | RXD/KEY9/DATA1/CL2K | I/O | RXD/Key 9/Single-wire serial data input/Two-wire serial clock signal input |
| 7 | LED3/KEY12/MIC | I/O | Section 3/Button 12/MIC (Microphone input pin) |
| 8 | AGND | G | Simulation |
| 9 | LED4/KEY13/DAC | I/O | Section 4/Key 13/DAC output |
| 10 | VOUT | P | External storage power port (must connect a 106 capacitor to ground) |
| 11 | VCC | P | Power input (must connect a 106 capacitor to ground) |
| 12 | GND | G | Digital land |
| 13 | PWM+ | O | Speaker terminal |
| 14 | PWM- | O | Speaker terminal |
| 15 | KEY14/LED5 | I/O | Button 14/Segment 5/Busy signal output |
| 16 | TXD/KEY15/ADC1/DATA2A22 | I/O | TXD/Push Button 15/ADC Channel 1/Two-wire Serial Data Input |
Application Scheme Diagram








