Skip to content

ESP32.AI Quick Start Guide

Welcome to ESP32.AI! This guide will help you quickly get started with our platform and begin your journey in ESP32 smart hardware development.

YunDo V1

YunDo is an open-source intelligent dialogue system based on large models.

Project Content

Simple Flow Chart

plaintext
+------------------+                                  +-----------------+
|    Client (ESP32)  |                                  |     Server        |
|                  |                                  |                 |
| +--------------+ |                                  | +-------------+ |
| | Audio Capture | |                                  | | MQTT Broker | |
| +--------------+ |                                  | +------+------+ |
|        |          |                                  |        ^        |
|        v          |                                  |        |        |
| +--------------+  |                                  | +------+------| |
| | MQTT Publish  |  +------------------------------->  | | Receive Audio| |
| +--------------+  |                                  | +------+------| |
|                  |                                  +------+------| |
|                  |                                  |       v        |
|                  |                                  | +-------------+ |
|                  |                                  | | Speech-to-Text| <----> Azure Speech-to-Text API
|                  |                                  | +-------------+ |
|                  |                                  |       v        |
|                  |                                  | +-------------+ |
|                  |                                  | |    LLM Model   | <----> Dify.ai
|                  |                                  | +-------------+ |
|                  |                                  |       v        |
|                  |                                  | +-------------+ |
|                  |                                  | | Text-to-Speech| <----> Azure Text-to-Speech API
|                  |                                  | +-------------+ |
|                  v                                  |       v        |
| +--------------+  <---------------------------------+  | +-------------+ |
| | Audio Playback|                                     | | MQTT Publish  |
| +--------------+                                      | +-------------+ |
+------------------+                                   +-----------------+

Hardware Overview

  • Complete Hardware with Enclosure:

Description of image

Hardware

Firmware

  • Based on MicroPython
  • Provides complete firmware code and detailed configuration instructions

Server

  • Based on Python
  • Integrates dify.ai's large model capabilities
  • Supports multiple models as described in dify.ai
  • Utilizes Azure's TTS (Text-to-Speech) and STT (Speech-to-Text) services

Key Features

  • Voice Interaction: Implements natural language understanding and generation through large models, providing intelligent voice interaction experiences.
  • Voice Recognition and Synthesis: Utilizes Azure's TTS and STT services for high-quality voice recognition and voice synthesis.
  • Open Source and Community-Driven: All code are open-source, welcoming community contributions and collaboration.

Quick Start

  1. Clone this repository:
git clone https://github.com/hx23840/YunDo.git
  1. Assemble the development board and flash the firmware according to the documentation.
  • Flashing MicroPython onto ESP32

    • Download the MicroPython Firmware:

      • Visit the MicroPython downloads page for ESP32: MicroPython for ESP32
      • Download the latest firmware binary file for ESP32.
    • Flash Firmware to ESP32:

      • Follow the instructions on the MicroPython site to flash the firmware onto your ESP32 device.
  • Setting Up Thonny IDE

  • Copy Firmware Files:

    • Open Thonny, and connect your ESP32 device via USB.
    • Use Thonny's file manager to navigate to the firmware folder.
    • Copy the MicroPython code from the firmware folder on your computer to the ESP32.
  1. Server Configuration
bash
cd Server
cp .env.example .env
  1. Install dependencies and start the server:
bash
pip install -r requirements.txt
python main.py
  1. Start interacting with the intelligent dialogue system!

Contribution

We welcome any form of contribution, including but not limited to:

  • Bug fixes
  • New feature submissions
  • Improvement suggestions
  • Documentation enhancements

License

This project is licensed under the GNU General Public License v3.0.

Contact Us

If you have any questions or suggestions, please submit them via issues, or email our development team.

Thank you for your attention and support! Let's build a smarter and more open future together.

Happy developing on the ESP32.AI platform!