Max Translator: Intelligent AI Voice Translator

Objective

Create a mobile web application that lets people who speak different languages communicate naturally, avoiding the word-for-word literalism of machine translation and the loss of meaning in complex dialogues.

Solution

A hybrid system was developed that combines several cloud AI services. Key modules include:

  • Multi-LLM Engine: Connection to Gemini, OpenAI, Groq, and Mistral through a Cloudflare Workers layer for maximum speed and translation quality.
  • Contextual Translation (The Scene): The user can set a situation (e.g., "at the hospital" or "rental negotiations"), so the AI chooses terminology that fits that setting.
  • Double Control System: Automatic "Back Translation" with a semantic comparison that reports, as a percentage, how closely the back-translated text matches the meaning of the original.
  • Vision AI: Module for recognizing and discussing images (menus, documents, road signs).
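
As an illustration of the Scene and Double Control bullets above, here is a minimal TypeScript sketch. All names (buildScenePrompt, meaningMatchPercent, verifyRoundTrip, the Translate type) are hypothetical and not taken from the Max Translator codebase. The word-overlap score is only a crude stand-in for the semantic analysis the service performs, which presumably relies on a language model rather than on token matching.

```typescript
// Hypothetical sketch; none of these names come from the actual application.

// Any provider call (Gemini, OpenAI, Groq, Mistral) could be wrapped in this shape.
type Translate = (prompt: string) => Promise<string>;

// Build a prompt that carries the optional scene so the model can pick domain terms.
function buildScenePrompt(text: string, from: string, to: string, scene?: string): string {
  const lines = [`Translate from ${from} to ${to}. Preserve meaning, not word order.`];
  if (scene) lines.push(`Setting: ${scene}`);
  lines.push(`Text: ${text}`);
  return lines.join("\n");
}

// Lowercase, split on whitespace and common punctuation, drop duplicates.
function tokenize(s: string): string[] {
  const tokens = s.toLowerCase().split(/[\s.,!?;:"'()]+/).filter((t) => t.length > 0);
  return Array.from(new Set(tokens));
}

// Stand-in score: Jaccard overlap of the two word sets, as a whole percentage.
function meaningMatchPercent(original: string, backTranslated: string): number {
  const a = tokenize(original);
  const b = new Set(tokenize(backTranslated));
  if (a.length === 0 && b.size === 0) return 100;
  const shared = a.filter((t) => b.has(t)).length;
  const union = a.length + b.size - shared;
  return Math.round((100 * shared) / union);
}

// Translate, translate back, and report how well the round trip preserved the text.
async function verifyRoundTrip(
  translate: Translate,
  text: string,
  from: string,
  to: string,
  scene?: string
): Promise<{ translation: string; backTranslation: string; matchPercent: number }> {
  const translation = await translate(buildScenePrompt(text, from, to, scene));
  const backTranslation = await translate(buildScenePrompt(translation, to, from, scene));
  return {
    translation,
    backTranslation,
    matchPercent: meaningMatchPercent(text, backTranslation),
  };
}
```

Passing the same scene to both directions keeps the back translation in the same register as the forward one, so a low score is more likely to reflect a real shift in meaning than a change of vocabulary.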

Technical Stack

The application is organized as a monorepo and built with React 19, Tailwind CSS, and Firebase (Auth/Firestore). Audio is processed with the Web Audio API and a custom silence detector that determines when the speaker has finished. Billing is implemented with Firestore atomic transactions, and resource usage is charged per second.
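
The two mechanisms named above can be sketched in TypeScript as follows. This is an illustration under stated assumptions, not the application's code: the RMS threshold, the hold time, rounding partial seconds up, and integer credits are all assumed values and policies, and every name is hypothetical. In a browser, the frames could come from an AnalyserNode via getFloatTimeDomainData. In Firestore, the balance read and write around chargeSession would run inside a single runTransaction callback so that concurrent charges cannot overwrite each other.

```typescript
// Hypothetical sketch; thresholds, rates, and names are assumptions.

// Root-mean-square level of one audio frame (samples in the range [-1, 1]).
function rms(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  let sum = 0;
  for (let i = 0; i < frame.length; i++) sum += frame[i] * frame[i];
  return Math.sqrt(sum / frame.length);
}

class SilenceDetector {
  private quietMs = 0;
  private threshold: number;
  private holdMs: number;

  constructor(threshold = 0.01, holdMs = 800) {
    this.threshold = threshold; // assumed level below which a frame counts as quiet
    this.holdMs = holdMs;       // assumed quiet time that ends an utterance
  }

  // Feed one frame; returns true once quiet frames have lasted at least holdMs.
  push(frame: Float32Array, frameMs: number): boolean {
    this.quietMs = rms(frame) < this.threshold ? this.quietMs + frameMs : 0;
    return this.quietMs >= this.holdMs;
  }
}

interface Charge {
  seconds: number;
  cost: number;
  newBalance: number;
}

// Per-second charge: partial seconds round up; integer credits avoid float drift.
function chargeSession(balance: number, durationMs: number, creditsPerSecond: number): Charge {
  const seconds = Math.ceil(durationMs / 1000);
  const cost = seconds * creditsPerSecond;
  if (cost > balance) throw new Error("Insufficient balance");
  return { seconds, cost, newBalance: balance - cost };
}
```

Keeping chargeSession a pure function makes the arithmetic easy to test on its own, while the transaction wrapper only has to read the balance, call the function, and write newBalance.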

Check it out

You can try the service on the website: tr.ai-robot.us.