A smart transcription platform that quickly converts speech to text and refines transcripts for specific professional contexts, achieving over 90% accuracy.
Clarison is a full-stack intelligent transcription platform built to deliver high-accuracy speech-to-text conversion with context-aware refinement. Developed from November to December 2025, the platform combines on-premise AI models with cloud-based LLM processing to provide sector-specific transcription services for medical, legal, and other professional domains.
The platform uses a three-stage pipeline that combines real-time transcription with post-processing. Audio is first transcribed by the Ollama Whisper model, then passed to a Python-based diarization microservice that identifies the speakers. Finally, the Gemini LLM refines the transcript using sector-specific prompts to produce professional-grade output.
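The flow described above (transcribe, diarize, refine) can be sketched roughly as follows. This is a minimal illustration, not Clarison's actual code: the `Segment` type, the `SECTOR_PROMPTS` table, the prompt wording, and the function names are all hypothetical, and the three stages are passed in as plain callables so that no specific Whisper, diarization, or Gemini API is assumed.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical sector prompts; the real prompts are not given in this document.
SECTOR_PROMPTS = {
    "medical": "Correct medical terminology and drug names. Do not add content.",
    "legal": "Correct legal terminology and citations. Do not add content.",
}
DEFAULT_PROMPT = "Fix punctuation and obvious transcription errors. Do not add content."


@dataclass
class Segment:
    """One transcribed span of audio (hypothetical shape)."""
    start: float  # seconds
    end: float    # seconds
    text: str
    speaker: str = "UNKNOWN"


def build_refinement_prompt(sector: str, segments: List[Segment]) -> str:
    """Combine the sector instructions with a speaker-labelled transcript."""
    instructions = SECTOR_PROMPTS.get(sector, DEFAULT_PROMPT)
    lines = [f"{s.speaker}: {s.text}" for s in segments]
    return instructions + "\n\nTranscript:\n" + "\n".join(lines)


def run_pipeline(
    audio: bytes,
    sector: str,
    transcribe: Callable[[bytes], List[Segment]],
    diarize: Callable[[bytes, List[Segment]], List[Segment]],
    refine: Callable[[str], str],
) -> str:
    """Run the three stages in order; each stage is injected by the caller."""
    segments = transcribe(audio)         # stage 1: speech to text
    segments = diarize(audio, segments)  # stage 2: attach speaker labels
    prompt = build_refinement_prompt(sector, segments)
    return refine(prompt)                # stage 3: LLM refinement
```

Injecting the stages as callables keeps the orchestration testable with stubs and leaves the choice of model clients open.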
Frontend: React, TypeScript
Backend: Node.js, Express.js, Python
AI/ML: Ollama Whisper (on-premise), Gemini LLM
Processing: Multi-speaker diarization microservice
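
For the diarization step, one common way to attach speaker labels to transcribed text is to give each transcript segment the speaker whose turn overlaps it the most in time. The sketch below illustrates that general technique only; this document does not say whether Clarison's microservice works this way, and the `Turn` tuple and function names are hypothetical.

```python
from typing import List, Tuple

# Hypothetical diarization output: (start_seconds, end_seconds, speaker_label)
Turn = Tuple[float, float, str]


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speaker(seg_start: float, seg_end: float,
                   turns: List[Turn], default: str = "UNKNOWN") -> str:
    """Return the speaker whose turn overlaps the segment the most."""
    best_speaker, best_overlap = default, 0.0
    for t_start, t_end, speaker in turns:
        shared = overlap(seg_start, seg_end, t_start, t_end)
        if shared > best_overlap:
            best_speaker, best_overlap = speaker, shared
    return best_speaker
```

A segment that overlaps no turn keeps the default label, so gaps in the diarization output stay visible rather than being silently assigned to a speaker.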