
Clarison - Intelligent Transcription

A smart transcription platform that quickly converts speech to text and refines transcripts for specific professional contexts, achieving over 90% accuracy.

Clarison Platform Screenshot

Project Overview

Clarison is a full-stack intelligent transcription platform built to deliver high-accuracy speech-to-text conversion with context-aware refinement. Developed from November to December 2025, the platform combines on-premise AI models with cloud-based LLM processing to provide sector-specific transcription services for medical, legal, and other professional domains.

Key Features

Technical Architecture

The platform uses a multi-stage architecture that combines real-time transcription with intelligent post-processing. Audio input is first transcribed by the Ollama Whisper model, then passed through a Python-based diarization microservice that identifies the speakers. Finally, the transcript undergoes context-aware refinement by the Gemini LLM, guided by sector-specific prompts, to produce professional-grade output.
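The three stages described above can be sketched as a small pipeline. This is only an illustration, not the project's actual code: the `Segment` type, the prompt texts in `SECTOR_PROMPTS`, and the `transcribe`, `diarize`, and `refine` callables are all hypothetical stand-ins for the Whisper, diarization, and Gemini calls, which are injected so that no real API is assumed.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Segment:
    """One timed piece of transcript (hypothetical shape)."""
    start: float
    end: float
    text: str
    speaker: str = "UNKNOWN"


# Hypothetical sector-specific instructions; the real prompts are not published here.
SECTOR_PROMPTS: Dict[str, str] = {
    "medical": "Correct medical terminology and keep clinical abbreviations.",
    "legal": "Preserve legal terms of art and keep citations verbatim.",
}
DEFAULT_PROMPT = "Fix punctuation and obvious transcription errors."


def build_prompt(sector: str, segments: List[Segment]) -> str:
    """Combine the sector instruction with a speaker-labeled transcript."""
    instruction = SECTOR_PROMPTS.get(sector, DEFAULT_PROMPT)
    lines = [f"{s.speaker}: {s.text}" for s in segments]
    return instruction + "\n\n" + "\n".join(lines)


def run_pipeline(
    audio: bytes,
    sector: str,
    transcribe: Callable[[bytes], List[Segment]],
    diarize: Callable[[bytes, List[Segment]], List[Segment]],
    refine: Callable[[str], str],
) -> str:
    """Stage 1: speech-to-text. Stage 2: speaker labels. Stage 3: LLM refinement."""
    segments = transcribe(audio)
    segments = diarize(audio, segments)
    return refine(build_prompt(sector, segments))


if __name__ == "__main__":
    # Stub stages so the sketch runs without any model or network access.
    def fake_transcribe(audio: bytes) -> List[Segment]:
        return [Segment(0.0, 2.0, "patient reports chest pain")]

    def fake_diarize(audio: bytes, segs: List[Segment]) -> List[Segment]:
        return [Segment(s.start, s.end, s.text, "SPEAKER_0") for s in segs]

    print(run_pipeline(b"", "medical", fake_transcribe, fake_diarize, lambda p: p))
```

Passing the stages in as functions keeps the sketch independent of how each service is actually called, and makes each stage easy to replace or test in isolation.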

Technology Stack

Frontend: React, TypeScript
Backend: Node.js, Express.js, Python
AI/ML: Ollama Whisper (on-premise), Gemini LLM
Processing: Multi-speaker diarization microservice
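
A diarization service typically returns speaker turns with timestamps, which then have to be matched to transcript segments. The sketch below shows one common way to do that, by picking the speaker whose turn overlaps each segment the most. It is an assumption about how such a merge could work, not a description of Clarison's microservice; the tuple layouts and the `assign_speakers` name are hypothetical.

```python
from typing import List, Tuple

Turn = Tuple[float, float, str]     # (start, end, speaker label), hypothetical layout
TextSeg = Tuple[float, float, str]  # (start, end, text), hypothetical layout


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[TextSeg], turns: List[Turn], unknown: str = "UNKNOWN"
) -> List[Tuple[str, str]]:
    """Label each transcript segment with the speaker whose turn overlaps it most."""
    labeled = []
    for start, end, text in segments:
        best_speaker, best_overlap = unknown, 0.0
        for t_start, t_end, speaker in turns:
            shared = overlap(start, end, t_start, t_end)
            if shared > best_overlap:
                best_speaker, best_overlap = speaker, shared
        labeled.append((best_speaker, text))
    return labeled


if __name__ == "__main__":
    segments = [(0.0, 2.0, "Good morning."), (2.0, 5.0, "How are you feeling?")]
    turns = [(0.0, 2.2, "SPEAKER_0"), (2.2, 6.0, "SPEAKER_1")]
    for speaker, text in assign_speakers(segments, turns):
        print(f"{speaker}: {text}")
```

Segments that overlap no turn at all keep the `UNKNOWN` label rather than being assigned to the nearest speaker, which makes gaps in the diarization output visible downstream.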

For more information, see the source code:

GitHub – pritamsap/Clarison-IntelligentTranscription