projectbanner
projectbanner

Organisation Profile

FOSSIA

FOSSIA is a women-centric commnunity aimed to bring underrepresented people to contribute to the Indian FOSS ecosystem. We primarily work on projects and initiatives related to inclusion and accessibility.

FastAPIPythonDockerChakra UINext.js
FOSSIA

MENTORS

Keerthana Rajesh Kumar

PROJECTS

TranscribeIt - Video transcription for videos and live streams

Next.jsFastAPIPythonChakra UIDocker

Problem Statement

Hard of hearing and speech population, people with visual impairments and sensory processing disorders have difficulty navigating video content. With online platforms providing little to no transcriptions or inaccurate transcriptions, especially for live streams, it hinders acccessibility. TransribeIt aims to provide customized and diarized transcriptions for making video content accessible.

Focus Area

*Backend: API development• asynchronous processing • ASR (automatic speech recognition) • diariazation • video processing *Frontend: Accessibility • testing • UI development *Deployment: Containerized deployment • monitoring • performance optimization *Documentation: Onboarding instructions for developers • technical documentation

Student Contribution Guide

1. Backend Contributions • Optimize ASR pipelines for both pre-recorded videos and live streams from local content. • Integrate speaker diarization for multi-speaker videos with customization for speaker identification. • Streamline asynchronous video processing workflows using Python and FastAPI for real-world usage. • Provide alternative formats for exporting transcripts and streamline transcript generation *Expected Outcomes: • Robust backend capable of handling multiple video formats and live streams. • Efficient, accurate transcription with speaker labels in customized manner • Formats such as Markdown, PDF and JSON for exports. 2. Frontend Contributions • Develop an accessible UI using Next.js and Chakra UI for the features described in backend section. • Implement features such as searchable transcripts, speaker highlighting, transcript exporting and timestamps for accessibility. • Perform accessibility testing (screen readers, keyboard navigation, color contrast) and integrate accessibility focused tooling for WCAG compliance. • Create a responsive design for desktop and mobile. Expected Outcomes: • User-friendly interface for uploading videos and viewing transcriptions and providing URLs. • Improved accessibility for users with visual and hearing impairments. • Enhanced user experience with intuitive navigation and customization options. 3. Deployment Contributions • Set up orchestration using Docker Compose for self-hosting by optimizing existing Docker setup. • Implement monitoring for system health and performance metrics for production. • Optimize backend performance for large video files or multiple concurrent live streams. Expected Outcomes: • Easy-to-deploy, scalable system for developers and end-users. • Improved stability and uptime. • Clear deployment guides for development and production environments. 4. Documentation Contributions • Write technical documentation for technical implementation in an integrated manner. • Create developer onboarding guides, including setup instructions. • Document API endpoints and video processing workflows using Swagger and VitePress • Maintain changelogs and contribution guidelines for future developers. Expected Outcomes: • Well-documented codebase for future contributors. • Reduced onboarding time for new developers. • Clear usage instructions for both developers and end-users. ***Milestones 1. Milestone 1 – Setup & Familiarization • Set up local development environment (Docker, FastAPI, Next.js) • Explore existing codebase and documentation 2. Milestone 2 – Core Backend Development • Create endpoints for video transcriptions in aternative formats • Optimize video transcription for larger videos • Implement speaker diarization with customisations 3. Milestone 3 – Frontend Integration • Display transcription results with timestamps and speaker information • Develop user interface for exporting transcripts in prefered format 4. Milestone 4 – Deployment & Monitoring • Containerize application for self-hosting and optimize existing Docker setup • Add logging, monitoring, and performance optimization for observability 5. Milestone 6 – Documentation & Final Touches • Complete developer guide • Conduct accessibility testing

Ready to collaborate?

Join the community chat, review the issue tracker, and pick a project to start contributing. Mentors are available to help you scope your first patch.

WOC
instagramtwitterdiscordlinkedin
ResourcesHow It WorksGetting StartedPast OrganizationsWoC 5.0 Policies & Guidelines
© GDG IIIT KALYANI 2026
EMAIL US FOR MORE DETAILS
mailbox
submit