Website

Youtube_Transcriber

Project Spotlight: The Ultimate Media & Document Intelligence Suite

In an era where content is king but time is scarce, having a unified tool to process, manage, and understand digital media is a game-changer. I’m excited to share my latest project: a robust, locally-hosted web application built with Django and TailwindCSS that serves as a central station for transcription, document management, and AI-powered analysis.

At its heart, this platform bridges the gap between raw data (video, audio, PDF) and actionable intelligence.

1. The Core: High-Performance Media Transcription

The primary engine of this website is a versatile Transcription Hub. Designed for creators, researchers, and archivists, it streamlines the process of converting speech to text.

YouTube Integration: Simply paste a list of YouTube URLs, and the system automatically downloads the audio and generates precise transcripts. It handles batch processing effortlessly, so you can queue up an entire playlist and let it run.
Universal File Support: Beyond web links, the system accepts direct uploads of video and audio files from your local machine.
Flexible Output: Whether you need a raw text file for notes or a timed
```
.srt
```
file for subtitles, the application generates both formats on demand.
Real-Time Queue System: A dynamic, AJAX-driven interface keeps you updated with a live progress bar, ensuring you know exactly when your transcription is started, processing, and ready for download.

2. Essential PDF Operations

Recognizing that information lives in many formats, I integrated a dedicated PDF Toolkit to handle document manipulation without relying on sketchy online tools.

Text Extraction: Instantly pull readable text from PDF documents to make them searchable and editable.
Split & Organize: Break large PDF files into manageable chunks or separate pages.
Security Tools: Unlock password-protected PDFs (using known credentials) to regain access to your documents.
Seamless Storage: All processed text from PDFs is automatically saved to the database, making it immediately available for the AI Studio.

3. AI Studio: Your Intelligent Workspace

The "Jewel in the Crown" of this project is the AI Studio. This isn't just a chatbot; it's a context-aware workspace that unifies all your processed data.

Unified Context Management: The true power lies in its memory. You can select specific transcripts, extracted PDF content, or custom text snippets as "Context" for the AI.
Multi-Provider Support: Built for flexibility, the system supports both OpenAI and OpenRouter. You can dynamically switch between models (like GPT-4, Claude, or Llama) and bring your own API keys, which are securely stored locally in your browser.
Smart Summarization & Chat: Need a summary of that 2-hour YouTube video? Or need to find connections between a PDF report and a video transcript? Select them both as context, and ask the AI. It generates insights based specifically on the data you provide.

4. Technical Architecture

Built for performance and privacy, this application runs entirely on a local server environment (compatible with WSL2).

Backend: Powered by Django 6.0, offering a secure and scalable foundation.
Frontend: Designed with TailwindCSS for a sleek, modern, and responsive user experience. It features "Glassmorphism" design elements, smooth transitions, and a clean, distraction-free interface.
Persistence: The persistent database ensures that your transcripts, documents, and notes are always there when you restart your machine.

Summary This website is more than just a transcriber; it's a complete productivity ecosystem. By combining media processing, document tools, and large language models into one persistent local web app, it empowers you to consume and create content faster than ever before.

View on GitHub

Project Gallery

View Full Size