markitdown
K-Dense-AI/scientific-agent-skills
MarkItDown converts a wide range of file types, including PDF, DOCX, images, audio, and structured data (JSON/XML), into clean, token-efficient Markdown. It also supports OCR for scanned documents, transcription for audio, and AI-enhanced descriptions, so diverse content arrives structured and ready for language models and AI pipelines.
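A minimal sketch of the conversion step described above, assuming the `markitdown` Python package is installed. The helper name `to_markdown` and its `converter` parameter are hypothetical and introduced only for illustration; the `MarkItDown().convert(...)` call and the `text_content` attribute follow the library's basic Python usage.

```python
from pathlib import Path
from typing import Any, Optional, Union


def to_markdown(source: Union[str, Path], converter: Optional[Any] = None) -> str:
    """Convert one file to Markdown text.

    `converter` is a hypothetical injection point: any object exposing
    `convert(path)` that returns something with a `text_content` attribute.
    When omitted, a default MarkItDown instance is created.
    """
    if converter is None:
        # Imported lazily so the helper can be defined (and tested with a
        # stand-in converter) even where markitdown is not installed.
        # Assumed install command: pip install 'markitdown[all]'
        from markitdown import MarkItDown

        converter = MarkItDown()

    # convert() picks a converter based on the file type and returns a
    # result object; the Markdown itself is in `text_content`.
    result = converter.convert(str(source))
    return result.text_content
```

Calling `to_markdown("report.pdf")` (a placeholder filename) would return the document as a Markdown string that can be passed to a language model prompt. Features such as OCR, audio transcription, and AI-enhanced descriptions may need optional dependencies or extra configuration that this sketch does not cover.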