Description
OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.
Skill File
Tags
Information
You Might Also Like
Slack Gif Creator
Knowledge and utilities for creating animated GIFs optimized for Slack
Youtube Downloader
Download YouTube videos with customizable quality and format options
Slack Gif Creator
Toolkit for creating animated GIFs optimized for Slack, with validators for size constraints and ...
Web Design Reviewer
This skill enables visual inspection of websites running locally or remotely to identify and fix ...
Frontend Ui Ux
Designer-turned-developer who crafts stunning UI/UX even without design mockups
Ui Design System
UI design system toolkit for Senior UI Designer including design token generation, component docu...