Description of App
High-quality on-device transcription. Easily convert speech to text from meetings, lectures, and more.
The transcription is powered by OpenAI's Whisper running locally on your device. The audio never leaves your device.
You can export the transcription as subtitles too.
Aiko favors accuracy over speed.
The app requires a Mac with at least 16 GB of RAM.
Supports 100 different languages:
Afrikaans, Albanian, Amharic, Arabic, Armenian, Assamese, Azerbaijani, Bangla, Bashkir, Basque, Belarusian, Bosnian, Breton, Bulgarian, Burmese, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Faroese, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Lao, Latin, Latvian, Lingala, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Marathi, Mongolian, Māori, Nepali, Norwegian, Norwegian Nynorsk, Occitan, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Sanskrit, Serbian, Shona, Sindhi, Sinhala, Slovak, Slovenian, Somali, Spanish, Swahili, Swedish, Tagalog, Tajik, Tamil, Tatar, Telugu, Thai, Tibetan, Turkish, Turkmen, Ukrainian, Urdu, Uzbek, Vietnamese, Welsh, Yiddish, Yoruba
The app was made possible thanks to Whisper by OpenAI and whisper.cpp by Georgi Gerganov.
■ FAQ
‣ Can I edit the text in the app?
I don't plan to support any editing. Export the transcription and edit it in a proper text editor.
‣ Why is the app so large?
The app delivers the highest quality transcription on the market for 100 different languages. Rather than asking why it's so large, the real question is how is it so small.
■ Technical details
The app uses the Whisper large v2 model.
■ Support
You can contact me through the feedback button in the app or at sindresorhus@gmail.com
Comments
Aiko is awesome!
I originally listened to the AppleVis podcast about this app in August of 2023. While I was a bit skeptical about the accuracy of the transcripts, I decided to try Aiko out. I was very impressed with the accuracy of the transcripts that were produced. Also, compared to Apple's transcription (ex: voicemail, audio messages, etc), the transcriptions Aiko produces are great! In fact, I find that basic punctuation, like capital letters, commas, and periods, are added. As for other punctuation marks, I typically add them when I edit the transcript. I think the developer has done a great job with this app, and I hope that further improvement/development will continue on this app. In fact, one thing I would love to see is a way to see who is speaking when, For example, an audio recording of a conversation was being transcribed, it would be great to see each person indicated by something like "speaker 1", and "speaker 2". I have no idea if this is possible, but I hope this might be added to this app in the future.
So Far So Good
I recently grabbed myself a copy of the Mac version of this app. Just yesterday evening as I was waiting for my personal assistant, I thought I'd use the app to transcribe a song from one of the albums in my music library. Apparently I didn't wait quite long enough for the transcription to finish, because only part of the lyrics showed up. I suppose this was just as well though, since my personal assistant arrived as I was doing it. But the app did a great job, and as a bonus the 2 of us got to listen to the song again. The only drawback for me thus far is that the app is a system hog, but now I think I know the reason for that.