Growing India News, world news, nation news, our news, people's news, grow news, entertainment, fashion, movies, tech, automobile and many more..
Monday, February 20, 2023
Show HN: Whisper.cpp and YAKE to Analyse Voice Reflections [iOS] https://ift.tt/cX5qgOI
Show HN: Whisper.cpp and YAKE to Analyse Voice Reflections [iOS] Six months ago, I went full-time indie, but I haven't released anything so far. The products just never felt good enough for me to publicly say this is what I'm doing now. To get out of this mindset, I decided to make an app for myself in a week, add monetization, release it and move on. The app idea was simple: Reflect on your day by answering the same four questions out loud. The answers are transcribed and with regular use you can see what influences you the most and take action. All on-device, as otherwise I wouldn't feel comfortable sharing my thoughts. I had all core features working within a day by simply modifying an existing example app. However I was dissatisfied with iOS's built-in offline transcription due to a lack of punctuation and the speech recognition permission prompt that made it seem like data would leave the device. Decided to use whisper.cpp [0] (small model) instead. This change, lead to many others, as I now felt too little of the app's code was mine. e.g.: - Added automatic mood analysis. First using sentiment analysis, then changed to a statistical approach - Show trends: First implemented TextRank to provide a summary for an individual day, then changed it to extract keywords to spot trends over weeks and months. Replaced TextRank with KeyBERT for speed and n-grams, then BERT-SQuAD, and ended on a modified YAKE [1] for subjectively better results. (Do you know of a better approach?) As a result, this tiny app took me over a month, but it still has its flaws: - Transcription is not live but performed on recordings, so if you immediately want the transcript of your most recent answer, you have to wait. - Mood and keyphrase extraction are optimized for my languages and way of speaking, so they might not generalize well. - Music in the background can result in nearly empty transcripts. Nevertheless, after using the app regularly and enjoying it, I feel ready to release. Hope you will find the app useful too. [0] Show HN: Whisper.cpp https://ift.tt/K9urjOY [1] YAKE: https://ift.tt/1Mg4eFn https://ift.tt/4FlPIcK February 20, 2023 at 08:38PM
Subscribe to:
Post Comments (Atom)
Show HN: Pocket2Linkding – Migrate from Mozilla Pocket to Linkding https://ift.tt/IwYJfju
Show HN: Pocket2Linkding – Migrate from Mozilla Pocket to Linkding With the Mozilla Pocket shutdown coming up in about two weeks, I thought ...
-
Show HN: An AI logo generator that can also generate SVG logos Hey everyone, I've spent the past 2 weeks building an AI logo generator, ...
-
Show HN: Snap Scope – Visualize Lens Focal Length Distribution from EXIF Data https://ift.tt/yrqHZtDShow HN: Snap Scope – Visualize Lens Focal Length Distribution from EXIF Data Hey HN, I built this tool because I wanted to understand which...
-
Show HN: Federated IndieAuth Server implemented as a notebook https://ift.tt/32IC633 April 27, 2021 at 04:37PM
No comments:
Post a Comment