Feed
Discussions

ASR

ASR

#asr

1 followers·7 articles

ASR

#asr·1 followers·7 articles

ASR

Changelog

New steps component and improved accessibility on Hashnode's blog and docs product.

Nov 05, 2024·

new

Trending Articles

STON.fi: The Next-Gen DEX Powering TON Blockchain DeFi

Tommy williams·18 reads

AEON Pay: From Crypto Wallets to Everyday Life

Prochino·13 reads

Empowering the Decentralized Future: How Endless Merges Web2 Ease with Web3 Sovereignty

Edimavouge·12 reads

Top commenters this week

Writing Challenges

#2Articles1Week Challenge

Become better at technical writing; accept Hashnode's writing challenge for four weeks.

#2Articles1Week Challenge

#WomenWhoTech

Share your story, achievements, or experiences as a woman, non-binary folk in tech or as a #WomenWhoTech ally!

#WomenWhoTech

Self Starter

Publish your first article on Hashnode and become a self starter!

Self Starter

Serial Blogger

Publish an article every day for 7 days and earn a cool serial blogger badge!

Serial Blogger

Talk of the town

Write a story that drives amazing engagement on Hashnode and become the talk of the town!

Talk of the town

Word Warrior

Write an in-depth article on your Hashnode blog that's more than 2000 words and become a word warrior!

Word Warrior

Buy Old Gmail Accounts

Buy Old Gmail Accounts

Sonu Goswami

Ariska Hidayat

Anik Sikder

Aet

Kaustubh Sharma

Kaustubh Sharma

kaustubhtech.hashnode.dev·Jul 25, 2025

Jul 25, 2025

Understanding Unexpected System Reboots

When Windows systems reboot unexpectedly, it can be challenging to determine the root cause. This newsletter provides comprehensive guidance on investigating these mysterious events using event logs, system files, and virtualization-specific tools. T...

Understanding Unexpected System Reboots

Discuss·3 reads

Maksim Panfilov

Maksim Panfilov

m.z3r.io·Apr 04, 2025

Apr 04, 2025

RTTM format specification and its application

Rich Transcription Time Marked (RTTM) is a widely used, text-based format for annotating audio and video, representing results of speech recognition, speaker diarization, and related metadata. Developed by NIST in the early 2000s, RTTM files consist ...

RTTM format specification and its application

Discuss·50 reads

Akriti Upadhyay

Akriti Upadhyay

akritiu.hashnode.dev·Jan 02, 2024

Jan 02, 2024

How to Make an Automatic Speech Recognition System with Wav2Vec 2.0 on E2E’s Cloud GPU Server

Introduction Creating an Automatic Speech Recognition (ASR) system using Wav2Vec 2.0 on E2E’s Cloud GPU server is a compelling endeavor that brings together cutting-edge technology and robust infrastructure. Leveraging the power of Wav2Vec 2.0, a sta...

How to Make an Automatic Speech Recognition System with Wav2Vec 2.0 on E2E’s Cloud GPU Server

Discuss·7 reads

Richard Thompson

Richard Thompson

richardmthompson.hashnode.dev·Oct 18, 2023

Oct 18, 2023

Searching for a Python-based Speech Recognition Engine (for CPU Inference)

To give my Ai learning a context to ground into, I'm writing a funny little app I've called VoxPlan (in Python) which allows you to organise goals and tasks in a hierarchical tree and display them in an interactive GUI. I'm very interested in explori...

Searching for a Python-based Speech Recognition Engine (for CPU Inference)

Discuss·15 reads

automatic-speech-recognition

Suvro Banerjee

ai-projects.hashnode.dev·Apr 15, 2023

Apr 15, 2023

OpenAI Whisper - a neural net for speech to text

Background With the development of unsupervised pre-training, exemplified by Wav2Vec 2.0 released in 2020, these models could learn directly from the raw audio without the need for human labels. So the raw training data could be scaled to 1 million h...

OpenAI Whisper - a neural net for speech to text

Discuss·10 likes·174 reads

Dave Horton

blog.jambonz.org·Mar 31, 2023

Mar 31, 2023

Tutorial: adding support for a custom speech provider

jambonz supports many speech providers out of the box, but what if you want to use a speech provider for that is not currently supported? There is where the jambonz custom speech API comes in. The custom speech api requires jambonz 0.8.2 or above I...

Tutorial: adding support for a custom speech provider

Discuss·1402 reads

Aadarsh Kannan

aadarshkannan.hashnode.dev·Jan 28, 2023

Jan 28, 2023

Transcribing Audio with OpenAI Whisper in One Post Request

Artificial Intelligence is currently applied in all fields. It's been wild and the development is tremendous. One of the interesting applications of AI is speech recognition. Automatic speech recognition (ASR) is a technology that allows machines to ...

Transcribing Audio with OpenAI Whisper in One Post Request

Discuss·110 reads