Episodes

  • Understanding BERT: Bidirectional Encoder Representations from Transformers
    2024/12/20


    In this episode, we dive into BERT, a breakthrough model that's reshaping how machines understand language. Short for Bidirectional Encoder Representations from Transformers, BERT uses a clever technique to learn from text in both directions simultaneously, enabling unmatched performance on tasks like answering questions and language inference. With state-of-the-art results on 11 benchmarks, BERT has set a new standard for natural language processing. Tune in to learn how this simple yet powerful model works and why it’s a game-changer in AI!
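    The "clever technique" is masked language modeling: hide a fraction of the input tokens and train the model to recover them from context on both sides. Below is a minimal sketch of just the masking step, using the paper's 15% masking rate; the token names and the fixed seed are illustrative, and the paper additionally mixes in random/unchanged replacements, which this sketch omits.

```python
import random

def mask_tokens(tokens, mask_prob=0.15, mask_token="[MASK]", seed=1):
    """Replace ~15% of tokens with [MASK]; the model is trained to
    predict the originals using context from BOTH directions."""
    rng = random.Random(seed)
    masked, targets = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(mask_token)
            targets.append(tok)    # label the model must recover
        else:
            masked.append(tok)
            targets.append(None)   # no loss at unmasked positions
    return masked, targets

sentence = "the cat sat on the mat".split()
masked, targets = mask_tokens(sentence)
```

    Because the blanks can fall anywhere, the model cannot rely on left-to-right context alone, which is what makes the representations bidirectional.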


    Link to research paper: https://drive.google.com/file/d/1EBTbfiIO0D8fnQsd4UIz2HN31K-6Qz-m/view


    Follow us on social media:

    LinkedIn: https://www.linkedin.com/company/smallest/

    Twitter: https://x.com/smallest_AI

    Instagram: https://www.instagram.com/smallest.ai/

    Discord: https://smallest.ai/discord

    5 min
  • What is GloVe?
    2024/12/19

    What makes word vectors so powerful in capturing meaning and structure? In this episode, we uncover the mystery behind their surprising regularities and introduce a groundbreaking model that redefines how we learn word representations. By blending the strengths of global and local methods, this innovative approach creates word vectors with rich substructures, achieving impressive results on word analogy, word similarity, and named entity recognition tasks. We’ll break down the key ideas and explain why this model is a leap forward for natural language understanding. Perfect for curious minds—no coding knowledge required.
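    Concretely, GloVe fits word vectors so that each pair's dot product tracks the log of how often the two words co-occur, with a weighting function that down-weights rare pairs and caps very frequent ones. A rough sketch of one term of that objective, using the paper's default x_max and alpha (the vectors and counts here are placeholders, not trained values):

```python
import math

def glove_weight(x, x_max=100.0, alpha=0.75):
    """f(X_ij): down-weights rare co-occurrences, caps frequent ones at 1."""
    return (x / x_max) ** alpha if x < x_max else 1.0

def pair_loss(w_i, w_j, b_i, b_j, x_ij):
    """One term of the GloVe objective:
    f(X_ij) * (w_i . w_j + b_i + b_j - log X_ij)^2"""
    dot = sum(a * b for a, b in zip(w_i, w_j))
    return glove_weight(x_ij) * (dot + b_i + b_j - math.log(x_ij)) ** 2
```

    Summing this loss over all nonzero entries of the co-occurrence matrix and minimizing it is what ties the "global" count statistics to the "local" vector geometry.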


    Link to research paper: https://www-nlp.stanford.edu/pubs/glove.pdf


    Follow us on social media:

    LinkedIn: https://www.linkedin.com/company/smallest/

    Twitter: https://x.com/smallest_AI

    Instagram: https://www.instagram.com/smallest.ai/

    Discord: https://smallest.ai/discord


    5 min
  • Adam: The Game-Changer Optimizer
    2024/12/18

    In this episode, we break down the science behind Adam, a powerful algorithm revolutionizing how machines learn. Designed for efficiency and flexibility, Adam handles noisy, sparse data and large-scale problems with ease. We'll explore how it adapts to shifting objectives, why it needs minimal tuning, and what makes it stand out from other optimization methods. Plus, we’ll touch on its sibling, AdaMax, and the clever math that makes these tools a favorite in AI research and applications. Whether you’re an expert or just curious, we’ll keep it simple and engaging!
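    For reference, here is the update Adam performs at each step, per the paper: exponential moving averages of the gradient and its square, a bias correction for the early steps, then a per-parameter scaled move. This is a scalar sketch with the paper's default hyperparameters, not a production optimizer:

```python
import math

def adam_step(theta, grad, m, v, t, lr=0.001,
              beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a single scalar parameter."""
    m = beta1 * m + (1 - beta1) * grad        # first moment (mean of gradients)
    v = beta2 * v + (1 - beta2) * grad ** 2   # second moment (uncentered variance)
    m_hat = m / (1 - beta1 ** t)              # bias correction (t starts at 1)
    v_hat = v / (1 - beta2 ** t)
    theta -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta, m, v

# Demo: minimize f(x) = x^2 (gradient 2x) starting from x = 5.0
theta, m, v = 5.0, 0.0, 0.0
for t in range(1, 2001):
    theta, m, v = adam_step(theta, 2.0 * theta, m, v, t)
```

    Dividing by the square root of the second moment is what gives each parameter its own adaptive step size, which is why Adam needs so little tuning across problems.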


    Link to research paper: https://arxiv.org/pdf/1412.6980


    Follow us on social media:

    LinkedIn: https://www.linkedin.com/company/smallest/

    Twitter: https://x.com/smallest_AI

    Instagram: https://www.instagram.com/smallest.ai/

    Discord: https://smallest.ai/discord

    4 min
  • Vector space demystified: Teaching AI to understand words
    2024/12/14

    In this episode, we will demystify a groundbreaking paper that revolutionized how machines understand language. The discussion explores how two new AI models create "word vectors" that help machines grasp word meanings and similarities. These models deliver state-of-the-art results in record time—learning from 1.6 billion words in under a day! Tune in to uncover how these innovations make AI faster and smarter at understanding language.
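    The famous property of these word vectors is that directions encode relations, so analogies become arithmetic: vector("king") − vector("man") + vector("woman") lands near vector("queen"). A toy illustration of the mechanics, with made-up 3-d vectors (real models learn hundreds of dimensions from text):

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = lambda x: math.sqrt(sum(a * a for a in x))
    return dot / (norm(u) * norm(v))

# Hypothetical embeddings, chosen only to make the analogy visible.
vec = {
    "king":  [0.8, 0.7, 0.1],
    "man":   [0.9, 0.1, 0.1],
    "woman": [0.9, 0.1, 0.8],
    "queen": [0.8, 0.7, 0.8],
    "apple": [0.1, 0.9, 0.2],
}

# king - man + woman, then find the nearest remaining word by cosine.
target = [k - m + w for k, m, w in zip(vec["king"], vec["man"], vec["woman"])]
answer = max((w for w in vec if w not in {"king", "man", "woman"}),
             key=lambda w: cosine(target, vec[w]))
```

    The paper's contribution was showing that such vectors can be trained fast enough (1.6 billion words in under a day) for these regularities to emerge at scale.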


    Link to research paper: https://www.khoury.northeastern.edu/home/vip/teach/DMcourse/4_TF_supervised/notes_slides/1301.3781.pdf


    Follow us on social media:

    LinkedIn: https://www.linkedin.com/company/smallest/

    Twitter: https://x.com/smallest_AI

    Instagram: https://www.instagram.com/smallest.ai/

    Discord: https://smallest.ai/discord



    5 min
  • Efficient Inference Unlocked: Stochastic Variational Learning for Complex Models
    2024/12/13

    Ever wondered how AI learns from massive datasets when the math gets too tricky? In this episode, we break down a groundbreaking paper that reimagines how we approach these challenges. Learn how clever techniques like stochastic variational inference make otherwise intractable inference problems tractable, and why this approach is a game-changer for modern AI research.
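    The paper's central device is the reparameterization trick: rather than sampling z directly from N(mu, sigma²), sample noise eps from N(0, 1) and compute z = mu + sigma·eps, so the randomness sits outside the parameters and gradients can flow through them. A scalar sketch of that step, plus the closed-form KL term the paper derives for Gaussian posteriors:

```python
import math
import random

def reparameterize(mu, log_var, seed=None):
    """z = mu + sigma * eps with eps ~ N(0, 1): mu and log_var stay
    differentiable because the sampling is pushed into eps."""
    eps = random.Random(seed).gauss(0.0, 1.0)
    return mu + math.exp(0.5 * log_var) * eps

def kl_term(mu, log_var):
    """KL( N(mu, sigma^2) || N(0, 1) ) in closed form:
    -0.5 * (1 + log sigma^2 - mu^2 - sigma^2)."""
    return -0.5 * (1.0 + log_var - mu ** 2 - math.exp(log_var))
```

    Training maximizes a reconstruction term minus this KL term; because the KL has a closed form, only the reconstruction needs sampling, which keeps the gradient estimates cheap and low-variance.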


    Link to research paper: https://arxiv.org/pdf/1312.6114


    Follow us on social media:

    LinkedIn: https://www.linkedin.com/company/smallest/

    Twitter: https://x.com/smallest_AI

    Instagram: https://www.instagram.com/smallest.ai/

    Discord: https://www.smallest.ai/discord


    5 min
  • Revolutionizing Machine Translation: Fast, Cheap, and Accurate Evaluations
    2024/12/12

    In this episode, we discuss a groundbreaking method for evaluating machine translations, like those produced by Google Translate. Traditional evaluations rely on skilled humans, take months, and cost a fortune. But what if there was a faster, cheaper, and reusable alternative? This paper introduces an automated, language-independent method that delivers results close to human judgment. Tune in as we unpack how it could transform the way we assess translations!
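    The metric in question, BLEU, scores a candidate translation by clipped n-gram precision against reference translations, times a brevity penalty that punishes overly short output. A stripped-down sketch of the idea (real BLEU uses n-grams up to 4, multiple references, and corpus-level statistics; this toy version handles one reference and n up to 2):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(candidate, reference, max_n=2):
    """Toy BLEU: geometric mean of clipped n-gram precisions
    times a brevity penalty."""
    log_prec = 0.0
    for n in range(1, max_n + 1):
        cand = Counter(ngrams(candidate, n))
        ref = Counter(ngrams(reference, n))
        clipped = sum(min(c, ref[g]) for g, c in cand.items())  # clip by ref counts
        total = max(sum(cand.values()), 1)
        log_prec += math.log(max(clipped, 1e-9) / total)
    bp = 1.0 if len(candidate) > len(reference) \
        else math.exp(1.0 - len(reference) / len(candidate))
    return bp * math.exp(log_prec / max_n)

score = bleu("the cat sat on the mat".split(), "the cat sat on the mat".split())
low = bleu("the the the".split(), "the cat sat".split())
```

    Clipping is the key trick: repeating a common word cannot inflate precision beyond how often it appears in the reference, which is what keeps the metric hard to game.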


    Link to research paper: https://dl.acm.org/doi/pdf/10.3115/1073083.1073135


    Follow us on social media:

    LinkedIn: https://www.linkedin.com/company/smallest/

    Twitter: https://x.com/smallest_AI

    Instagram: https://www.instagram.com/smallest.ai/

    Discord: https://smallest.ai/discord

    6 min
  • Unpacking time-series forecasting and LSTMs
    2024/12/11

    In this episode, we will explore how AI predicts future trends using Long Short-Term Memory (LSTM) networks. We will break down the LSTM architecture, explaining how its memory system works and why it is effective in time-series forecasting and natural language processing. Whether you're an AI enthusiast or just curious about forecasting, this episode simplifies complex ideas into an engaging discussion!
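    At the heart of the architecture is the LSTM cell: three sigmoid gates decide what to forget, what to write, and what to expose, while an additive cell state carries long-term memory. A single scalar step is enough to show the mechanics; the weights here are illustrative, not trained values, packed as (w_x, w_h, bias):

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_cell(x, h_prev, c_prev, W):
    """One scalar LSTM step. W maps gate name -> (w_x, w_h, bias)."""
    f = sigmoid(W["f"][0] * x + W["f"][1] * h_prev + W["f"][2])    # forget gate
    i = sigmoid(W["i"][0] * x + W["i"][1] * h_prev + W["i"][2])    # input gate
    o = sigmoid(W["o"][0] * x + W["o"][1] * h_prev + W["o"][2])    # output gate
    g = math.tanh(W["g"][0] * x + W["g"][1] * h_prev + W["g"][2])  # candidate
    c = f * c_prev + i * g      # cell state: long-term memory
    h = o * math.tanh(c)        # hidden state: short-term output
    return h, c

# Gates forced open/shut to show retention: forget ~1, input ~0,
# so the cell state survives the step unchanged.
W = {"f": (0.0, 0.0, 100.0), "i": (0.0, 0.0, -100.0),
     "o": (0.0, 0.0, 100.0), "g": (1.0, 0.0, 0.0)}
h, c = lstm_cell(1.0, 0.0, 0.5, W)
```

    Because the cell state is updated additively rather than through repeated multiplication, gradients can flow across many time steps, which is why LSTMs handle long-range dependencies that plain RNNs lose.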

    Link to research paper: https://arxiv.org/pdf/2105.06756

    Follow us on social media:

    LinkedIn: https://www.linkedin.com/company/smallest/

    Twitter: https://x.com/smallest_AI

    Instagram: https://www.instagram.com/smallest.ai/

    Discord: https://smallest.ai/discord


    5 min
  • Cracking Open GPT-2: How AI Learned to Master Language Without Explicit Training
    2024/12/10

    Welcome to today’s episode, where we dive into the groundbreaking paper behind GPT-2, the language model that changed how we think about AI in NLP tasks!

    Imagine a model that can answer questions, translate languages, summarize articles, and comprehend reading passages, all without being explicitly trained for any of these tasks. That’s what OpenAI’s GPT-2 accomplishes, thanks to its training on a massive dataset called WebText, which consists of text scraped from millions of webpages.

    This paper hints at a future where AI systems learn tasks just by observing how they’re naturally done in the real world, reducing the need for massive amounts of labeled data. It’s an exciting leap towards more general and flexible AI systems.
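    Under the hood, GPT-2 does exactly one thing, predict the next token, and tasks emerge from conditioning on a prompt. Here is a toy sketch of greedy autoregressive decoding, with a hypothetical bigram lookup table standing in for the 1.5B-parameter transformer (the table and tokens are invented for illustration):

```python
def generate(next_token, prompt, max_steps=10):
    """Greedy autoregressive decoding: repeatedly append the model's
    most likely next token. `next_token` is a stand-in for GPT-2,
    which conditions on the WHOLE prefix rather than just the last token."""
    tokens = list(prompt)
    for _ in range(max_steps):
        nxt = next_token.get(tokens[-1])
        if nxt is None:  # no continuation known: stop
            break
        tokens.append(nxt)
    return tokens

# Hypothetical "model": a bigram table instead of a trained transformer.
bigram = {"the": "cat", "cat": "sat", "sat": "down"}
out = generate(bigram, ["the"])
```

    Phrase a prompt as "translate to French: ...", and the same next-token loop produces a translation; that is the zero-shot behavior the paper documents.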

    Link to research paper: https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf

    Follow us on social media:

    LinkedIn: https://www.linkedin.com/company/smallest/

    Twitter: https://x.com/smallest_AI

    Instagram: https://www.instagram.com/smallest.ai/

    Discord: https://www.smallest.ai/discord

    5 min