AssemblyAI
AssemblyAI
  • 313
  • 11 145 615
How to use @postman to test LLMs with audio data (Transcribe and Understand)
🔑 Get an AssemblyAI API Key: www.assemblyai.com/?
When starting to learn a new API, things can get messy. One way to tackle the first few days of confusion is to use a tool like Postman until you understand how to send requests to an API and how to parse the response you get.
In this video, we will learn how to transcribe audio and video files using AssemblyAI and also how to use LeMUR, AssemblyAI's framework for using Large Language Models on spoken data without having to code at all.
🧑‍💻 AssemblyAI Documentation: www.assemblyai.com/docs/?
▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬
🖥️ Website: www.assemblyai.com
🐦 Twitter: AssemblyAI
🦾 Discord: discord.gg/Cd8MyVJAXd
▶️ Subscribe: ua-cam.com/users/AssemblyAI
🔥 We're hiring! Check our open roles: www.assemblyai.com/careers
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
00:00 Introduction
00:57 Postman interface
01:43 Transcribing a file from a URL
06:39 Transcribing a local file
09:11 Getting speaker labels
11:44 LeMUR for creating action items
15:04 LeMUR for asking questions about the audio
20:20 Take a look at AssemblyAI Docs!
#MachineLearning #DeepLearning
Переглядів: 889

Відео

Build A Talking AI with LLAMA 3 (Python tutorial)
Переглядів 4,5 тис.День тому
🔑 Get your AssemblyAI API key here: www.assemblyai.com/? Code Repo: github.com/smithakolan/AssemblyAI-Applications/tree/main/real-time/Ollama-Voice-Bot Learn to build a talking AI! This tutorial covers real-time transcription with AssemblyAI, using LLAMA 3 as the language model with Ollama, and ElevenLabs for text-to-speech. Timestamps: 00:00 - Demo 00:17 - How we will build a talking AI with L...
How to Build a Better User Experience with Customizable Real-Time Speech-to-Text
Переглядів 1,1 тис.День тому
🔑 Get an AssemblyAI API Key: www.assemblyai.com/? Voice bots, automated phone calls, and simultaneous transcriptions: many applications use real-time transcription. With the latest innovations, the transcription speed is faster than ever. One practical caveat of using real-time transcription is the question of when the program should stop listening to the speaker and return the final edited tra...
🚀 Master Python & Zoom API | Build a Server-to-Server App That Transcribes Recordings
Переглядів 1,3 тис.14 днів тому
🔑 Get an AssemblyAI API Key: www.assemblyai.com/dashboard/signup? 🧑‍💻 GitHub repo: github.com/AssemblyAI-Examples/assemblyai-zoom-transcripts 📃 Blog post: www.assemblyai.com/blog/zoom-transcription-zoom-api/? 🟦 Zoom plans: zoom.us/pricing Learn how to use Zoom's API with Python in this step-by-step guide! In this tutorial, you'll learn how to create a robust server-to-server OAuth application t...
Build an AI Lecture Assistant with Python | Full tutorial
Переглядів 1,9 тис.21 день тому
In this tutorial we'll learn how to create an application that summarizes your lectures, and lets you ask questions about the lecture with responses that relate specifically to the lecture's content. We'll use Python and Streamlit to build the application - You can demo the app here: lemur-lecture-summarizer.streamlit.app/ Relevant links: 1. Get an AssemblyAI API Key: www.assemblyai.com/dashboa...
Speech Recognition In Java | Convert Speech To Text
Переглядів 87428 днів тому
🔑 Get your AssemblyAI API key here: www.assemblyai.com/? Java Speech-to-text documentation: www.assemblyai.com/docs/getting-started/transcribe-an-audio-file/? Discover the revolutionary capabilities of AssemblyAI's latest speech recognition models, Universal-1 and Nano, in this in-depth tutorial. Our newly introduced Universal-1 model offers state-of-the-art accuracy for transcribing speech to ...
Automatically generate timestamps for videos with Python
Переглядів 856Місяць тому
Video sections, like UA-cam's "Chapters" feature, are useful for many reasons, such as meeting reviews, media consumption, and sales coaching. In this video, we'll learn how to automatically determine video sections using Python. We'll use Artificial Intelligence to automatically segment a video into semantically-isolated sections, and then modify the results with an LLM to get timestamped, aut...
Unmatched Accuracy and Lightning Speed in Python for Speech Recognition
Переглядів 1,9 тис.Місяць тому
Get your AssemblyAI API key for this tutorial: www.assemblyai.com/? AssemblyAI is building the best API platform for developers to transform and understand voice data with AI so that you can build amazing new products and services for the world to use. What is possible with AssemblyAI? You can transcribe an audio file, get speaker labels, get a list of topics discussed, the sentiment of sentenc...
Automatically extract phone call insights with LLMs and Python | Full tutorial
Переглядів 1,9 тис.Місяць тому
Extracting phone call insights is useful for many applications, like quality assurance, call centers, sales coaching, and more. In this tutorial, we'll learn how to extract phone calls automatically with LLMs and Python. We'll use the framework LeMUR to automatically summarize a phone inquiry at a home building company, as well as extract action items to follow up on and contact information to ...
This new model is transforming Speech AI: Accurate, Fast, Cost-Effective
Переглядів 14 тис.Місяць тому
Start using Universal-1 today: www.assemblyai.com/? AssemblyAI just launched Universal-1, our most capable and highly trained speech recognition model. Trained on over 12.5 million hours of multilingual audio data, Universal-1 achieves best-in-class speech-to-text accuracy, reduces word error rate and hallucinations, improves timestamp estimation, and helps us continue to raise the bar as the i...
How to Build a RAG Application for Multi-Speaker Audio Data
Переглядів 3 тис.Місяць тому
Get AssemblyAI API key for this tutorial: www.assemblyai.com/? LLMs work wonders on text data but if you want to use audio or video files instead of text, things get a bit trickier. An easy solution is to transcribe the audio or video files. This would work but you will lose valuable information, especially in multi-speaker situations, like how many people were speaking and who said what. In th...
Best AI Tools for Content Creation in 2024: Automate Repetitive Work
Переглядів 2,5 тис.2 місяці тому
Get a Free AssemblyAI API key today to get started with Speech AI: www.assemblyai.com/? New AI-powered tools are coming out every week if not every day. In this video, we compiled our favorite content creation tools with AI features for you. With these AI tools (or creator platforms), you can plan, record, edit and publish your content with ease. They could help you write your script, remove th...
4 LLM frameworks to build AI apps with voice data
Переглядів 2,7 тис.2 місяці тому
4 LLM frameworks to build AI apps with voice data
Coding an AI Voice Bot from Scratch: Real-Time Conversation with Python
Переглядів 13 тис.2 місяці тому
Coding an AI Voice Bot from Scratch: Real-Time Conversation with Python
Transcribe a live phone call with Python - Flask tutorial
Переглядів 4,6 тис.2 місяці тому
Transcribe a live phone call with Python - Flask tutorial
How Graph Neural Networks Are Transforming Industries
Переглядів 8 тис.3 місяці тому
How Graph Neural Networks Are Transforming Industries
How to Index Podcasts with Keywords like on Huberman's Website
Переглядів 1,8 тис.3 місяці тому
How to Index Podcasts with Keywords like on Huberman's Website
The Physics of Generative AI - How AI models use physics to generate novel data
Переглядів 17 тис.3 місяці тому
The Physics of Generative AI - How AI models use physics to generate novel data
Live Speech-to-Text With Google Docs Using LLMs (Python Tutorial)
Переглядів 6 тис.3 місяці тому
Live Speech-to-Text With Google Docs Using LLMs (Python Tutorial)
No-Code, No Problem: Create Speech-to-Text Apps with Minimal or No Coding
Переглядів 2,8 тис.4 місяці тому
No-Code, No Problem: Create Speech-to-Text Apps with Minimal or No Coding
The Emergent Abilities of LLMs - why LLMs are so useful
Переглядів 4,5 тис.4 місяці тому
The Emergent Abilities of LLMs - why LLMs are so useful
2024's AI Essentials: 10 Must-Know AI Terms from 2023 Explained in 5 Minutes! 🚀🌟
Переглядів 6 тис.5 місяців тому
2024's AI Essentials: 10 Must-Know AI Terms from 2023 Explained in 5 Minutes! 🚀🌟
Convert Speech to Text In Java (Basic Tutorial)
Переглядів 3,8 тис.5 місяців тому
Convert Speech to Text In Java (Basic Tutorial)
Build AI App Prototypes Visually with No-Code (Open-source)
Переглядів 14 тис.5 місяців тому
Build AI App Prototypes Visually with No-Code (Open-source)
How do Multimodal AI models work? Simple explanation
Переглядів 19 тис.5 місяців тому
How do Multimodal AI models work? Simple explanation
Convert Hindi Speech to Text (Python Tutorial)
Переглядів 4,8 тис.5 місяців тому
Convert Hindi Speech to Text (Python Tutorial)
Run LLMs locally - 5 Must-Know Frameworks!
Переглядів 15 тис.5 місяців тому
Run LLMs locally - 5 Must-Know Frameworks!
Analyze a Conversation with AI for Free on the Playground
Переглядів 156 тис.5 місяців тому
Analyze a Conversation with AI for Free on the Playground
How to Convert Speech to Text in JavaScript using AssemblyAI's Node.js SDK
Переглядів 2,8 тис.6 місяців тому
How to Convert Speech to Text in JavaScript using AssemblyAI's Node.js SDK
🤯 OpenAI Assistants API Python (Full Tutorial)
Переглядів 62 тис.6 місяців тому
🤯 OpenAI Assistants API Python (Full Tutorial)

КОМЕНТАРІ

  • @okefejoseph6825
    @okefejoseph6825 11 годин тому

    Is there a way I can get the slide?

  • @gasserghareeb8915
    @gasserghareeb8915 12 годин тому

    Not working

  • @benrubinic1716
    @benrubinic1716 15 годин тому

    Such a great and easy tutorial! thanks!

  • @michelebersani7294
    @michelebersani7294 15 годин тому

    Good morning, this playlist is amazing and I was searching it for several weeks. I have a question about the interpretection of the eigenvectors. Why do the eigenvectors, of the covariance matrix, point in the direction of maximum variance?

  • @yavarjn2055
    @yavarjn2055 18 годин тому

    Reinforcement learning is not supervised so why do they need lot of data?

  • @sallyisabel
    @sallyisabel День тому

    I loved your face when you’re smile talking 😊 not to mention your pronunciation of “batch”

  • @jeevanjaison9646
    @jeevanjaison9646 День тому

    The assembly ai api is not free.

  • @MDEdwardsCreative
    @MDEdwardsCreative День тому

    Some of us noobs have no idea what program that is your are working in from the get go....

  • @alex-stalker
    @alex-stalker День тому

    Great!

  • @FaisalKhrisan
    @FaisalKhrisan День тому

    But I still have problems it says that [from elevenlabs import generate, stream ImportError: cannot import name 'generate' from 'elevenlabs'] how come

  • @rkop737
    @rkop737 2 дні тому

    If you do it like you did it, you will need to install tf_keras as well.

  • @Threecommaaclub
    @Threecommaaclub 2 дні тому

    When running the code i receive the following message in the terminal "`ALSA lib confmic.c.:160, i think this may be a warning message but i want to surpress them, how would i go about doing that?

  • @Anorch-oy9jk
    @Anorch-oy9jk 2 дні тому

    you can use the source code of the interfaces and wrappers and build your owns.

  • @user-dr7nr5ee2h
    @user-dr7nr5ee2h 2 дні тому

    1000 epochs : |

  • @alexcrdst
    @alexcrdst 3 дні тому

    This is a great course! Thanks a lot! I have one question though: is it right that the test-data comes from the same data-set but loaded again? So the test data has already been seen by the model? Wouldn't it be better if we split up the dataset into a training and test subset?

  • @muhammadabubakarsaddique3216

    Awesome!!! Highly recommended!! I usually work with TF most of the time. But due to some research work i have to learn PyTorch!! This tutorial is like getting Big Picture idea of coding with PyTorch!! Bravo!!

  • @991122bc
    @991122bc 4 дні тому

    Very good explanation, learn something, enjoy your tutorial 😂

  • @mahdighribi4151
    @mahdighribi4151 4 дні тому

    Best explanation , Thank you

  • @harshmeena4
    @harshmeena4 4 дні тому

    *Can we change voices!*

  • @Khuzaima985
    @Khuzaima985 5 днів тому

    but avalabe on where

  • @user-qp1jq3eh3e
    @user-qp1jq3eh3e 5 днів тому

    I am very api to have found this

  • @TheRedbullforever
    @TheRedbullforever 5 днів тому

    Hello :)

  • @mehdismaeili3743
    @mehdismaeili3743 6 днів тому

    Excellent .

  • @LouisDuran
    @LouisDuran 6 днів тому

    How might this code change if I were to use the Gini Index instead of Entropy to decide on splits? Does that make sense?

  • @riegaldutoit5792
    @riegaldutoit5792 6 днів тому

    what python editor are you usng??

  • @riegaldutoit5792
    @riegaldutoit5792 6 днів тому

    where and how to get to the page where you say"pip install" people dont know this!!?

    • @dadolab2314
      @dadolab2314 3 дні тому

      its the terminal, you can see it by clicking view on top of your IDE and and then open the terminal

  • @zishaansayyed2092
    @zishaansayyed2092 6 днів тому

    Whats her name?

  • @ayhanardal
    @ayhanardal 6 днів тому

    how can find presentation file.

  • @mehdismaeili3743
    @mehdismaeili3743 6 днів тому

    Do you have a free or paid API to convert text to sound or to dub video?

  • @mehdismaeili3743
    @mehdismaeili3743 6 днів тому

    Do you have a free or paid API to convert text to sound or to dub video?

  • @a.mo7a
    @a.mo7a 6 днів тому

    How is this different from just using the relu function?

  • @urekmazino1327
    @urekmazino1327 7 днів тому

    why are you saying fro. scratch if you're only using api

  • @urekmazino1327
    @urekmazino1327 7 днів тому

    any way to make one with adam voice like the one in elevenlabs?😊

  • @rabidrabbitmeow
    @rabidrabbitmeow 7 днів тому

    Any idea how to deal with this? OSError: [Errno -9996] Invalid input device (no default output device)

  • @tinafernandez4138
    @tinafernandez4138 7 днів тому

    "Promosm"

  • @abhishekhgupta6302
    @abhishekhgupta6302 7 днів тому

    I love you mam

  • @naveenalla3000
    @naveenalla3000 8 днів тому

    wonderful project

  • @fiorellademedina8419
    @fiorellademedina8419 8 днів тому

    hello, how you can know the version of llama 3 with ollama? Is it 8B or 70B.?

    • @AssemblyAI
      @AssemblyAI 8 днів тому

      The default 'llama3' model is 8B with ollama. If you want to call the 70B model you need to specifiy 'llama3:70b'. Check our their naming conventions here: github.com/ollama/ollama/blob/main/docs/api.md#conventions

  • @theghostyced
    @theghostyced 8 днів тому

    how would you handle interruptions while the ai is talking?

  • @user-mg7qw3zr1p
    @user-mg7qw3zr1p 9 днів тому

    how learining the english

  • @AssemblyAI
    @AssemblyAI 9 днів тому

    🚀MEGA UPDATE 🚀 We've launched Universal-1, our most powerful and accurate multilingual speech-to-text model to date-trained on 12.5M hours of multilingual audio data. www.assemblyai.com/blog/announcing-universal-1-speech-recognition-model/?

  • @ilikegeorgiabutiveonlybeen6705
    @ilikegeorgiabutiveonlybeen6705 9 днів тому

    good video

  • @ilikegeorgiabutiveonlybeen6705
    @ilikegeorgiabutiveonlybeen6705 9 днів тому

    yeah nitty gritty indexing options overview and use cases for said options would be higlhy appreciated

  • @cheybrown2076
    @cheybrown2076 9 днів тому

    Amazing video 🎉. Was portaudio the library used to listen and capture the audio?

  • @martinnoah9716
    @martinnoah9716 9 днів тому

    Is there a benefit to using AssemblyAI to do audio transcriptions over native Zoom transcriptions?

    • @AssemblyAI
      @AssemblyAI 9 днів тому

      Hey there! The short answer is that the transcriptions will likely be more accurate - we've just released our Universal-1 model that attains top-level performance on application-relevant audio domains. You can learn more here: www.assemblyai.com/blog/announcing-universal-1-speech-recognition-model/ Beyond just this, using AssemblyAI also allows you to perform other operations on your meetings like summarization, PII redaction, or prompting via LLMs!

  • @user-ut7pc2di4c
    @user-ut7pc2di4c 9 днів тому

    Chat GTP

  • @thebackpainmiracle
    @thebackpainmiracle 9 днів тому

    Exactly what I was intending on making. Thanks!

  • @PalashDandge
    @PalashDandge 9 днів тому

    i am getting error "Cannot find reference 'generate' in '__init__.py' " on from elevenlabs import generate, stream line can you please help me to resolve this issue

    • @user-po9ru7dl9j
      @user-po9ru7dl9j 7 днів тому

      yes same error, did you find a solution to it mate?

  • @ojasvisingh786
    @ojasvisingh786 10 днів тому

    👏👏

  • @marketfinds
    @marketfinds 10 днів тому

    Sorry, newbie, what is the 'source' command you use at the start? You also jumped from step 1 to 6. ie when I run: The term 'ollama' is not recognized as the name of a cmdlet Thanks

    • @AssemblyAI
      @AssemblyAI 9 днів тому

      The 'source' command is used when activating the python virtual environment. As for ollama you might not have installed it. So run 'pip install ollama' before using it. And additionally, you need to download it here: ollama.com