YouTube transcripts
This covers how to load youtube transcript into LangChain documents.
Setup
You'll need to install the youtube-transcript package and youtubei.js to extract metadata:
- npm
- Yarn
- pnpm
npm install @langchain/community @langchain/core youtube-transcript youtubei.js
yarn add @langchain/community @langchain/core youtube-transcript youtubei.js
pnpm add @langchain/community @langchain/core youtube-transcript youtubei.js
Usage
You need to specify a link to the video in the url
. You can also specify language
in ISO 639-1 and addVideoInfo
flag.
import { YoutubeLoader } from "@langchain/community/document_loaders/web/youtube";
const loader = YoutubeLoader.createFromUrl("https://youtu.be/bZQun8Y4L2A", {
language: "en",
addVideoInfo: true,
});
const docs = await loader.load();
console.log(docs);
API Reference:
- YoutubeLoader from
@langchain/community/document_loaders/web/youtube