[HN Gopher] Show HN: Summary Cat, a YouTube Video Summary Generator ___________________________________________________________________ Show HN: Summary Cat, a YouTube Video Summary Generator Hello HN! Please check out Summary Cat (https://www.summarycat.com). It uses OpenAI's GPT-3.5 to summarize YouTube transcripts. Please note that it only works for - *English* videos. - videos that are not too long in length. I'd appreciate any feedbacks, criticisms, or feature requests!! You can also find my contact info in my profile. Thank you in advance. ------------Technical Details--------------- Tech Stack - Frontend: HTML/CSS - Backend: Python/Flask APIs: - For grabbing YouTube's transcripts: I used youtube-transcript-api (https://pypi.org/project/youtube-transcript-api/) - For summarizing the transcripts: I used OpenAI's GPT-3.5-turbo-16k: https://platform.openai.com/docs/guides/gpt. - I used GPT-3.5 because GPT-4 is quite a lot more expensive (roughly 10X). My Prompt (Super Simple!) - "please summarize the following text into a few paragraphs:" + the full transcript. Thoughts about GPT-4 vs GPT-3.5-Turbo-16k for Summary Cat - GPT-4 was 20% better for "summary quality" - GPT-4 feels 50% faster - However, GPT-4 is about 10X as expensive as GPT-3.5 - Winner: GPT-3.5-Turbo-16k Author : bing_dai Score : 31 points Date : 2023-09-22 20:24 UTC (2 hours ago) (HTM) web link (www.summarycat.com) (TXT) w3m dump (www.summarycat.com) | iamflimflam1 wrote: | Not bad - fed in one of my videos and it's surprisingly readable. | Are you using a particular prompt? Would you be willing to share | it? | bing_dai wrote: | More than happy to share! | | ----------- My prompt is super simple. It is "please summarize | the following text into a few paragraphs:" + the full | transcript. ----------- | | Seriously that's it! | StevenNunez wrote: | Oh boy, wait until the "Prompt Engineers" get a hold of this | one. | bing_dai wrote: | Sorry, could you please explain what you mean? I am not | really quite getting it. What might happen if a Prompt | Engineer get a hold of it? | varispeed wrote: | The off centre spinning wheel bothers me too much. | | That's so evil. Please correct the alignment! | [deleted] | corobo wrote: | Just used this to clear out my watch later list without having to | watch anything. Nice! | | Only note I have at this time is that it seemed to time out or | hang or something on a long video (>2h) -- I'm guessing that | there might be limitations to how much transcript you can chuck | into GPT, it might be worth throwing an error of some sort in | that scenario rather than the forever load | | E: Seen you've asked for an example to the other person | mentioning this. In my case it was this video | https://www.youtube.com/watch?v=hFL6qRIJZ_Y | bing_dai wrote: | I think your suspicion might be correct: long videos exceeds | GPT token limitation (16,385 tokens in my case of | GPT-3.5-turbo-16k). | | Thanks for your suggestion about how to address it. | pvg wrote: | 'Clean up your watch later list" is a neat use case that | might be worth supporting directly in some way. | ChrisArchitect wrote: | It should always append the message "This could have been a blog | post" to everything it summarizes. | toomuchtodo wrote: | Is GPT-4 performance better enough paying would be worth it? | | Edit: Thank you! | bing_dai wrote: | Hello, GPT-4 is not worth it in my experience so far! | | I would say, GPT-4 is - 20% better at "summary quality" - feels | 50% faster - BUT, 10X as expensive. | | So using GPT-3.5 was the right choice for me at this point. | ChrisArchitect wrote: | Totally missed what this was supposed to do and tried to get a | summary of a video discussing some music with captions. Got back | garbage. Thought it might process the text from the frames. | Shrug. Good idea for the use case you intended tho! | [deleted] | twayt wrote: | Also try www.askYouTube.ai for q&a across multiple videos! | mac_cain13 wrote: | Pretty nice! Very useful idea, especially for videos on my | watchlist I never get to because I feel they're too long. | | Would love if I could ask follow up questions. Would be awesome | to ask "Is X also explained?" and get a little summary back with | the timestamp so I can jump to that point in the video. | | Also it feels a bit slow and doesn't really give feedback whether | it's making progress. That would be a good UX improvement. | joelkesler wrote: | Nice! | | For those interested in comparing, https://www.summarize.tech/ | also builds summaries from YouTube videos but includes an | overview, then a summary of each 5 min segment | [deleted] | TuringNYC wrote: | Holy wow, this is FAST. I wonder if both videos I used were | cached. How do they do it so fast? | SCUSKU wrote: | Plugged in this meme video and it gave me the "As a AI I | can't...": https://www.youtube.com/watch?v=NlZzftmtGJY | | Are you using celery for your async workers? Cool project! | johntiger1 wrote: | Interesting, very cool! | | However, how does it do on videos where there's not a lot of | speaking? Any plans to do _actual_ video (image) processing? | bing_dai wrote: | Thanks for the question... Any sample videos you are thinking | of? | | Summary Cat doesn't work for videos where there's not a lot of | speaking. I am hoping to build a bit more on text-rich videos | first, so I do not plan to do actual video (image) processing | any time soon. | | I will keep that in mind!! | johntiger1 wrote: | I am thinking about music videos, where the lyrics don't | describe the video necessarily. For instance, I am curious | about what exactly is the story in this music video: | https://www.youtube.com/watch?v=pruKV1chnHA&ab_channel | flir wrote: | It's hanging for everything I try. | | I suggest a progress bar rather than a spinny thingy. Give the | user some sense that a conclusion is on the horizon. | | From my own experiments, I think you'll get better summaries with | a prompt like "This is a transcription of a youtube video. Please | etc etc etc". Context seems to help. | TuringNYC wrote: | I tried to do something similar, but I could only get transcripts | for videos with transcript files attached, which isnt a huge | number of videos. How did you get around this? | bing_dai wrote: | Hi, I used this Python library | (https://pypi.org/project/youtube-transcript-api/) to get | transcripts. It works great. | crtasm wrote: | Giving it either a long (2.5hr) video, or a non-youtube URL (e.g. | an invidious link) appears to leave it spinning forever - no | error message. | bing_dai wrote: | If you don't mind, could you share an example of such long | video? I'd love to debug it. | | As for non-Youtube URL: I indeed do not handle that error right | now :D. Thanks so much for bringing it up! | crtasm wrote: | This is the 2.5hr video i tried: | https://www.youtube.com/watch?v=JGIGA8taN-M | | This 1hr video works (but I note you don't seem to be caching | the output?) https://www.youtube.com/watch?v=0s9fpFPAC94 | TuringNYC wrote: | This <1hr video seemed to crash the system: | https://www.youtube.com/watch?v=VV949D8AUKU | crtasm wrote: | That one works for me. | Racing0461 wrote: | Seems to be an arms race between youtube forcing creators to make | videos 8 mins long min to be able to get mid roll ads and people | coming up with ways to summarize the transcript. | | Idea for the future: Use the summarize to re-cut the videos to | the most important parts. Like a super to the point tiktok style | video that is nothing but dopamine being injected into your | veins. There seems to already be "auto podcast clipper ai agents" | out there but nothing for consumers to use. those are more video | editor adjacent. If anyone wants to work on something like this, | lemme know. | bing_dai wrote: | I agree that this is happening " an arms race between youtube | forcing creators to make videos 8 mins long min to be able to | get mid roll ads and people coming up with ways to summarize | the transcript." Along the same line: I have been thinking | about how my Summary Cat might mean for the content creators. | How would it impact their income? | | Your "use the summarizer to re-cut the videos" is fantastic! | [deleted] | vitalsea wrote: | How many tokens do you allow per session? I've been thinking | about creating a similar app, but I'm a little bit concerned | about the unintended costs. | bing_dai wrote: | Hello! Thanks for the question. I do not myself restrict | tokens/session. The model I am using GPT-3.5-Turbo-16k | (https://platform.openai.com/docs/models/gpt-3-5), allows max | 16,385 tokens in total per input/output. | | So far, I found that each ~10 minute video uses around 1000 | tokens. It costs me about 3 cents to summarize, which is not | too bad as I don't have many users, and users haven't been | requesting summaries for super long videos yet. | | If this sites gets a lot of interest, I might start restricting | something :).. | | GPT 3.5 Pricing: https://openai.com/pricing | zthomas wrote: | Looks great, it gave a quick response. Are you putting the whole | transcript in context? Have you encountered issues with | transcripts that are too large? | kawsper wrote: | I tested it with two videos, the first one it does the summary | quite well: https://youtu.be/Cy-NgpRN1FU, I love how it mentions | the dogs name is Ernie, that made me smile :) | | But in the second video | https://www.youtube.com/watch?v=NBFyvOV7fz8 the app keeps | mentioning things like: "The text discusses...", but the content | is not a text, it's a video. | | Really cool app, it's really quick too! | codetrotter wrote: | > the content is not a text, it's a video | | To be fair, OP did say that they summarise the YouTube | transcript. So OpenAI GPT receives text. | | But if OP didn't do so already maybe they could start the | OpenAI system prompt with something like "you are summarising | transcripts of YouTube videos" and possibly it could help to | make the summary refer to the material as video. | zthomas wrote: | Would be nice to add a textarea to give it more specific | instructions or to change the summarization prompt. | epalzm wrote: | It hang on non - english video. I tried this one: | https://youtu.be/B4kRwlHTcLM?si=3kp3pvQ4M4l6eRTT Otherwise, | brilliant | badrequest wrote: | It literally says in the original post that it only works for | English videos. | bing_dai wrote: | You are right. The site doesn't handle non-English videos, but | that is on the roadmap. Thanks so much! ___________________________________________________________________ (page generated 2023-09-22 23:00 UTC)