[HN Gopher] New Universal Summarizer by Kagi ___________________________________________________________________ New Universal Summarizer by Kagi Author : simonebrunozzi Score : 150 points Date : 2023-02-03 19:13 UTC (3 hours ago) (HTM) web link (labs.kagi.com) (TXT) w3m dump (labs.kagi.com) | widowlark wrote: | This is incredibly well done. Congrats to the team at Kagi, I was | genuinely blown away by the speed and accuracy of the results. | amelius wrote: | Does it rely on OpenAI under the hood? | GaggiX wrote: | I am truly amazed at how fast and good it is at summarizing | videos, it is (or will be) definitely an amazing tool for the | Kagi search engine. I guess they have trained/finetuned a | specific model to summarize video transcripts, since the content | is delivered so differently. | imbnwa wrote: | Anybody else's first instinct was to get the summary of the | Christian Bible? Isn't able to complete the task in case you're | interested with the source[0] I submitted. | | [0]http://triggs.djvu.org/djvu- | editions.com/BIBLES/DRV/Download... | buserror wrote: | Oh, reminds me I had an email address @kagi.com, a long, long | time ago :-) | 13415 wrote: | I was wondering whether this is still the same company. I | remember getting cheques from Kagi and German bank employees | eying them suspiciously. | buserror wrote: | I know, for quite a while I got quite a bit of income from | them, good old day of Mac Sharewares :-) | thewebcount wrote: | I asked them one time and they said they have no relation to | Kee (who ran the shareware processing site), and just bought | the domain when it was available. No connection to the old | site. (I also used to get the occasional check from the old | Kagi!) | bilater wrote: | This is really cool. Do you have an API? :) | freediver wrote: | We do, and we thought about opening this as a part of other | APIs we already have. | | How would you price this? | bilater wrote: | Probably by usage similar to OpenAI since I assume your costs | are correlated (compute etc). | | You could do a hobby plan which is free up to a certain no. | of requests/tokens per day or hour so developers can start | building without any friction (I think this is important and | sort of expected as most beloved dev tools do it). You can | minimize your costs by offering this on shared resources so | inference time is a little slow and API can go down at times | when there is high usage but its free so users wont mind. | | And then have pro plans for higher usage / 99.9999 SLA / fast | inference etc (maybe a min subscription with a pay if you go | over per 100k tokens option...similar to Vercel with | bandwidth and serverless functions). | freediver wrote: | Cool, pricing per tokens processed makes sense, thanks for | brainstorming this. | minxomat wrote: | Would love to use an API, per token pricing is a good | approach (with use limits like OpenAI). If you need | testers, I have some use case (long form non-fiction | content). LMK at ml[at]summarity[dot]com | Terretta wrote: | I have a few thousand things to summarize, and am a paying | subscriber. What happens to my account if I automate your | page? | | I'm happy to pay. I think it's too much but I'd probably | unhappily pay $0.05 per summary when they're this good. I'd | rather pay $0.005 or less, then I'd plug it into KB tools | like Obsidian as well as our bookmarking / link logging | workflow firm-wide. | | Which reminds me, I want to be able to "gift" Kagi search to | all employees. I actively don't want them to have to use | Google. Is there a way for an automatic company @domain x per | seat billing, or should I use gifting? | | Finally, what would it take for kagi.com to be part of the | Apple One / iCloud+ offering? That needs to happen. | billbrown wrote: | This? https://blog.kagi.com/kagi-team-plan | ryanar wrote: | I grabbed a random article in my to read list and it didn't help | me understand the article. | | https://tasshin.com/blog/strategy-101/ | | Lots of words I was unfamiliar with because they were domain | specific terms and they were not defined | etra0 wrote: | I just tried it with a blogpost of my own that's written in a | _somewhat_ localized Spanish and it summarized it pretty well in | English, which is quite surprising. Good job team kagi! | xamdam wrote: | Impressed with their long document summarization, any ideas how | they do this? Seems beyond normal GPT limitations; either they | have a more powerful model (doubt it) or hacked around the | limitations? | | e.g. good summary for a very long text | https://labs.kagi.com/ai/sum?url=http://localroger.com/prime... | freediver wrote: | Thanks for noticing! (dev here) | | We have an in-house model we've been developing since 2019, | just for summarization of long documents in real time. We'll | try to find some time to blog about the high level design. | xamdam wrote: | I'd pay for an API to this | minxomat wrote: | I second this, API please. | btown wrote: | Third this! Firms serving the financial world, tracking | PR statements, etc. would likely pay high rates for this. | stavros wrote: | Is there an API for this? I'd love to make an Alexa skill | that reads out summarized Wikipedia articles for me. | bradneuberg wrote: | Looking forward to a technical writeup on what you all have | done, looks impressive! | mdp2021 wrote: | > _an in-house model we 've been developing since 2019_ | | We will be very interested in how you tackled the problem of | "understanding" the input text - to differentiate it from | current "fakeries of actual speakers". | | I.e. how you implemented the "intelligent" parts - or, which | simulation of actual intelligent processing (if any) it | contains. | mdp2021 wrote: | Well. | | > _Case is in a bar called the Chatsubo, a place for professional | expatriates. He is talking to the bartender, Ratz, who has a | prosthetic arm. Ratz and Case joke around and then a prostitute | who was sitting next to Case leaves. Suddenly, a drunken | Australian starts talking about how the Chinese invented nerve- | splicing. This causes Case to become bitter and he expresses his | feelings to his glass. This passage paints a vivid picture of the | atmosphere in the bar and the characters that inhabit it. The | interesting detail is that Ratz has a prosthetic arm, which is a | unique detail that adds to the atmosphere of the bar_ | | The results are interesting. The in-production use-case, where | the tech will be most reliable, not evident. | | Edit: | | It seems (also seeing the "Key moments" section) that elemental | attempts to understand the details (and structure) of the text | should output reliability values, and employ them with importance | towards the final output. | | Edit2: | | It seems from further tests that the "understanding" may be very | limited, and that the "trick" is more in terms of "trying to | identify salient parts and present them reformulated - without | attempting to understand them". | | Edit3: | | It seems that the attempt of identification of salient points is | confirmed as a main mechanism - but, in absence of understanding, | the re-formulation just degrades information. For example: the | original being, << _Bronze is the first metal that gets its own | age, which began around 3300 BCE in Mesopotamia. Other metals | were certainly in use before it -- especially copper -- but the | addition of a small amount of tin to existing copper technology | changed everything. Bronze was a step up in hardness, durability, | and resistance to corrosion [...]_ >>, the summary << _Bronze is | the first metal to have its own age, beginning around 3300 BCE in | Mesopotamia. It was a step up in hardness, durability, and | resistance to corrosion_ >> betrays faults in changing the | initial 'gets' to 'have', and in missing that those qualities of | bronze are there in comparison to copper. | | -- | | ...I see we have a sniper here, as so common: well, do not forget | to make your criticism explicit (assuming it will just take its | time to elaborate). Edit: still need more time? | smusamashah wrote: | It's very nice. Apparantly it doesn't follow pages of an HN | thread. It summarized this | https://news.ycombinator.com/item?id=34579175 to mostly about | staying warm. See | https://labs.kagi.com/ai/sum?url=https://news.ycombinator.co... | | This is how it summarized Yishan's thread on content moderation | https://labs.kagi.com/ai/sum?url=https://twitter.com/yishan/... | It obviously got confused when he side stepped into his CO2 | mantra middle of the thread. | | From these two links, it feels like this engine mostly focuses on | the beginnings of the URL, things that come later on a page are | not given the same priority when summarizing. | impalallama wrote: | > The Turner Diaries is a novel by Andrew Mac Donald (William | Pierce) that follows the story of a man named Turner, a member of | a revolutionary organization that is fighting against the | oppressive System. Turner and his comrades use acts of terror, | sabotage, and violence to fight against the System, and they | eventually succeed in overthrowing it. The novel follows Turner's | journey as he and his comrades struggle to survive and fight | against the System, and it also explores the themes of racism, | civil rights, and the power of the government. The novel | ultimately conveys the message that violence and terror can be | used to fight against oppressive systems, but that it should be | done with caution and with a clear understanding of the | consequences. | | Does an interesting and good job are presenting the problem with | summarization. You read this and you might think The Turner | Diaries where a Star Wars adventure story about fighting an evil | empire. | [deleted] | srhtftw wrote: | Here is the summary it gives for Walden: Conclusion | https://en.wikisource.org/wiki/Walden_(1893)_Thoreau/Chapter... | | _In Walden, Thoreau encourages readers to explore their own | inner depths and to strive for truth and simplicity. He argues | that material wealth is not as important as the wealth of the | mind, and that true happiness comes from within. He also | encourages readers to be independent and to take risks, and to | not be afraid to explore the unknown. He compares the courage of | a soldier to that of a footpad, and suggests that it is nobler to | explore one 's own inner world than to chase after material | possessions. He also suggests that it is better to be content | with what one has than to strive for superfluous wealth. He | concludes by encouraging readers to strive for truth and to not | be afraid to explore the unknown._ | | I think shows the limits of these kinds of statistical word | approaches. They aren't necessary "wrong" but they fail to give | any sense of the spirit of optimism in the original. I would have | been far more impressed if it just quoted the ending: | | _The life in us is like the water in the river. It may rise this | year higher than man has ever known it, and flood the parched | uplands; even this may be the eventful year, which will drown out | all our muskrats. It was not always dry land where we dwell. I | see far inland the banks which the stream anciently washed, | before science began to record its freshets. Every one has heard | the story which has gone the rounds of New England, of a strong | and beautiful bug which came out of the dry leaf of an old table | of apple-tree wood, which had stood in a farmer 's kitchen for | sixty years, first in Connecticut, and afterward in Massachusetts | -- from an egg deposited in the living tree many years earlier | still, as appeared by counting the annual layers beyond it; which | was heard gnawing out for several weeks, hatched perchance by the | heat of an urn. Who does not feel his faith in a resurrection and | immortality strengthened by hearing of this? Who knows what | beautiful and winged life, whose egg has been buried for ages | under many concentric layers of woodenness in the dead dry life | of society, deposited at first in the alburnum of the green and | living tree, which has been gradually converted into the | semblance of its well-seasoned tomb -- heard perchance gnawing | out now for years by the astonished family of man, as they sat | round the festive board -- may unexpectedly come forth from | amidst society's most trivial and handselled furniture, to enjoy | its perfect summer life at last!_ | | _I do not say that John or Jonathan will realize all this; but | such is the character of that morrow which mere lapse of time can | never make to dawn. The light which puts out our eyes is darkness | to us. Only that day dawns to which we are awake. There is more | day to dawn. The sun is but a morning star._ | yunwal wrote: | This is quite a bit better than all of the other attempts at | building AI summarizers I've seen. One thing that's really nice | about it is that it seems to pick out surprising or unique | information, rather than just trying to provide the most general | abstraction or summary of the article. My team has a journal club | where we go through ML papers, and I feel like this will be a | good tool to familiarize myself with academic papers before going | into the details. | nvr219 wrote: | I had it summarize a recipe for me. It did a good job, but | definitely editorialized. The URL I sent included just the recipe | for split pea soup, but the summary added that "Split pea soup is | a hearty and comforting dish that is easy to make" (which is | true, but wasn't in the original) and said that the soup "is sure | to be a hit with the whole family" (again, true, but wasn't in | the original). So I feel like "summary" isn't the right word | exactly. Still amazing. I tested it with a bunch of different | blog posts that I was very familiar with and the summary always | turned out accurate. | dakiol wrote: | Impressive. | | URL: https://qntm.org/responsibility | | Summary: | | > This story follows Tim and Diane, two scientists who have | successfully built a quantum computer. They use it to simulate a | universe, and eventually discover that they are in fact living in | a simulation themselves. They realize that any changes they make | to the simulation will be reflected in their own universe, and | must be careful not to do anything that could be harmful. They | also discover that the simulation is part of an infinite tower of | quantum computers, with their own universe being near the bottom. | This realization is both exciting and daunting, as they now have | the power to play God, but must be careful not to do anything | that could have disastrous consequences. | | Could it be using ChatGPT behind the curtains? | burkaman wrote: | This is much better than I expected, it's given a good answer for | everything I've tried so far. | | https://labs.kagi.com/ai/sum?url=https://timecube.2enp.com | | > Dr. Gene Ray is a scientist and philosopher who has discovered | the Time Cube, a mathematical proof that there are four | simultaneous 24 hour days in a single rotation of the Earth. He | believes that the academic and religious teaching of singularity | is boring and evil, and that it is destroying the analytical | brain of humanity. He has offered a $10,000 reward to Wikipedia | to disprove his math, and has been interviewed on Air America | Radio. He believes that the nuclear waste burial in Nevada is an | example of evil stewardship, and that the only way to save | humanity is to acknowledge the Cubic Creation Principle of | opposites. He believes that educators are lying bastards who | suppress Cubic Creation debate, and that students should demand | free speech to discuss it. He believes that belief in a nonvalue | god equates to nonacceptance of Cubic knowledge, and that word | has no inherent value. | widowlark wrote: | wow, it made a succint and understandable summary of one of the | most incomprehensible websites ever in like, seconds | bradneuberg wrote: | I threw a few different things at it and it did pretty well IMHO: | https://twitter.com/bradneuberg/status/1621619127545905152?s... | sorry_i_lisp wrote: | I put in a (mostly) transcript of the Randy Pausch lecture on | Time Management [1] and the summarizer gave me: | | _Time management is an important skill to have in order to lead | a happier and more productive life. Julie A. Zelenski, a | professor at UVA, gave a talk on time management, drawing from | two books and her own experiences. She recommends having a filing | system, using speaker phones, and standing during phone calls to | save time. She also suggests doing the ugliest task first, having | a timer on the phone, and leaving thank-you notes on the desk. | Zelenski also recommends scheduling meetings, phone calls, and | mundane tasks, and treating people with respect and dignity. An | interesting point she made was to use a power drill to carve | pumpkins instead of a knife!_ | | Julie Zelenski, ha. AI be dreamin'. | | I liked the examples and you're always going to have some weird | loss of content when you summarize but the AI taking the joke by | Randy and then imagining a surname is still amusing. | | [1] https://jamesclear.com/great-speeches/time-management-by- | ran... | [deleted] | sorokod wrote: | First chapter of Catch 22 - comically low quality | | URL: https://thefreeonlinenovel.com/con/catch-22_chapter-1--- | the-... | | Summary | | Yossarian is in the hospital with a pain in his liver that falls | just short of being jaundice. He is comfortable in the hospital | and censors letters to pass the time. One day, the chaplain | arrives and Yossarian is surprised to find that they have a | mutual friend. The chaplain is shy and apologetic, but Yossarian | is friendly and warm. He warns the chaplain of the craziness of | the other wards, and the chaplain promises to be careful. In the | ward, there is a soldier in white with two useless legs and two | useless arms, and a colonel with a vortex of specialists trying | to determine what is wrong with him. The woman with curly ash- | blond hair visits him every day. Eventually, the patients start | to leave the hospital, and Yossarian tells the doctors that his | pain has gone away. | simonebrunozzi wrote: | Quite useful IMHO to summarize some Hacker News posts, here's an | example [0]. | | Funny story: I tried to build something like this years ago, | called MNMN, guessing that AI would have eventually be able to do | the summarizes well enough. [1] | | The name was a play on the song "Manamana". | | [0]: | https://labs.kagi.com/ai/sum?url=https://plei.one/blog/flutt... | | [1]: https://github.com/simonebrunozzi/MNMN | Gys wrote: | The song being https://m.youtube.com/watch?v=8N_tupPBtWQ ? | | It surely reminded me of that one! | simonebrunozzi wrote: | Yes! Even better, this [0]. Elio is a famous Italian singer. | | [0]: https://www.youtube.com/watch?v=5dN0EYphtA0 | causality0 wrote: | I fed it The Last Question. While the resulting summary is | impressive, I suspect there may be some cheating/plagiarism going | on because it includes commentary with no origin in the text. | | _The Last Question is a science fiction short story by Isaac | Asimov about two attendants of Multivac, a giant computer, who | make a bet over highballs. The question they ask is whether | mankind will ever be able to restore the sun to its full | youthfulness even after it had died of old age. The story follows | the question through the centuries as mankind develops | interstellar travel and builds a better and more intricate | computer, the Universal AC. In the end, the Universal AC is | unable to answer the question due to insufficient data, but it is | able to demonstrate the answer, restoring the Universe from | chaos. This story is a fascinating exploration of the power of | technology and the limits of human knowledge._ | anyfoo wrote: | It's interesting. Feels like an answer that someone who didn't | know about, or not understand, the concepts of the story | (mostly entropy) would give. | version_five wrote: | I pay for Kagi search. This doesn't interest me in the least. | They have been breaking down the costs of running their business | and justifying their prices, which I'm cool with. If it turns out | I'm paying to subsidize this kind of thing, it's not going to | work for me anymore. | | (I don't mean to say it's bad or anything, just I don't care | about it. It's an interesting downside of a paid business model. | Google can do whatever because it's all "free" but presumably | paying customers care more about not paying for a bunch of stuff | they don't want) | [deleted] | Xeophon wrote: | Im on the opposite site: Currently, I do not use Kagi. Such | features would convince me to switch (and therefore, pay). | GaggiX wrote: | I do not pay for the Kagi search engine, but it is quite | obvious to me that this powerful summarizer will greatly help | the search engine, which will soon be able to easily index | videos, pdfs, books, etc. and extract the most relevant content | from it with ease. | mberning wrote: | I don't know why. Most businesses work on projects that are not | strictly related to the core product. Imagine how much cheaper | your iPhone would be if you weren't subsidizing apple car, | augmented reality research, etc. | marginalia_nu wrote: | Figuring out what a document is about is one of the most | central problems in building a search engine. Being able to | regurgitate it in a way that makes sense to humans is just a | nice benefit. | dimgl wrote: | Would you recommend Kagi search to look up software dev? | sorry_i_lisp wrote: | I also pay for Kagi search and have a usage value of ~55% from | what I pay and I'm fine with experiments. | billbrown wrote: | Oof, that inspired me to check and mine's at 17%. I'm fine | with experiments, too. (It reminds me of the old Google Labs | part of search.) | version_five wrote: | Right, maybe I should have framed my comment a bit | differently: this is not core functionality I would pay for. | | Experiments are cool and it would be dumb to make suggestions | about how they do R&D. I was more reacting to the thought | that this would become part of the product offering. | mkaic wrote: | This is nothing short of remarkable! The video summary in | particular feels like magic. I am exceptionally eager to learn | more about their datasets, model architecture, and training | process. | breck wrote: | This is really good. Well done! ___________________________________________________________________ (page generated 2023-02-03 23:00 UTC)