[HN Gopher] Show HN: WikiBinge - discover how all things are vag... ___________________________________________________________________ Show HN: WikiBinge - discover how all things are vaguely connected Connect two articles on Wikipedia, but do it the long way. I've always been a fan of the theory of six degree of separation, but it's an overused concept when exploring the Wiki-graph. Instead of showing the shortest path, which in my opinion is "boring" and ends up connecting super-important central articles, I came up with my own method: WikiBinge selects the smaller, less represented articles on Wikipedia. In a WikiBinge path, the underdogs are the kings! How does it work? It's pretty straightforward! Compute PageRank on the Wiki-graph and assign as weight of each edge the PageRank value of the destination node. A WikiBinge path is then simply a shortest path using these weights: the algorithm will then favor paths passing through articles with lower PageRank values. More on the motives to build this here: https://www.jamez.it/project/wikibinge/ This is an older project of mine, but it never got much exposure, so I'm humbly submitting it now. Author : jamez Score : 87 points Date : 2023-04-14 17:48 UTC (5 hours ago) (HTM) web link (www.wikibinge.com) (TXT) w3m dump (www.wikibinge.com) | adrianh wrote: | This is excellent. The path from "George Barnes (Musician)" to | "Django Reinhardt" somehow managed to pass through Hysterectomy, | 5-Bromouracil, Glucuronidation and Port Bannatyne (Scotland). | Kudos for some creative coding! | munro wrote: | The chains are so long, it's really not that impressive :/ | r3trohack3r wrote: | The point is that the chains are long and winding instead of | the shortest path between two articles. | | It seems that, if you pick an uncached path, the loading screen | shows you the shortest path while it computes the longer one. | More info in their linked article. | jonathankoren wrote: | Excellent. 32 degrees away, and brings in Air Bud. | | https://www.wikibinge.com/#John_Wilkes_Booth/Hentai | throwaway9131 wrote: | Strange, my first guess had no connections either way | | chicken nugget <-> constitution of canada | | Now I'm wondering if its a bug or if there is actually no | connection | ape4 wrote: | But this works... | https://www.wikibinge.com/#Chicken_nugget/Canada_Day | foota wrote: | It might be interesting to see this but with paths where they're | instead weighted based on the strength of the relation (e.g., | something like TF-idf on the articles each link to). | | I think this would avoid the super common article problem, but | also lead to more relation between each link. | jamez wrote: | TF-idf would definitely be something very interesting to try, | though I also treasure the serendipity brought by the | "blindness" to the content. | r3trohack3r wrote: | TF-idf would be a really cool tool for doing research. | | Seems like there are some cool modes you could explore here: | | * TF-idf Path | | * The Meandering Path (current) | | * The Shortest Path | | * The Human Path (prefer connections that are humans vs. | topics/places/concepts/etc.) | | * The Rabbit Hole Path (prefer connections that are | concepts/academic/etc.) | ArekDymalski wrote: | it's nice that it highlights less popular and obvious - i've | discovered a lot linking potato to the Millenium Falcon :) | https://www.wikibinge.com/#Potato/Millennium_Falcon | kirubakaran wrote: | Anyone got anything longer than this? (68) | | https://www.wikibinge.com/#Madurai/Semiahmoo_Bay | vallanceroad wrote: | My first search counted 118: | | https://www.wikibinge.com/#George_Bush_Intercontinental_Airp... | | I imagine you could double this. | codetrotter wrote: | I count 91 articles from FreeBSD to Weedonville, Virginia | | https://www.wikibinge.com/#FreeBSD/Weedonville,_Virginia | layman51 wrote: | I count 100+ articles between these two. It is a pretty long | path and that is the point I guess. | https://www.wikibinge.com/#Amy_Goodman/Anatoly_Karpov | r3trohack3r wrote: | This is absolutely amazing. | | I built wikiscroll.blankenship.io for myself to scratch my | neophile itch. You might be displacing it in my daily routine, a | nice pre-built rabbit hole between two topics of interest has | proven to be a lot of fun over the past 30 minutes. | | Amazing work. | | As a short aside, at first I didn't get it. I was surprised the | paths between articles were so long. It wasn't until I tried | "Adolf Hitler" -> Something (Hitler has notoriously short paths | to everything) that I realized these weren't the shortest paths. | Your loading text does a really great job of explaining that, but | the "random" button appears to be pulling from a cache (clever!) | so I didn't get to see that loading message about the "boring | shortest path" until I went off the beaten path. | | Since it seems like you are computing both the shortest and the | "most interesting" path between the two articles, it would be | cool to give me a way to see both on the final loaded page. The | shortest path is interesting too, even if it is less interesting | than the one you ultimately generate. | | It'd also be cool to be able to "pin" one of the boxes so the | random button only impacts the other. For example, if I started | at the Great Molasses Flood, what path could I take to random | other articles? Though I guess this can be accomplished by | spinning and then retyping the "Great Molasses Flood" | | Edit: I deeply appreciate your narrative at | https://www.jamez.it/project/wikibinge/ - this is one of my | favorite projects I've come across on HN in a long while. | Kerrick wrote: | This is pretty neat! I tried | https://www.wikibinge.com/#Moray_eel/Sony_Alpha and I was pleased | with the path it took me for first big chunk of the path. The | last bit was a tour of a significant number of camera models | (none of them Sony), which felt... strange? It certainly felt | less-varied than the combination of animalia, history, geography, | and pop culture that the first part took me through. | | Fun project, thanks for sharing! | | If I had a bit of feedback to share, it's that the shortest path | (which shows while loading the binge) continues to be visible | after it finishes loading -- maybe at the bottom of the page? | jchw wrote: | Interesting. Seems like it'd be ideal if there were some way to | penalize pages that are too similar to each-other (similar | categories/taxonomy, maybe similar text/structure, etc.) | because when it does chain a bunch of weird stuff it is very | interesting. | jamez wrote: | Cool path! The last bit of the tour you described is fairly | common - the algorithm is just doing its job, getting closer to | the target, avoiding articles with larger PageRank. It's very | uncommon to pass through "important" pages. Glad you enjoyed | it! | stevenking86l wrote: | This is great fun. Though my first guess had no connection: | Cleopatra and The Great Pacific Garbage patch | jamez wrote: | When you can't find a connection, always try the reverse! | https://www.wikibinge.com/#Great_Pacific_garbage_patch/Cleop... | CyborgCabbage wrote: | https://www.wikibinge.com/#Electoral_college/The_Long_Earth | | Apparently Humptulips, Washington was Terry Pratchett's favourite | place on earth. :) | Anduia wrote: | I tried to go from "Bronze Age" to "Mail" and it took me in a | long ride including the dog from Oz. Quite fun. | https://www.wikibinge.com/#Bronze_Age/Mail | passwordoops wrote: | Nice! My wife and I used to play a game sort of like this - find | the shortest path between two pages on Wikipedia. It actually | made for a fun party game too. | | I really love the circuitous path though. Fantastic route to | discovery and I can see those even being a neat thing for schools | birdyrooster wrote: | You two sound awesome together. | MattGaiser wrote: | https://www.thewikigame.com/ | ramesh1994 wrote: | It is a pretty fun game https://wiki-race.com/ | armandososa wrote: | This is funny, pressed the dice icon and I got "Milk" and | "Cookie" and I thought it was going to be a short connection. It | isn't. https://www.wikibinge.com/#Milk/Cookie | delecti wrote: | That is a shockingly long chain. Amusingly if you flip them, | Cookie links directly to Milk. | MattGaiser wrote: | Just using my Wikigame skills, I went from Milk -> Hot | Chocolate -> Chocolate -> Chocolate chip cookie -> Cookie. This | path likely got downranked because all would be popular | articles. | [deleted] | mcint wrote: | My first attempt I can't search donut (Doughnut is the | canonical), can't type the "()" parenthesis that appear in page | names, and can't use any of France, La France, or French Republic | to indicate the wiki page on France. Lots of francesca's though. | | Fun fun, thank you for sharing! In the interactive web | interface*, I hope non-canonical names can be used, that shortest | names can be completed and exact matches can be use, and at least | accept what it's in page names. | | *It looks like writing the URL fragment yourself allows more | leniency. | raybb wrote: | Pretty cool project! FYI for some reason search doesn't seem to | work that well. | | hackNY won't come up and if you try try to add a place with a | comma (Lowell, Massachusetts) you can't type it you have to | scroll it. | | https://en.wikipedia.org/wiki/HackNY | | https://en.wikipedia.org/wiki/Lowell,_Massachusetts | dzink wrote: | Horse and Astronaut are more closely connected (through the | shows, road, actors) than Cancer and Healthcare which are not | connected at all, apparently. This was the problem with public | datasets and counting on HTML links to build the graph of human | knowledge years ago. I tried to build better search at the time | and hit this wall. Large language models will be a bonanza for | new products now that the wall is broken. | jamez wrote: | To be clear - the point is not to display what can or cannot be | connected. The emphasis on this project is about the long and | tortuous path chosen. | | I take your point about the limits of knowledge graphs written | manually vs LLMs. IMHO it's not either/or. We need both | curation and statistical approaches, and when they are merged | they give the best results. Just ask Wolfram: | https://writings.stephenwolfram.com/2023/03/chatgpt-gets-its... | Edit: fixed link to Stephen Wolfram's blog. | cocodill wrote: | hmm, that six degrees of wikipedia bridge is little weird. i | thought you can land from everywhere in few clicks by hitler. but | it takes over 70 steps to get dill from hitler. ___________________________________________________________________ (page generated 2023-04-14 23:00 UTC)