[HN Gopher] Show HN: Hacker News user blogroll ___________________________________________________________________ Show HN: Hacker News user blogroll I saw this [0] pretty cool thread by user revskill, and wanted a quicker way to search through it, but also to keep them all in one place so I can read them at my leisure whenever I get time. Right now is like 60 lines of Ruby using Nokogiri, but I will certainly look into it further down the line and improve the list. There's a cronjob checking the thread every 12 hours but I will eventually shut that down and it will become static after that. There are some really awesome blogs in there. I really recommend going through the list, it made my day. [0] "Could you share your personal blog here". https://news.ycombinator.com/item?id=36575081 Author : deathbypenguin Score : 356 points Date : 2023-07-05 19:06 UTC (3 hours ago) (HTM) web link (dm.hn) (TXT) w3m dump (dm.hn) | LordDragonfang wrote: | Since hn karma probably correlates to how much hn readers would | enjoy a blog, I'd love a column/columns that includes the user's: | - hn profile karma - total karma of posts from that domain | - as above, but Sum(log(post_karma[i])) | | ...or something similar. | | Whatever is feasible. For a while I've wanted a list of | "blogs/domains that hn likes" that isn't polluted by general- | high-traffic domains. | ploum wrote: | That's an awesome idea, I would be really curious to see the | result. | | (hoping that this does not backfire, for example encouraging | people to spam HN with their own posts to gain some karma on | the blogroll) | LordDragonfang wrote: | To address the exact situation in your parenthetical, I | considered putting Sum(log(post_karma[i]-k)), for a k such | that the expected log value is negative unless you get enough | upvotes. | joseferben wrote: | Thanks for putting this together, love the name! | WoodenChair wrote: | Unfortunately I made a typo in writing my URL (should be | https://observationalhazard.com/ not | https://obervationalhazard.com/) and this has no way to update | it. | jefftk wrote: | Neat! It looks like something is broken with unicode handling? | For example the "smart" apostrophe in | https://news.ycombinator.com/item?id=36594375 (U+2019, RIGHT | SINGLE QUOTATION MARK) is being rendered as "a". Perhaps | something is interpreiting utf-8 as latin1? | deathbypenguin wrote: | Indeed. I'm having a fight with that at the moment and the line | breaks as well. | oneeyedpigeon wrote: | It's great. Is there any real point in sorting on description or | url? I guess url does group http and https, which might be | useful, but description definitely seems like it would be nicer | if the sort option were removed. | b8 wrote: | Hmm, my blog wasn't added. Maybe when the data was scraped I | hadn't posted it yet? | deathbypenguin wrote: | It will be picked up in a few hours. | voigt wrote: | I'm missing the good old time of webrings. This is very close :) | PartiallyTyped wrote: | You may be interested in [1], it's run by a few friends of | mine. Hopefully it won't get hugged to death. | | [1] https://32bit.cafe | jakebasile wrote: | Look ma, I'm in an HN link! This is pretty neat. | Moncefmd wrote: | This is great! Would've been cool to also be able to sort by | votes though. | xwdv wrote: | I'm going to train an LLM on all these blog posts, make a true HN | AI. | deathbypenguin wrote: | Added feeds thanks to JSTucker. They are being fetched from the | Gist. I think the cron ran, so there are more blogs now. | | json: https://dm.hn/blogroll.json (I'll add the feed to each item | in a minute) | astuyvenberg wrote: | Oh neat - I saw mine on the list earlier but is now removed, | maybe some kind of rate limiting? Anyway, neat project! | levysoft wrote: | I can't believe you had the same idea and necessity but you | preceded me. Good job! | zdwolfe wrote: | Looks cool, thanks for making this. | ghomem wrote: | Clap clap clap. This is excellent public service @deathbypenguim. | Yesterday I was scrolling through that enormous thread and using | control+F to look for keywords of interest on the posted blog | descriptions. Now it will be much easier to follow fellow | bloggers. Thanks for having my blog on your list too. | AndrewKemendo wrote: | We reinvented web rings! | I-M-S wrote: | Can we do this for HN users' podcasts? | swyx wrote: | shameless plug for my pod: https://www.latent.space/podcast | 1attice wrote: | Naked self-promotion here, but I was late to the party on the | original blogroll -- is there any way to add blogs post-ex-facto? | Is there a submission mechanism? | TOGoS wrote: | I commented, but mine got missed, somehow. Maybe because my | phone auto-capitalized the "H" in "http" and the script didn't | account for funny capitalization. Sad! | 1attice wrote: | I did too, but I didn't include the protocol, just the bare | domain :/ | | FWIW it's https://lizmars.net | eigenhombre wrote: | Same, I think this is a great idea and would like to submit | mine as well -- maybe an "add" feature on the page would make | sense, or re-inquire here at intervals, maybe monthly/yearly? | petercammeraat wrote: | Brilliant. Easy to use as filter for subjects (if people | described their blog) | do-me wrote: | That's awesome and so much more practical than scrolling through | HN. It would also be possible to integrate semantic search so | people don't necessarily need to know the keywords. If you're | interested, feel free to ping me or take a look at | https://github.com/do-me/SemanticFinder. In case I could just | create a pre-indexed version based on your data dump which would | be quite convenient to use. | tenkabuto wrote: | These threads make me wish that I had a blog, not just a regular | website. :( | nelsonfigueroa wrote: | I took at look at your website and it seems like you could | easily add blog posts to it! | tenkabuto wrote: | Hah, thanks. I've been hoping to do so, but still haven't | gotten around to it. There's some quirks with the static site | generator that I use[0] that lead me to keep postponing | setting up blog-ish features, and I don't know enough python | to fix them. | | [0]: https://github.com/gordonbrander/lettersmith_py | nelsonfigueroa wrote: | If you ever want to try a new static site generator, I use | Hugo[0] to generate my site. There's a lot of pre-built | themes[1] you can use. Most (if not all) have blogging | functionality built in, all you need to do is drop in a | Markdown file with your content. You may need to learn a | little bit if Golang if you want to customize themes. Just | throwing it out there as an option. | | [0]: https://gohugo.io/ | | [1]: https://themes.gohugo.io/ | tenkabuto wrote: | Thanks! In writing out my reply to you I realized that I | should look into other generators (specifically looking | into Hugo, as I think I've seen it used by people like | myself who take notes in Obsidian). The key features I | want are backlinks support and blogging features, along | with Markdown support. | victorbjorklund wrote: | Would be cool to make an RSS feed that combined all the RSS feeds | from all the blogs. | amadeuspagel wrote: | Here's an HTML feed: https://webloglist.com/hn | samsquire wrote: | Thanks for creating this. I will use this to go through people's | blogs. | | I think my blog/journals hasn't been picked up yet | | https://news.ycombinator.com/item?id=36588927 | | My blog is on GitHub, how do you parse the URLs? | deathbypenguin wrote: | It should show in a few hours when the cronjob triggers. | Parsing: nokogiri | ghoomketu wrote: | Looks great and congrats on shipping. If it were up to me I'd | still be deliberating the best framework and design to use for | this, and how I can pipe the comments through chatgpt to extract | the category, keywords and do things that make it the best | blogroll ever. | | And then I would have just thought it's too much work for nothing | and that'd be the end of it :P | myth2018 wrote: | Dude I can definitely relate | deathbypenguin wrote: | haha I had the exact same ideas, but then I was "bah! I'll put | it out there and I'll add functionality over time" | swyx wrote: | did you just have the .hn TLD standing by? where is that | from? must have cost a pretty penny | smokel wrote: | Great work, thank you for sharing this. | | I would prefer to see the entire list, so that I can easily | search for keywords in the browser. Apparently, all data is | available on the client side, but the table renderer seems to | limit the table size to at most 100 entries. | ryan-duve wrote: | A workaround while you're waiting for this to be supported by | OP is to go to inspector and change the last dropdown option to | <option value="10000">10000</option> | | then select it in the UI. | Aissen wrote: | It's very nice, thanks ! It would be nice if descriptions had new | lines; some aren't readable, while they work quite well on HN. | deathbypenguin wrote: | I'm on it. | scarface_74 wrote: | This is awesome | abathur wrote: | Hmm. Any idea why some wouldn't show up? I posted in | https://news.ycombinator.com/item?id=36588940 but don't see it in | the list. | toyg wrote: | Same for me. Maybe the scraper choked on pagination, maybe they | just took a snapshot before we posted. | JSTucker wrote: | Heres an OPML with all the feeds I could detect from the list! | https://gist.github.com/Josh-Tucker/030b8cba6557927a27f1c7e6... | swyx wrote: | if you share the code for OPML conversion maybe OP could | incorporate it quickly | re wrote: | This made me think of "planets", which I feel had a heyday back | in the late 2000s before Reddit and social media took over | everything. Anyone want to take all the blogs with RSS/Atom feeds | and build an HN planet? :) | | > In online media a planet is a feed aggregator application | designed to collect posts from the weblogs of members of an | internet community and display them on a single page | | https://en.wikipedia.org/wiki/Planet_(software) | soegaard wrote: | https://planet.scheme.org/ | ploum wrote: | Yeah, planet were awesome. I'm proud to say that my blog was | both on planet.gnome (the original one) and planet.ubuntu. | | Now, I feed that the most interesting planet is planet.debian, | which offers lot of variety without being focused on Debian. | | The great feature I liked was that Planet were _not_ about a | given project. It was about the people contributing to the | project. Their life. Their interests. | | At some point, lot of planets started to ask only "on-topic" | posts with a specific RSS feeds. Those planets became boring as | it was mainly stuffs you could find on forum or any tech | related websites. | tenkabuto wrote: | Yes! I've loved Planet Python[0] because it really lets you | see that the Python community is quite varied, fun, and | human. | | [0]: https://planetpython.org | susam wrote: | I still follow a few planets. For example: | | https://planet.lisp.org/ | | https://planet.emacslife.com/ | amadeuspagel wrote: | Here you go: https://webloglist.com/hn | susam wrote: | Very interesting! Thanks for sharing your project here. Out of | curiosity, I did some searches with some interesting strings. At | the time of posting this comment, here is what the search results | look like: | | Vim: 8 entries | | Emacs: 7 entries | | Python: 24 entries | | Rust: 24 entries | | Lisp: 5 entries | | Clojure: 3 entries | | Haskell: 5 entries | | Zig: 5 entries | | Elixir: 4 entries | | Scheme: 0 entries | | Postgres: 4 entries | | MySQL: 0 entries | | SQLite: 3 entries | | Jekyll: 9 entries | | HTML: 40 entries | | Markdown: 6 entries | | LaTeX: 1 entry | | Hugo: 12 entries | | Next.js / Nextjs: 4 entries | | Gatsby: 2 entries | | Pelican: 0 entries | | .com: 495 entries | | .dev: 90 entries | | .net: 84 entries | | .io: 82 entries | | .me: 53 entries | | .org: 43 entries | | .xyz: 15 entries | | .page: 6 entries | | github.io: 46 entries | | medium.com: 18 entries | | blogspot.com: 8 entries | | wordpress.com: 4 entries | | livejournal.com: 0 entries | | tech: 178 entries | | programming: 66 entries | | random: 61 entries | | thought: 49 entries | | math: 16 entries | | musing: 12 entries | | blag: 1 entry | | favorite: 28 entries | | favourite: 9 entries | | Now all of these results are string search results, so there is | always going to be a little bit of noise when we try to draw | conclusions out of these results. For example, the results for | ".dev" also contains results that look like "*dev*.com". | | Despite the noise, I found these results interesting. I remember | in the early days when the blogosphere was being constructed 20 | km above the tag clouds, it was very fashionable to have blogs | for random musings or random thoughts. So I am delighted to see | that most blogs out here are tech blogs. Surprisingly there is | only blag. I expected at least a few more. | | One of the Lisp entries is mine. Also, one of the Vim entries is | mine. It is a bit ironical because I am actually an Emacs user. | If I had known the comments we write on HN would become part of | the search string in this blogroll, I might have chosen my words | in my comment to the "Ask HN" port more judiciously! :) | boricj wrote: | reverse engineering: 5 entries | | Ghidra: 1 entry (mine) | | On one hand it does bring some level of perspective on the | popularity of a particular topic you're into. My first reaction | was "Just 0.5% for reverse-engineering? I guess I'm down in a | deep dark rabbit hole..." | | On the other hand, I haven't seen the blogs of Ken Shirriff, | Alex Ionescu or Raymond Chen on that list, which I know are | quite popular and regularly make it to the Hacker News front | page. | cavalcade119 wrote: | [dead] | kiruio wrote: | Cool, I forgot to add descriptions. Would be nice if I could fix | it | version_five wrote: | Can you say what criteria you used to filter the thread into | valid blogs? | hyperific wrote: | Thanks for doing this! | verse wrote: | Love this, thanks for building it! | amadeuspagel wrote: | The latest posts from these blogs: https://webloglist.com/hn | jakebasile wrote: | That's cool! Did you pull RSS from all the sites you could and | use that to aggregate it? | amadeuspagel wrote: | Yes, webloglist uses RSS autodiscovery. | darekkay wrote: | It seems the autodiscovery didn't work for my blog (link in | profile). I've posted something 2 days ago but it doesn't | appear on your site. My feed is on the list from JSTucker, | who also used some sort of autodiscovery. | amadeuspagel wrote: | Atom isn't supported yet. Working on it. | rambambram wrote: | Nice list! I was almost going to ask you if you have an OPML | file with all the feeds, but then I decided to check the list | manually for interesting latest posts and grab only their | feeds. Thanks for the list! | addandsubtract wrote: | Now we just need ChatGPT to read them all and give us a daily | update on the interesting ones. | skilled wrote: | Good job. I would honestly love this but with RSS feeds also, but | I know it's a tough ask unfortunately. (Not for you, but in | general) | xoranth wrote: | Most blogs that have RSS also have a `<link rel="alternate" | type="application/rss+xml">` tag that redirects you to the RSS | feed. If you pass the link to the homepage to a feed | reader[^0], it will follow the link tag and find the RSS feed. | | [^0]: At least, Liferea on Linux, NetNewsWire and Vienna on | Mac, do this. AFAIR NetNewsWire is even smarter than that, and | can sometimes find the RSS feed even when there is no link tag. | marginalia_nu wrote: | A bit of a snag is that many CMSes generate multiple feeds, | and there is no way I'm aware of for identifying which is the | "canonical" feed. | 1270018080 wrote: | Conspiracy: That post was only made to harvest data for someone's | model | bachmeier wrote: | Well, given that blogs are public and the whole point is for | others to read them, I think that's okay. | leejoramo wrote: | This is great. An OPML version of this would be great to bulk | IMPORT the RSS/ATOM feeds into your favorite feed reading app. | JSTucker wrote: | I've scraped what I could from the list and exported and opml | here: https://gist.github.com/Josh- | Tucker/030b8cba6557927a27f1c7e6... | tommy_axle wrote: | A feed is now added to https://codeinsider.dev | (https://codeinsider.dev/rss.xml) | JSTucker wrote: | Have added your feed to the list | [deleted] | deathbypenguin wrote: | Thanks man, it's been added: https://dm.hn/blogroll.json and | next to each entry. | rambambram wrote: | Indeed! But I guess not every blog has a feed, or there's no | quick way of letting people add one to the list after the fact. | prepend wrote: | I expect that every blog has an rss or atom feed. It would be | strange for someone to go to the effort of writing a blog and | not setting up a feed. That and most blogs have feeds | automatically. | | That being said, any blog that doesn't have a feed and has | some proprietary subscription is not one I want to subscribe | to. So not including feedless blogs is a positive for me. | TimCTRL wrote: | Saw the .hn domain and I was like What...HN has its own TLD. Then | i searched google and saw it belongs to honduras...daft me i | guess.. | mcmcmc wrote: | All two-letter TLDs are country codes. | TimCTRL wrote: | Thanks! | akiselev wrote: | Our future AI overlords sincerely thank you for this pristine | data set. | alfiedotwtf wrote: | Weird. I added to that original post, but I'm not on your list. | Maybe your code didn't go to the "See more comments" page? | syx wrote: | I would add a shuffle button that opens a random blog so it's | nicer to discover something new compared to endless paginations. | deathbypenguin wrote: | Noted. I will be correcting a few bits and adding new | functionality over the next few days/weekend. | scastiel wrote: | +1 | | I would even add a "I'm feeling lucky" button, to redirect to a | random blog ;) | splitbrain wrote: | That's what https://indieblog.page was made for | deathbypenguin wrote: | Random blog button up now. | alonsonic wrote: | Love this, have been reading random blogs for the last | 30minuts already | cavalcade119 wrote: | [dead] ___________________________________________________________________ (page generated 2023-07-05 23:00 UTC)