[HN Gopher] GITenberg is an open source community for publishing... ___________________________________________________________________ GITenberg is an open source community for publishing ebooks in the public domain Author : bilinualcom Score : 171 points Date : 2020-08-02 13:14 UTC (9 hours ago) (HTM) web link (www.gitenberg.org) (TXT) w3m dump (www.gitenberg.org) | maire wrote: | This looks like a subset of Project Gutenberg? | | The Girl from Alsace in Gitenberg: | https://www.gitenberg.org/book/35926 | | The Girl from Alsace in Gutenberg: | http://www.gutenberg.org/ebooks/35926 | | The numbers are even the same which seems suspicious. Hmmm. | bhickey wrote: | It's a version controlled dump of Gutenberg. | chmod775 wrote: | "GITenberg is an exploration of how Project Gutenberg might | work if all the Gutenberg texts were on Github, so that tools | like version control, continuous integration, and pull-request | workflow could be employed." | | https://ebookfoundation.org/ | sethish wrote: | Yes, absolutely. Apologies if that's not clear from the | website. GITenberg started as an experimental fork of PG, but | due to the work of Eric Hellman at the Free Ebook Foundation, | much of the infrastructure such as metadata formats, CI/CD for | building books, and DVCS backend are being ported upstream to | the Project Gutenberg infrastructure. | maire wrote: | Thanks for the clarification! | bilinualcom wrote: | OP here, I doubt that the website is "a SUBSET of Project | Gutenberg". This is not mentioned anywhere in the website. | As mentioned in PG website, while you can use the book | freely, "The name 'Project Gutenberg' is a registered | trademark.". I guess this is the reason they didn't mention | PG. | sethish wrote: | Now I remember, that was the reason I wasn't more clear | about the connection to PG when I wrote the website | https://github.com/gitenberg-dev/giten_site/. Since then | my co-founder Eric Hellman has been doing engineering | work for Project Gutenberg, as well as running the rest | of the Free Ebook Foundation, which is the parent org of | GITenberg, free-programing-ebooks, and Unglue.it. | | I think that the GITenberg collection contains all of the | books in PG. At this point, the creation of new repos is | automatically done when Distributed Proofreaders creates | a new book in PG. Originally, I didn't include around 400 | PG books due to their creators claiming copyright, and | didn't include Bruce Sterling's book because he wouldn't | let me re-license it creative commons rather than his | pseudo-public-domain license. | | Not much has been happening with GITenberg itself in the | past few years. But luckily, a lot of the concepts and | code are getting upstreamed into PG. Which in my opinion, | is way way better. | bilinualcom wrote: | Thanks for clarification. | sethish wrote: | You might be more familiar with another project the Free Ebook | Foundation maintains: https://github.com/EbookFoundation/free- | programming-books/ Which is one of the top-10 repos on github by | number of stars. | anaphor wrote: | I like how they're able to accurately tag translators vs original | authors, e.g. https://github.com/GITenberg/The-History-of-the- | Peloponnesia... | koolba wrote: | They're probably going to have to change their name to something | that does not have "git" in it: https://public- | inbox.org/git/20170202022655.2jwvudhvo4hmueaw... | sethish wrote: | Gitenberg was one of the alternate spellings of Gutenberg in | the 1400s. | agumonkey wrote: | I propose Gytenberg | artiszt wrote: | or, widely accepted standard [in german and slavic- | languages] way back then, 'v' instead of 'u', and vice | versa | agumonkey wrote: | ah well for some reason the visual asthetics of gytenberg | vs gvtenberg made me chose the former (even though at | first I typed gvtenberg) | bhickey wrote: | > It's hard to hold them responsible for picking a name that | violated a policy that didn't yet exist. | | fwiw, Gitenberg has been around at least since 2012. | metiscus wrote: | They've been named like that for six years apparently so I'm | not sure how that works for what you linked above. It is a | valid point that hopefully doesn't cause problems. | gravitas wrote: | The https://standardebooks.org/ project has been rebuilding | ebooks (open ePUB format) with an eye on quality and readability | on mobiles/tablets (fonts, copyedit, etc.) which might be of | interest, all books and the website revision are tracked in git. | sethish wrote: | I'm a big fan of standardebooks! They're doing fantastic work | producing better ebooks from Project Gutenberg sources. | bilinualcom wrote: | OP here, I found this website when I was looking for a way to | get the updated version (with correction) of a PG (Project | Gutenberg) book and all changes/diffs from the point that I | scrape the book from PG website for my language learning side | project: https://www.bilinual.com | | The bilinual Project also rebuild ebooks (modern HTML, PDF | and open ePUB format) with better quality and readability( | while it is not its prime goal). Take a look at one example | here: | | https://www.bilinual.com/book/18043sven/sv/en#line=68&lpp=23 | https://www.bilinual.com/download/18043sven-sv-en.pdf | https://www.bilinual.com/download/18043sven-sv-en.epub | yesenadam wrote: | Wow, they look awesome, thank you! (Learning spanish here) | voldemort1968 wrote: | What is improved by adding Git to PG? | sethish wrote: | It's not so much about just adding git to books, but git to the | transcriptions of books from printed materials. There are | errors in the transcriptions, many books are lacking | formatting, and many books pre-date Unicode and are in ascii. | Tracking the book's source files in git makes it easier to | collaborate on making these changes, or seeing what changes | have been made. | bilinualcom wrote: | Well, for my side project, https://www.bilinual.com I needed | version controlling for changes that are made on a book (fixed | typos, ...) and I found GITenberg project. There are several | interesting tools developed during the project accessible here: | https://github.com/gitenberg-dev | hnarayanan wrote: | I don't know why, but I expected some amazing typography in their | PDFs. :( | sethish wrote: | [Standard Ebooks](https://standardebooks.org/) has accomplished | a lot in producing better ebooks of public domain texts. | bilinualcom wrote: | OP here, it seems standardebooks doesn't provide the books in | PDF format. My side project, https://www.bilinual.com rebuild | the ebooks in PDF format with translation hints, if you don't | mind about learning a new language while reading your | favourite books ;) | sethish wrote: | There are about 400 GITenberg books that have CC-by | licensed covers provided by Recovering the Classics. If | you're interested in using that art for your PDFs I can | find you the index! | bilinualcom wrote: | Thanks, it would be great. I am curious to know why do | they have CC license and not public domain? | robin_reala wrote: | We have this problem with Standard Ebooks. The number of | people that say things are public domain without actually | checking it is very high. A CC0 licence is an _explicit | grant_ of public domain status by the licensor, and hence | the legal issues rest with them in the event of any | problem. | | Public domain obviously can be ascertained, but if CC0 | hasn't been granted we rely on dated reproductions: | basically a photograph of the artwork in question in a | book or journal with a copyright date of 1924 or earlier. | | (that's obviously specifically a US legal reading, but SE | from a legal point of view is a US project) | thangalin wrote: | I wrote a technical comparison between various public domain | eBook projects: | | https://dave.autonoma.ca/blog/2020/04/11/project-gutenberg-p... ___________________________________________________________________ (page generated 2020-08-02 23:00 UTC)