[HN Gopher] Show HN: I wrote a book about using data science to ...
       ___________________________________________________________________
        
       Show HN: I wrote a book about using data science to solve
       "everyday" problems
        
       Author : andrewnc
       Score  : 257 points
       Date   : 2021-02-24 18:04 UTC (4 hours ago)
        
 (HTM) web link (andrewnc.github.io)
 (TXT) w3m dump (andrewnc.github.io)
        
       | andrewnc wrote:
       | I've always wanted to write a book. I have helped write 3
       | different deeply technical books (and one solutions manual), but
       | I wanted something fun, interesting, and valuable.
       | 
       | So I wrote "Everyday Data Science" which is a collection of
       | stories, tutorials, jokes, math, and code all written to inspire
       | people to analyze their personal data.
       | 
       | In general, I was also inspired by the challenge to "make $100
       | online" which I have done in the past month since launching. It
       | was daunting, and I felt quite vulnerable, but overall I'm
       | pleased with what I've made.
       | 
       | I wrote up this quick post to give you an idea of the process I
       | followed to write the book, and some of the content.
       | 
       | I'd love to know your thoughts and am open to (nice) feedback :)
        
         | ghostbrainalpha wrote:
         | Did you make the $100 entirely through book sales?
        
           | andrewnc wrote:
           | Yes. I officially launched the book at the end of January. I
           | delayed doing a HN launch until I was sure people enjoyed the
           | book and I knew there would be some interest.
           | 
           | The unfortunate truth is that book writing is a low margin
           | game and you shouldn't do it for the money. Development or
           | actual data science is far more profitable.
           | 
           | But there are many other intangibles :)
        
         | mholt wrote:
         | Hey Andrew, it's Matt from our graduate research lab. This is
         | exciting, congrats!
         | 
         | You always did make me thirst for more understanding of ML...
         | guess I'll have to buy this book. Do you make more margins on
         | digital or print copies?
         | 
         | By the way -- HNers -- if your company needs talent in data
         | science, Andrew is easily one of your best candidates. His
         | intuitive understanding and teaching of data and ML inspired me
         | to be a better scientist. Andrew thinks critically and is also
         | a great person to work with.
        
           | andrewnc wrote:
           | I have higher margin on digital copies :) Amazon loves to
           | take their share.
           | 
           | Those are kind words coming from maybe the single greatest
           | programmer I know.
           | 
           | Thanks for your support and for the recommendation :D
        
         | matsemann wrote:
         | I've read "Algorithms to live by" and liked it. This looks like
         | a data science variant of that same idea, very cool.
         | 
         | I think books like these can be a great eye opener. We all
         | remember thinking "what am I gonna use this for??" in high
         | school maths, physics etc, and I think this is a fun,
         | approachable and interesting way to see real life impact of
         | maybe otherwise dry and abstract stuff.
        
         | dan_can_code wrote:
         | Hi Andrew, thank you for sharing this. I am inspired by the
         | book and the reasons why you wrote it. This will definitely
         | help with not only trying to use your book as some form of
         | self-help, but to remind myself that when you put your mind to
         | something, you can achieve it. This is something I have been
         | struggling with for the last year and for some reason your book
         | and this comment has been a motivating factor. Thank you.
        
           | andrewnc wrote:
           | Thank you for your kind words :)
           | 
           | Motivation is such a fickle beast. For whatever reason, I
           | felt good during the entire writing process. Going back and
           | forth with the editor was definitely more challenging.
           | 
           | One thing that helped, actually, was I tweeted an artificial
           | deadline when I started writing. That was immensely
           | motivating for me. I ended up missing the deadline (again
           | because of editing woes) but that was key in helping me push
           | this over the finish line.
           | 
           | Again, I'm glad this was motivating for you and I wish you
           | the best. Feel free to message me on twitter if you ever
           | wanted to chat more about projects you're working on.
        
         | rememberlenny wrote:
         | Congrats!
         | 
         | This book resonates with me.
         | 
         | This reminds me of the "data science" book was from Data Smart,
         | from the Mailchimp CDO, talking how to keep orange juice
         | tasting the same all year round (using a seasonal fruit) or
         | calculating the likelihood of a consumer being pregnant - all
         | within Excel.
        
       | NearAP wrote:
       | It looks interesting. Since you say the book is for folks 'as
       | untechnical as my Mom', I'll get a copy for my young niece that
       | is trying to get into programming.
       | 
       | You have a minor typo on the page - 'enthrawled' should be
       | 'enthralled'
        
         | e12e wrote:
         | > You have a minor typo on the page - 'enthrawled' should be
         | 'enthralled'
         | 
         | Sure that it's not "entrawled in data", like someone entangled
         | in a swift web of information? ;)
        
         | andrewnc wrote:
         | Nice catch, thank you!
         | 
         | I hope your niece likes it, I would honestly love to know how
         | she handles it. I bet she'll get a lot of the ideas, and might
         | have to skip over some of the nitty-gritty details.
        
       | simonbarker87 wrote:
       | This looks great but.... I'm not going to buy the paperback and I
       | hate reading extended stuff on tablet/phone/laptop - I read books
       | on kindle, kind to my eyes, distraction free, lightweight and
       | comfortable.
       | 
       | Anything plans that address this?
        
         | andrewnc wrote:
         | I tried a number of hours to get a kindle version. But since I
         | wrote the whole book in Latex and every converter from PDF ->
         | Mobi/Epub mangled the book, I wasn't able to get a kindle
         | version.
         | 
         | I settled with the PDF as a soft copy instead (and I put PDFs
         | on my kindle, even though that's obviously not desirable).
         | 
         | If there is a nice way to get PDFs to read natively on kindle,
         | I would jump on that, but I wasn't able to get it to work.
        
           | simonbarker87 wrote:
           | Ok, that's a shame, I wrote my thesis in Latex so am familiar
           | with its ... issues.
           | 
           | The only alternative I can think of is copy the paragraphs to
           | html p tags and then any equations written in latex could be
           | converted to images with LatexIt! (if that's still a thing)
           | 
           | That all assume it's enough if a deal for you, understand if
           | it's not and thanks for the reply
        
           | another-dave wrote:
           | Ah that's a shame to hear -- the book sounds really cool (as
           | a layman with an interest) but I too was hoping for a Kindle
           | edition. May pick up the paperback if I can make some room!
           | -- now operating a strict one in, one out policy for physical
           | books :)
        
           | dewy wrote:
           | Can you convert Latex to Word (or ODT, etc.), perhaps with
           | Pandoc, and then from that to epub (e.g. Google Docs allows
           | you to download a Doc as an epub)?
           | 
           | Book looks interesting, btw!
        
             | andrewnc wrote:
             | Thank you!
             | 
             | Yes, in theory.. but in practice it didn't work when I
             | tried that route. Because I use the Tufte-Book template,
             | there is a constant margin to the side of the main content
             | that holds figures, equations, and such.
             | 
             | This margin gets brutalized by every program (even the
             | propriety ones Amazon built).
             | 
             | Lessons learned. I need to be more mindful of the various
             | conversion processes in the future.
        
               | ghaff wrote:
               | My conclusion with a self-published book a number of
               | years ago was that, once you get beyond flowing text,
               | creating a Kindle version gets a lot harder. It can
               | clearly be done--e.g. there are Kindle format guidebooks
               | that work well. But when I did another self-published
               | book a few years later, I was giving it away anyway so I
               | just did a PDF version for tablets. (My last one is
               | through a publisher but it's also mostly text.)
        
               | e12e wrote:
               | > I use the Tufte-Book template
               | 
               | This?
               | 
               | https://github.com/Tufte-LaTeX/tufte-latex
               | 
               | I've often been disappointed at the type of html it's
               | possible to force out of (La)TeX. And epubs are pretty
               | much html, css and images.
               | 
               | I generally think that some kind of markdown with the
               | help of pandoc is the happy path for pleasant
               | writing/editing and good output for html/epub and
               | print/pdf.
               | 
               | I did find this, that have some hints on how to get xml,
               | and then xhtml with support for equations - but it looks
               | cumbersome:
               | 
               | https://tex.stackexchange.com/questions/1551/use-latex-
               | to-pr...
               | 
               | See also: https://github.com/duzyn/tufte-markdown And in
               | particular (beautiful!):
               | https://edwardtufte.github.io/tufte-css/
        
               | e12e wrote:
               | You know, the book looks interesting - but not in pdf,
               | and not as a physical book (I try to live without
               | physical books - won't I be a sad case when the
               | singularity pushes us into post-apocalypse?)
               | 
               | Did you cconsider selling access to the source (eg:
               | private github repo, suitable license)?
        
       | michaericalribo wrote:
       | Nice! Real-world problems like these are what spurred my interest
       | in statistics/data science, and I agree with your sentiment that
       | intro stats books are a _terrible_ way to cultivate enthusiasm
       | and curiosity about ways to solve problems using data.
       | 
       | In my experience, the most difficult parts of the process are (1)
       | translating from qualitative problems "in the field" to a
       | formalized technical problem, and (2) all the wrangling necessary
       | to implement the formal problem using field-collected data.
       | 
       | I find that I spend most of my time as a data scientist working
       | on these parts of the process, and often don't have the bandwidth
       | or even requirement for more advanced methods. Not that fancy
       | techniques are the goal, per se, but I do notice I rarely have
       | the opportunity to use them.
        
       | rootzmac wrote:
       | I added the book to Goodreads and forgot to add the cover! Ugh,
       | and you can't edit a book unless you have "librarian status",
       | whatever that is. But here's the link if anyone's interested:
       | https://www.goodreads.com/book/show/57197718-everyday-data-s....
        
         | andrewnc wrote:
         | whoa cool! Thank you for doing this. It looks like the cover
         | has been added by some kind Librarian.
         | 
         | I've submitting a petition to "claim" the book as an author.
        
           | rootzmac wrote:
           | Awesome! I just added it to my TBR!
        
       | bradford wrote:
       | I'd love to see a few sample pages from the book (in addition to
       | the snippets that were on the page you linked to).
        
         | andrewnc wrote:
         | Sure! Here are two pages in context.
         | 
         | https://andrewnc.github.io/preview/page_20.png
         | https://andrewnc.github.io/preview/page_94.png
         | 
         | That should give you an idea of the style of writing etc :)
        
           | MitchProbst wrote:
           | Hahaha I loved the humor displayed on page 20. Looks like
           | you're sticking well to your idea of trying to make the book
           | enjoyable and easy to read!
        
           | ornornor wrote:
           | That's my personal opinion but I prefer when line spacing
           | isn't so massive to stretch the page count. I'd rather have
           | smaller line spacing with less pages overall than thinking
           | I'm getting a 120 pages book only to find out each page only
           | has ten lines.
        
             | ajkjk wrote:
             | Yeah agreed. That spacing looks like a college essay, ie,
             | basically unreadable.
        
       | xenihn wrote:
       | I'm probably going to buy the PDF. I think the paperback is too
       | expensive for a book that is only 114 pages.
        
       | ornornor wrote:
       | Do you sell an epub version? I'd rather read this on my kobo but
       | PDFs don't do well on it, and I'd rather not feed Amazon for the
       | hard copy.
        
         | andrewnc wrote:
         | Unfortunately not :/
         | 
         | I tried for a while to get the epub / mobi version but the PDF
         | was always mangled by every conversion technique I tried.
         | 
         | I did just get a nice tip from a HN user about a piece of
         | software that might work. I'm going to give it a try and maybe
         | I'll have an epub one down the road.
         | 
         | But as of now, there isn't an epub version.
        
           | pronoiac wrote:
           | Have you looked at pandoc?
           | 
           | I've been working on converting an old AI book into cleaner
           | markdown for a while, and the biggest issue there has been
           | the lack of a clean source. I'd be surprised if there isn't a
           | way to manage this conversion.
        
           | anotherevan wrote:
           | That's a shame. No epub version was the very first thing I
           | noticed, and searched this HN page on before even reading the
           | comments.
        
       | datadawg wrote:
       | I've been looking for a book that balances technical rigor with
       | real-world application, and more importantly, is fun to read.
       | This looks promising - adding it to my reading list!
        
       ___________________________________________________________________
       (page generated 2021-02-24 23:00 UTC)