[HN Gopher] Colab Pro
       ___________________________________________________________________
        
       Colab Pro
        
       Author : rahidz
       Score  : 42 points
       Date   : 2020-02-08 09:42 UTC (13 hours ago)
        
 (HTM) web link (colab.research.google.com)
 (TXT) w3m dump (colab.research.google.com)
        
       | zapf wrote:
       | There's so much data in this universe, people don't know what to
       | do with it. When people don't know what to do, an industry grows
       | to let them "feel" they are doing something useful.
        
       | fibrennan wrote:
       | Just wanted to share a Colab alternative I work on called
       | Gradient[0] (also includes a free GPU).
       | 
       | Some of the key differences:
       | 
       | - Faster storage. Colab uses Google Drive which is convenient to
       | use but very slow. For example, training datasets often contain a
       | large amount of small files (eg 50k images in the sample
       | TensorFlow and PyTorch datasets). Colab will start to crawl when
       | it tries to ingest these files which is a really standard
       | workflow for ML/DL. It's great for toy projects eg training MNIST
       | but not for training more interesting models that are popular in
       | the research/professional communities today.
       | 
       | - Notebooks are fully persistent. With Colab, you need to re-
       | install everything every time you start your Notebook.
       | 
       | - Colab instances can be shutdown (preempted) in the middle of a
       | session leading to potential loss of work. Gradient will
       | guarantee the entire session.
       | 
       | - Gradient offers the ability to add more storage and higher-end
       | dedicated GPUs from the same environment. If you want to train a
       | more sophisticated model that requires say a day or two of
       | training and maybe a 1TB dataset, that's all possible. You could
       | even use the 1-click deploy option to make your model available
       | as an API endpoint. The free GPU tier is just an entrypoint into
       | a full production-ready ML pipeline. With Colab, you would need
       | to take your model somewhere else to accomplish these more
       | advanced tasks.
       | 
       | - A large repository of ML templates that include all the major
       | frameworks eg the obvious TensorFlow and PyTorch but also MXNet,
       | Chainer, CNTK, etc. Gradient also includes a public datasets
       | repository with a growing list of common datasets freely
       | available to use in your projects.
       | 
       | Those are the main pieces but happy to elaborate on any of this
       | or other questions!
       | 
       | [0] https://gradient.paperspace.com
        
       | bhl wrote:
       | I wonder who made the decision to spin this out into a commercial
       | product; maybe it has to do with Google's push into the cloud
       | further? I always thought Colab was just an experimental tool;
       | it's still under the research.google domain.
        
       | jonbaer wrote:
       | I wish they would connect Colab under https://script.google.com
       | so you can run a notebook at interval times, something akin to
       | what https://github.com/TensorTom/colabctl does.
        
       | ccarpenterg wrote:
       | I've been using Colab for over a year now. I train deep learning
       | models on NLP and medical imaging datasets.
       | 
       | It's a great tool and it lets you focus on the code and the
       | models, instead of the hardware and OS. But $9.99/month is a
       | little expensive for my taste.
       | 
       | You can't customize it and if they change something you have to
       | install software by hand sometimes. It should be $1.99/month,
       | that's the kind of price I'd pay for this basic cloud computing
       | service.
        
         | bhl wrote:
         | Curious to how you've been training with Colab. I tried running
         | 50 epochs on a UNet3D model, each taking about 20 minutes to
         | run; it was a pain in the ass because the session kept
         | disconnecting.
        
       | ludwigschubert wrote:
       | Can anyone see a reason why they wouldn't just allow you to
       | provision (and pay for) a persistent Google Cloud VM instead? (I
       | currently do that manually and need port forwarding to a machine
       | that runs Jupyter.)
       | 
       | It's hard for me to understand why Colab would build such a vague
       | pro tier instead of the simplest possible solution: let me pay
       | for my compute.
       | 
       | There's so much more potential, too; they could offer whole
       | clusters on demand, with really simple Python integrations say
       | using dask, or ray.
        
         | ianhowson wrote:
         | $10/mo doesn't pay for much compute in Google Cloud. Colab Pro
         | is offering GPUs and high memory instances, presumably under
         | the assumption that interactive users will be mostly idle.
        
         | minimaxir wrote:
         | The Deep Learning VM (available as a native option when
         | creating a new GCE VM) has a bunch of DL tools, and starts a
         | JupyterLab instance on launch: https://cloud.google.com/deep-
         | learning-vm
         | 
         | There is an easy native command for port forwarding to Jupyter:
         | https://cloud.google.com/ai-platform/deep-learning-vm/docs/j...
        
         | leoh wrote:
         | Because they're trying to introduce a different paradigm for
         | computing. Notebooks are portable, purpose-oriented, and allow
         | Google to make decisions about the underlying implementation
         | allowing for a $10 monthly price point and an incredible level
         | of performance not unlike what they've done for BigQuery.
        
         | TaylorAlexander wrote:
         | Perhaps this is more profitable. I've heard that Colab has
         | become very popular for people doing certain kinds of deep
         | learning, but that the wait for GPU instances has been
         | frustrating. It seems a paid tier to get priority is directly
         | targeting those users.
        
           | dzhiurgis wrote:
           | I've recently used Colab to run some pretty cool StyleGAN
           | notebooks. Finally I can replicate so many cool projects
           | without bending my head how to setup gcp, virtualbox or
           | install tensorflow into geforce macbook.
           | 
           | Doubt I'll pickup deep learning as a profession by this, but
           | it's a step forward.
        
         | sebasmurphy wrote:
         | You could alternatively use the google AI Platform Notebooks
         | which are basically the same thing and billed per hour. You can
         | now Prebake a docker images with all of the libraries that you
         | need.
        
       | minimaxir wrote:
       | A preemptible P100 + VM on Google Compute Engine is about
       | ~$0.45/hr, so to exceed that value with Colaboratory Pro
       | (ignoring conveience factors) you'd need to train for more than
       | 22 hours in a month. Which, for deep learning, is not too
       | unreasonable.
       | 
       | Reading between the lines of both the signup page and up-to-date
       | FAQ, it seems like the free TPU in Colab notebooks will be
       | depreciated, which isn't too surprising.
        
         | ampdepolymerase wrote:
         | To clarify, does that mean you can continuously train a model
         | for e.g. hundred of hours or are you still limited by the 24
         | hours notebook limit?
        
           | anidh wrote:
           | The page said "Longer running notebooks and fewer idle
           | timeouts mean you disconnect less often." I'm guessing there
           | is a limit but it has been increased for the pro customers.
           | In contrast on GCE you can train for 22 hours if you want.
        
       | freediver wrote:
       | Good idea, but it's the first premium product that I've seen
       | where the pitch is 'you _may_ get certain features if you
       | subscribe '. In another words there is no guarantee and a premium
       | subscriber may still end up with same GPU as a free user. You may
       | end up with a high-end V100 (not available to free) might be a
       | better pitch.
        
         | Confiks wrote:
         | A fair use policy, which you seem to be referring to [1], is
         | pretty standard fare for many 'invididual user' products to
         | exclude heavy-use groups.
         | 
         | However, they could at least define expected and minimum
         | capacity. They might omit it because the business in this -
         | aside from capturing users in their ecosystem - is arbitraging
         | wholesale GPU price against consumer monthly needs, along with
         | scaling the free tier.
         | 
         | [1] "Why aren't resources guaranteed in Colab Pro?" on
         | https://colab.research.google.com/signup
        
           | londons_explore wrote:
           | I would be happy with an explanation of "none of our service
           | is guaranteed, but if we don't manage to give you your chosen
           | GPU more than 80 percent of the time, you are free to cancel,
           | and we will refund your final months subscription payment".
        
       | mark_l_watson wrote:
       | I am tempted to sign up. Colab is very usable on Safari for
       | iOS/iPad.
       | 
       | I invested 18 months ago in a GPU setup for home. Really
       | convenient but I somewhat regret the purchase. I used to spin up
       | GCP GPU instanced when needed and that was not convenient. Colab
       | is very convenient.
       | 
       | $10/month for better GPUs and longer sessions seems like a good
       | deal.
        
       | fulafel wrote:
       | This seems to be a hosted Jupyter service, right mybinder, is
       | that right?
        
         | LegitShady wrote:
         | Google Colab is a hosted jupyter notebooks service and is free.
         | It includes the use of a Tesla k80. Runtimes reset after 12
         | hours
         | 
         | This is the pro/paid offering with fewer limitations and better
         | resources.
        
       | iddan wrote:
       | Colab is the best notebook I've ever used. It is a real game-
       | changer and I can totally understand why people who use daily
       | would pay for it.
        
         | bitxbit wrote:
         | We should really thank Mathematica for the notebook format.
        
           | anidh wrote:
           | Did mathematica invent the notebook format? Just curious..
        
             | acidburnNSA wrote:
             | Looks like a yes according to this page at least:
             | https://en.wikipedia.org/wiki/Notebook_interface
        
           | [deleted]
        
           | williamstein wrote:
           | Mathematica's notebook definitely strongly inspired Colab's
           | notebook. Colab is an implementation of the Jupyter notebook
           | format and UI. Jupyter, which launched around 2011, itself
           | was strongly inspired by (1) the IPython console from around
           | 2003, and (2) the Sage Notebook which I launched around 2006.
           | 
           | I can tell you definitively that Sage Notebook is very
           | Mathematica inspired. The IPython console looked a lot like
           | Mathematica, mainly because Fernando Perez (who was a
           | physicist) had used Mathematica a lot and wanted something
           | similar but (much) better. In 2005 there was a project to
           | make an IPython notebook interface as an OS X graphical
           | application, which got demoed at Sage Day 1 (in Feb 2006).
           | That motivated me to get interested in doing something
           | similar, but using Javascript and HTML instead. I hired Alex
           | Clemesha, who just finished his physics undergrad and was a
           | _heavy_ Mathematica user to work on Sage fulltime. He did a
           | lot of work with me during 2006 to create a web-based
           | notebook interface (and also to provide a mathematica-like
           | graphics compatibility layer for Python, which is in Sage).
           | The Sage notebook felt pretty similar in 2007 to what Jupyter
           | notebook feels like, and it definitely inspired the UI. We
           | developed Sage notebook heavily and then all sort of lost
           | interest and moved on to other things (e.g., Jason Grout, who
           | was involved a lot with the Sage notebook went to work at
           | Bloomberg, where he did a massive amount of work on
           | JupyterLab). Fortunately, Fernando Perez and others got
           | incredible grant support and many fantastic engineers
           | together built the Jupyter notebook. Jupyter notebook
           | provided the same sort of cell /output UI as we had with the
           | Sage notebook, but was much more general purpose (many
           | kernels) and used more "modern" implementation techniques, by
           | 2011 standards at least.
           | 
           | There's a lot of amazing things about the Mathematica
           | notebook that we never even tried to implement. For example,
           | Mathematica has a much more sophisticated nested structure.
           | Also, by default Mathematica shares one kernel across
           | multiple notebooks (or at least it did last time I tried it).
           | 
           | Just to finish the story, in 2013 I started CoCalc to make a
           | fully realtime collaborative notebook interface. Around the
           | same time, many other people started another project called
           | JupyterLab that reimplemented a Jupyter notebook client using
           | much more powerful modern approaches. In addition, there's a
           | lot more going on regarding notebook clients these days,
           | including Nteract, Kaggle kernels, and http://deepnote.com/.
           | Some people like me who work on these surely played around a
           | lot with Mathematica notebooks when they were kids :-).
        
       | dankle wrote:
       | > For now, Colab Pro is only available in the US.
        
       | bitxbit wrote:
       | This is so much better than buying your own hardwares.
        
       ___________________________________________________________________
       (page generated 2020-02-08 23:00 UTC)