[HN Gopher] ClickHouse, Inc.
       ___________________________________________________________________
        
       ClickHouse, Inc.
        
       Author : zX41ZdbW
       Score  : 318 points
       Date   : 2021-09-20 16:13 UTC (6 hours ago)
        
 (HTM) web link (github.com)
 (TXT) w3m dump (github.com)
        
       | whitepoplar wrote:
       | Haven't used any of these yet, but how does ClickHouse compare to
       | Postgres extensions like TimescaleDB and Citus (which recently
       | launched a columnar feature)? I remember reading in the
       | ClickHouse docs some time ago that it does not have DELETE
       | functionality. Does this pose any problems with GDPR and data
       | deletion requests?
        
         | zX41ZdbW wrote:
         | There are many independent comparisons of ClickHouse vs
         | TimescaleDB:
         | 
         | By Splitbee:
         | https://github.com/ClickHouse/ClickHouse/issues/22398#issuec...
         | By GitLab:
         | https://github.com/ClickHouse/ClickHouse/issues/22398#issuec...
         | And others:
         | https://github.com/ClickHouse/ClickHouse/issues/22398#issuec...
         | https://github.com/ClickHouse/ClickHouse/issues/22398#issuec...
         | 
         | If you'll find more, please post it there.
         | 
         | TimescaleDB can work pretty fine in time series scenario but
         | does not shine on analytical queries. For most of time series
         | queries, it is below ClickHouse in terms of performance but for
         | small (point) queries it can be better.
         | 
         | The main advantage of TimescaleDB is that it better integrates
         | with Postgres (for obvious reasons).
         | 
         | There are also many comparisons of ClickHouse vs Citus. The
         | most notable is here: https://blog.cloudflare.com/http-
         | analytics-for-6m-requests-p...
         | 
         | ClickHouse can do batch DELETE operations for data cleanup.
         | https://clickhouse.com/docs/en/sql-reference/statements/alte...
         | It is not for frequent single-record deletions, but it can
         | fulfill the needs for data cleanup, retention, GDPR
         | requirements.
         | 
         | Also you can tune TTL rules in ClickHouse, per table or per
         | columns (say, replace all IP addresses to zero after three
         | months).
        
           | ryanbooz wrote:
           | [Timescale DevRel here]
           | 
           | @zX41ZdbW@ - Thanks for pointing out the various benchmarks
           | that have been run by other companies between Clickhouse and
           | TimescaleDB using TSBS[1]. As we mentioned, we'll dig deeper
           | into a similar benchmark with much more detail than any of
           | those examples in an upcoming blog post.
           | 
           | One notable omission on all of the benchmarks that we've seen
           | is that none of them enable TimescaleDB compression (which
           | also transforms row-oriented data into a columnar-type
           | format). In our detailed benchmarking, queries on compressed
           | columnar data in Timescale outperformed Clickhouse in most
           | queries, particularly as cardinality increases, often by 5x
           | or more. And with compression of 90% or more, storage is
           | often comparable. (Again, blog post coming soon - we are just
           | making sure our results are accurate before rushing to
           | publish.)
           | 
           | The beauty of TimescaleDB columnar compression model is that
           | it allows the user to decide when their workload can benefit
           | from deep/narrow queries of data that doesn't change often
           | (although it can still be modified just like regular row
           | data), verses shallow/wide queries for things like inserting
           | data and near-time queries.
           | 
           | It's a hybrid model that provides a lot of flexibility for
           | users AND significantly improves the performance of
           | historical queries. So yes, we do agree that columnar storage
           | is a huge performance win for many types of queries.
           | 
           | And of course, with TimescaleDB, one also gets all of the
           | benefits of PostgreSQL and its vibrant ecosystem.
           | 
           | Can't wait to share the details in the coming weeks!
           | 
           | [1]: https://github.com/timescale/tsbs
        
             | xdanger wrote:
             | _(although it can still be modified just like regular row
             | data)_
             | 
             | But it can't be updated or deleted, so what do you mean by
             | this?
        
             | stavros wrote:
             | I have a related question, in case anyone knows: We want to
             | store typical analytics data _somewhere_ (currently in
             | BigQuery) to analyze with Looker. Things like  "CI run
             | started", "CI run finished" and then calculate analytics
             | over average CI runtimes.
             | 
             | Which database would be a good fit for this? There isn't
             | too much data, maybe tens of thousands of rows eventually.
             | Would Timescale be a good fit? I'd prefer that, due to
             | existing familiarity with Postgres, but if ClickHouse is
             | better, that's good too.
        
             | zX41ZdbW wrote:
             | Thank you! Looking forward for a blog post. We need more
             | references for comparison to optimize ClickHouse
             | performance.
        
         | lima wrote:
         | > I remember reading in the ClickHouse docs some time ago that
         | it does not have DELETE functionality. Does this pose any
         | problems with GDPR and data deletion requests?
         | 
         | Clickhouse has ALTER ... DELETE and ALTER ... UPDATE
         | functionality now! (and TTLs)
        
         | didip wrote:
         | ClickHouse competes with OLAP storages like Druid or Pinot.
         | 
         | I don't know about ClickHouse but the other 2 uses bitmap
         | indexes to make storing petabytes of data affordable.
         | 
         | Row oriented databases would struggle to compete against
         | ClickHouse. They are easily an order of magnitude slower.
        
           | hodgesrm wrote:
           | ClickHouse uses skip indexes. They basically answer the
           | question "is the value I'm seeking not in the block."
           | 
           | For example, there are a couple varieties of Bloom filters,
           | which allow you to test for presence of string sequences in
           | blocks. This allows ClickHouse to skip reading and
           | uncompressing blocks (actually called granules)
           | unnecessarily.
        
         | ericb wrote:
         | ClickHouse wins on licensing--Apache.
         | 
         | The TimeScale licensing approach, the way it is written,
         | perhaps accidentally, has lots of hidden landmines. The
         | TimeScale license slants toward cloud giant defense to the
         | extent that normal use is perilous.
         | 
         | For example, timescale can be used for normal data (postgres)
         | as well, so any rules seem to apply to all your data in the
         | database. The free license only usable available if:
         | 
         | the customer is prohibited, either contractually or
         | technically, from defining, redefining, or modifying the
         | database schema or other structural aspects of database
         | objects, such as through use of the Timescale Data Definition
         | Interfaces, in a Timescale Database utilized by such Value
         | Added Products or Services.
         | 
         | My read is that if you let a customer do anything that adds a
         | custom field, or table, or database, or trigger, or anything
         | that is "structural" (even in the regular relational stuff)
         | anywhere in your database (metrics or not), you are in
         | violation. There doesn't seem to be a distinction about whether
         | this is "direct" control or not, or whether a setting
         | indirectly adds a trigger. I don't want to be in a courtroom
         | debating whether a new metric is a "structural change!"
         | 
         | Now, none of that might be the _intent_ of the license, but you
         | have to go by what it says, not intentions.
         | 
         | The sad part of that is, I, and I'm sure many folks, have no
         | interest in starting a database company, but we can't rally
         | timescale because of legal risk. Looks awesome otherwise,
         | though.
        
           | akulkarni wrote:
           | [Timescale co-founder here]
           | 
           | Hi Eric, thanks for taking a close look at our license.
           | 
           | I'd like to dispel some misconceptions:
           | 
           | The core of TimescaleDB is Apache2. Advanced features are
           | under the Timescale License.
           | 
           | Regarding this:                 the customer is prohibited,
           | either contractually or technically, from defining,
           | redefining, or modifying the database schema or other
           | structural aspects of database objects, such as through use
           | of the Timescale Data Definition Interfaces, in a Timescale
           | Database utilized by such Value Added Products or Services.
           | My read is that if you let a customer do anything that adds a
           | custom field, or table, or database, or trigger, or anything
           | that is "structural" (even in the regular relational stuff)
           | anywhere in your database (metrics or not), you are in
           | violation. There doesn't seem to be a distinction about
           | whether this is "direct" control or not, or whether a setting
           | indirectly adds a trigger. I don't want to be in a courtroom
           | debating whether a new metric is a "structural change!"
           | 
           | That's not correct, and we took pains to clarify that in the
           | license:                 3.5 "Timescale Data Definition
           | Interfaces" means SQL commands and other interfaces of the
           | Timescale Software that can be used to define or modify the
           | database schema and other structural aspects of database
           | objects in a Timescale Database, including Data Definition
           | Language (DDL) commands such as CREATE, DROP, ALTER,
           | TRUNCATE, COMMENT, and RENAME. [0]
           | 
           | Strictly speaking, if you provide Data Definition Interfaces
           | (DDL) to customers via a SaaS service (ie you are running a
           | TimescaleDBaaS - which applies to < 0.000001% of all possible
           | users) you are in violation of the license. But otherwise you
           | are fine.
           | 
           | If you are looking for more votes of confidence, today there
           | are literally millions of active TimescaleDB instances,
           | including by large companies like Walmart, Comcast, IBM,
           | Cisco, Electronic Arts, Bosch, Samsung, and many many smaller
           | ones. [2]
           | 
           | If you have any other questions, I'm happy to answer them
           | here, or offline (ajay at timescale dot com).
           | 
           | [1] https://www.timescale.com/legal/licenses#section-3-5-time
           | sca...
           | 
           | [2] https://www.timescale.com/
        
             | ericb wrote:
             | Are you sure?
             | 
             | The phrasing "such as" in " _such as_ through use of the
             | Timescale Data Definition Interfaces " looks to me like it
             | can be interpreted as saying "Including but not limited to"
        
               | akulkarni wrote:
               | Yes :-)
               | 
               | It was a little cumbersome to list every DDL SQL command,
               | which why it uses that language. But that is the intent.
               | 
               | If you have a specific question, happy to answer it here
               | (or offline)
               | 
               | Also, we used this language deliberately to provide more
               | clarity. DDL vs DML is a pretty clear line to most
               | developers who use TimescaleDB (vs some other companies
               | who use language like, "you can't compete with us" etc).
        
               | ericb wrote:
               | Right, I'm onboard with your stated intent, and the
               | lovely database, and even sympathetic on the cloud
               | provider threat.
               | 
               | My interpretation doesn't hinge on the DDL/DML question.
               | If what you said is your intent, the legal language used
               | is wrong and you should fix it. Consider this a bug
               | report. Here's a source!
               | 
               | https://www.law.cornell.edu/definitions/uscode.php?width=
               | 840...
               | 
               | In order for the license to be usable, you need to be
               | _limitative_ here.
        
               | akulkarni wrote:
               | Thanks for the feedback, will pass along :-)
        
               | mst wrote:
               | The issue isn't the DDL vs. DML.
               | 
               | The issue is the 'such as', which reads to me as
               | indicating that providing an API endpoint that can add
               | fields would also be a way of letting the customer modify
               | the database schema and therefore covered.
               | 
               | Which means that building a SaaS app backed by timescale
               | that has any level of customisation exposed to the user
               | appears to be prohibited.
               | 
               | This seems a rather stronger level of prohibition than
               | stopping people directly competing with you, and would
               | suggest that if it's intended it would help to make it
               | more explicit, and if it isn't then an explicit statement
               | of that would be worth adding.
        
               | akulkarni wrote:
               | Practically speaking, there are already 1000s+ SaaS apps
               | built on TimescaleDB (and using the Timescale License
               | features)
               | 
               | But I appreciate the feedback on how we could make our
               | language clearer. Will share with the team!
        
             | shadowwolf007 wrote:
             | So this is a very relevant question and I have been trying
             | to figure out if I need to migrate off timescaledb asap
             | coincidentally over the last 2 weeks (We're pre-production
             | anyway, so it's the time to do so!). Doing so has been
             | super low priority, but since you're here .... :)
             | 
             | If I have a table that records timeseries data and then
             | another table that has a customer-provided extensible set
             | of metadata where a customer can define columns and other
             | related tabular data, would that violate the license? The
             | customer doesn't have a direct, like, psql level of access
             | but the API intentionally provides a very similar level of
             | interaction.
             | 
             | Does this qualify as providing Data Definition Interfaces?
             | If none of those additional columns and such appear on
             | tables set up as timescale tables does that make any
             | difference?
        
               | mfreed wrote:
               | One clarification to the above discussion -- about
               | whether these restrictions also get applied to "even the
               | regular relational stuff" -- which I think is relevant to
               | you (and parent):
               | 
               | The Timescale License only covers the TimescaleDB code.
               | Postgres code continues to be covered by the OSS
               | PostgreSQL License [0].
               | 
               | So putting aside the question about API vs. psql level
               | (again, the Timescale License was drafted to enabled this
               | for "Value Added Services", e.g., where "such value-added
               | products or services are not primarily database storage
               | or operations products or services"), this license
               | wouldn't apply for non-Timescale code.
               | 
               | [Timescale co-founder here]
               | 
               | [0] https://github.com/timescale/timescaledb/blob/master/
               | NOTICE
        
               | shadowwolf007 wrote:
               | Ok thank you!
               | 
               | I appreciate from your POV this probably is silly but for
               | us it's very helpful to have explicit clarity since
               | investors are questioning and demand certainty. I once
               | had to rewrite https handshakes because a lawyer thought
               | export compliance laws would be violated, so hopefully
               | you understand my trepidation :-)
        
               | ericb wrote:
               | Not the parent, but great to hear that clarification! By
               | the way, doubt I would complain if Prometheus didn't look
               | impressive. :-)
               | 
               | One other bug report on the license language front, this
               | language could be construed to prohibit uses like
               | Prometheus--unless there's a definition of operations
               | products I missed (possible).
               | 
               | > are not primarily database storage or _operations
               | products_
               | 
               | Suggested edit:
               | 
               | > are not primarily database storage or * _database
               | operations products*_
        
           | goodpoint wrote:
           | > ClickHouse wins on licensing--Apache
           | 
           | How so? An end user should prefer a database under a license
           | that protect the developer and users from
           | cloudification/proprietization/SaaS
        
             | goodpoint wrote:
             | On top of that, Yandex requires a very aggressive CLA to be
             | signed by contributors:
             | https://yandex.ru/legal/cla/?lang=en
             | 
             | This worries me and makes me wonder if they are going for
             | the open-source-only-by-name model.
             | 
             | [Please reply instead of giving silent dowvotes.]
        
               | zX41ZdbW wrote:
               | We don't require Yandex CLA:
               | 
               | > As an alternative, you can provide DCO instead of CLA.
               | You can find the text of DCO here:
               | https://developercertificate.org/ It is enough to read
               | and copy it verbatim to your pull request.
               | 
               | > If you don't agree with the CLA and don't want to
               | provide DCO, you still can open a pull request to provide
               | your contributions.
               | 
               | https://github.com/ClickHouse/ClickHouse/blob/master/CONT
               | RIB...
               | 
               | Anyway, Yandex CLA will be removed in the upcoming days
               | (it should be already removed).
        
         | ryanbooz wrote:
         | (Timescale DevRel here)
         | 
         | We've recently been working through a detailed benchmark of
         | TimescaleDB and Clickhouse. The DELETE/UPDATE question has been
         | an intriguing story to follow - and I honestly hadn't
         | considered the GDPR angle.
         | 
         | ATM, Clickhouse is still OLAP focused and their MergeTree
         | implementation does not allow direct DELETE (or UPDATE) of any
         | data. All DELETE/UPDATE requests are applied asynchronously by
         | (essentially) re-writing/merging the table data (it's referred
         | to as a "mutation") without whatever data was referenced in the
         | DELETE/UPDATE. [1]
         | 
         | [1]: https://clickhouse.com/docs/en/sql-
         | reference/statements/alte...
        
           | kwillets wrote:
           | This question is becoming critical right now, as
           | nonrecoverable deletes are required within 30 days for both
           | GDPR and CCPA.
           | 
           | Most products do the asynchronous rewrite, especially if
           | they're based on immutable storage. That's fine, but it
           | should be tested to verify that it's not triggering on every
           | delete, for example, and that it's resource-efficient.
        
           | nezirus wrote:
           | You are correct, the proper way to do deletions in ClickHouse
           | is to use partitions, and drop partitions. That is probably
           | good enough for most analytical use cases, but YMMV.
        
           | hkolk wrote:
           | We are using Clickhouse combined with GDPR's Data Deletion
           | Requests. We store the user-ids in a separate system, and run
           | the ALTER/DELETE statements once per week. Works pretty
           | smooth, though I would prefer some more automation within
           | Clickhouse for them.
           | 
           | Data for in-active users gets deleted because our clickhouse
           | retention policy is lower than the in-active-user timeout
        
         | [deleted]
        
         | stingraycharles wrote:
         | In a nutshell, my extremely subjective and biased take on it:
         | 
         | * Citus has a great clustering story, and a small data
         | warehousing story, afaik no timeseries story;
         | 
         | * TimescaleDB has a great timeseries story, and an average data
         | warehousing story;
         | 
         | * Clickhouse has a great data warehousing story, an average
         | timeseries story, and a bit meh clustering story (YMMV).
         | 
         | (Disclaimer: I work for a competitor)
        
           | akulkarni wrote:
           | [Timescale co-founder]
           | 
           | This is a really great comparison. I might borrow it in the
           | future :-)
           | 
           | But yes, if you have classic OLAP-style queries (e.g.,
           | queries that need to touch every database row), Clickhouse is
           | likely the better option.
           | 
           | For anything time-series related, and/or if you like/love
           | Postgres, that is where TimescaleDB shines. (But please make
           | sure you turn on compression!)
           | 
           | TimescaleDB also has a good clustering story, which is also
           | improving over time. [0][1]
           | 
           | [0] https://news.ycombinator.com/item?id=23272992
           | 
           | [1] https://news.ycombinator.com/item?id=24931994
        
             | mromanuk wrote:
             | I like how you always chime in with time series stuff. Just
             | by watching you interact here, boost my confidence on your
             | product
        
           | zX41ZdbW wrote:
           | > Disclaimer: I work for a competitor
           | 
           | What competitor btw? I tried to open a link from your profile
           | but it does not work.
        
             | arthurcolle wrote:
             | It appears the competitor is QuasarDB
        
               | stingraycharles wrote:
               | Excellent comment archeology, this is correct. :)
        
               | arthurcolle wrote:
               | actually grabbed it from Keybase -> Googled name ->
               | LinkedIn :)
        
         | fiddlerwoaroof wrote:
         | I benchmarked ClickHouse vs. Timescale, Citus, Greenplum and
         | Elasticsearch for a real-time analytics application. With a
         | couple hours learning for each (although I've used Postgres
         | extensively and so Postgres-backed databases had a bit of an
         | advantage), ClickHouse's performance was easily an order of
         | magnitude or two better than anything except ES. ES had its own
         | downsides with respect to the queries we could run (which is
         | why we were leaving ES in the first place).
        
           | someguy101010 wrote:
           | What did you end up going with?
        
             | fiddlerwoaroof wrote:
             | ClickHouse deployed to EKS with Clickhouse operator
        
       | data_ders wrote:
       | I'm betting we'll see a "Clickhouse Cloud" product announcement
       | in the next 12 months. I'm curious to see if they can provide
       | enough add-on value to their open source product to be
       | profitable. But I'm certainly rooting for them!
        
         | ignoramous wrote:
         | There's definitely market for a managed clickhouse 1p product.
         | It remains to be seen if the product is substantial enough to
         | challenge the incumbents. The engineering pedigree is ample. So
         | that's already 50% of the way there. With money in the bank, it
         | is all about how they suit it up with their sales and
         | marketing. Interesting times ahead for them.
        
           | yigitkonur35 wrote:
           | Aiven will offer a managed Clickhouse service too:
           | https://landing.aiven.io/2020-upcoming-aiven-services-
           | webina...
           | 
           | And also Altinity is a trustable partner with a great know-
           | how about Clickhouse internals. They have started to offer
           | managed instances in AWS: https://altinity.com/altinity-
           | cloud-test-drive/
           | 
           | At last, Alibaba Cloud has an option to use:
           | https://www.alibabacloud.com/product/clickhouse
           | 
           | Are there any other ones?
        
           | polskibus wrote:
           | Yandex offers managed clickhouse :
           | https://cloud.yandex.com/en/services/managed-clickhouse
        
             | ignoramous wrote:
             | To be clear, I meant _1p_ as in Confluent - > Kafka; not
             | AWS -> Managed Kafka.
             | 
             | Despite Yandex (who originally built Clickhouse) offering a
             | managed solution, a substantial investment outlay from the
             | VCs does come off as a huge vote of confidence in the
             | founders.
        
         | nemo44x wrote:
         | I'm guessing that's the entire purpose here. Build Snowflake
         | with Clickhouse branding.
        
         | encoderer wrote:
         | The only value they need to add is that you no longer need to
         | run database clusters yourself. I would hope they don't try to
         | add any special paid features.
        
         | petr_tik wrote:
         | worth keeping in mind that Yandex and russian technology
         | companies in general are used to running lean and profitable
         | operations so unfathomed in the land of 0% interest rates and
         | VC money on tap. If they continue as they are now (15 people)
         | and convert customers like Cloudflare into paying engagements,
         | there is nothing stopping them from being profitable.
        
           | pm90 wrote:
           | How exactly do they get to operate that lean? It seems the
           | most straightforward way is by paying employees less.
        
             | hunterb123 wrote:
             | Get out of the valley. One engineer can do a lot. Even in
             | Texas we run slimmer. I'm the sole frontend dev. I created
             | and maintain our iOS, Android, and Web app for a largish
             | tech company.
             | 
             | When you don't have VC money and you HAVE to profit, you
             | really learn optimize the workflow.
        
               | ctvo wrote:
               | > _One engineer can do a lot. Even in Texas we run
               | slimmer. I 'm the sole frontend dev. I created and
               | maintain our iOS, Android, and Web app for a largish tech
               | company._
               | 
               | What do you mean by "do a lot". Can you deliver as
               | quickly as a team? If so, do you work more hours or are
               | you just better? If you're just better, why do you decide
               | to stay with your largish tech company when we're
               | acknowledging SV pays more? A remote role would increase
               | your salary, no?
               | 
               | Breaking this down:
               | 
               | Russia has a great mathematics and engineering education
               | system. Many graduates, unable to leave, take jobs with
               | Russian tech companies. Russian tech companies pay less
               | than US tech companies.
               | 
               | That's why the situation may be as is with ClickHouse.
               | You're not in Russia.
               | 
               | Texas isn't particularly known for running lean. Every
               | Big Tech has a presence in Austin. Dallas is filled with
               | legacy financial companies burning money on IT.
        
               | hunterb123 wrote:
               | Yeah, I use RN to do all the platforms. I spit out
               | features pretty fast. They apply to all platforms since
               | the codebase is 95% shared. I work normal 40 hours, never
               | overtime. I could make more in SV possibly, but I like
               | working where I'm at and I have a big influence (I decide
               | the tech, style guide, some features etc.) Plus I don't
               | like working with a team or too many meetings etc. I
               | already work remote and I make a good salary. Cost of
               | living would be much higher in SV and most companies
               | would cut pay for remote work over there so it would be
               | similar.
        
               | ctvo wrote:
               | > _Cost of living would be much higher in SV and most
               | companies would cut pay for remote work over there so it
               | would be similar._
               | 
               | I would validate this by getting an offer. Even adjusting
               | for location, my guess is it's still significantly higher
               | than what you're making locally. The adjustment isn't,
               | say, you live in SF so you make 400k, you live in Houston
               | you make 150k. It's ~15-20% for most places, at most.
               | 
               | > _Yeah, I use RN to do all the platforms._
               | 
               | I want to point out those bloated teams that aren't lean
               | from SV? They made ReactNative. They made Flutter too if
               | you were thinking of swapping.
        
               | hunterb123 wrote:
               | 400k is for FAANG, I doubt I'd get that nor do I really
               | want to compete for it. Seems like a lot. I'm really
               | happy where I am, even if I'm not making top dollar.
               | Money isn't everything, I want to be comfortable, happy,
               | and stress-free. I also just don't like California, I
               | prefer small towns in Texas.
               | 
               | I know Google and FB made Flutter and RN respectively,
               | and I thank them for that. That doesn't mean other SV
               | companies aren't bloated, FAANG has a lot of money from
               | their spigots (Google & FB have their ad money streams)
               | and live on another level, not VC funding.
               | 
               | Cross platform has come a long way and enables small
               | companies to do a lot more with less. Flutter is nice but
               | has flaws, RN is the sweet spot. I could talk for days
               | about the pros and cons of both.
               | 
               | A previous comment of mine regarding of Flutter vs RN:
               | https://news.ycombinator.com/item?id=28394396
        
             | spyspy wrote:
             | No scrum masters, no in-house chef, few or no dedicated
             | product managers, no middle manager(s) between eng and
             | C-Suite.
        
               | vxNsr wrote:
               | So theoretically really just more work per dev and less
               | compensation. I guess that makes sense.
        
             | DevKoala wrote:
             | Eastern Europe talent probably. They are fantastic and
             | their salaries are cheap thanks to conversion rates.
        
       | wayneftw wrote:
       | Why would my ad blocker (uBlock Origin) be blocking the domain of
       | a "open-source column-oriented database management system"?
       | 
       | I guess this is used heavily in advertising?
        
         | fiddlerwoaroof wrote:
         | https://news.ycombinator.com/item?id=28595996
        
       | mempko wrote:
       | It's great to see this spin off. ClickHouse is fast but certain
       | use cases are not ideal and having it spin off the team can focus
       | more on the community and what they need instead of just the use
       | cases Yandex had. Cheers to the team and good luck!
        
       | anglinb wrote:
       | We're using Clickhouse to power our in-product analytics. It's
       | awesome but would love a managed service b/c it definitely
       | requires a bit of management overhead. Super excited about this
       | announcement!
        
         | hodgesrm wrote:
         | Check out Altinity.Cloud. It's managed and works today.
         | 
         | Disclaimer: I work for Altinity, which operates the
         | Altinity.Cloud service.
        
       | pachico wrote:
       | I wish the best to them. They forged a game changer software.
       | Hopefully, they will also offer a competitive SaaS model
       | (hopefully also at bare-metal based price).
        
       | didibus wrote:
       | > Most other database management systems don't even permit
       | benchmarks (through the infamous "DeWitt clause"). But we don't
       | fear benchmarks; we collect them.
       | 
       | I love the confidence here.
        
       | hkolk wrote:
       | I wonder how this will play out with https://altinity.com who
       | have been doing enterprise support for quite some time..
        
         | hodgesrm wrote:
         | I run Altinity. We think it's great. This is going to help grow
         | adoption which benefits everyone. Watch our blog for a post in
         | a couple hours.
         | 
         | BTW congrats to Alexey on the new company.
        
           | [deleted]
        
           | acidbaseextract wrote:
           | As a sidenote, I saw your talk on Clickhouse to the CMU
           | database group [1] back when and was extremely impressed with
           | your deep technical knowledge yet down-to-earth presentation.
           | Still haven't had an opportunity to use Clickhouse for
           | production work, but would welcome it.
           | 
           | [1] https://www.youtube.com/watch?v=fGG9dApIhDU
        
             | hodgesrm wrote:
             | Thank you!! That was the most fun I've ever had on a tech
             | talk. Any Pavlo is a one man army when it comes to fun
             | questions and there were more like him in the audience. The
             | whole series of quarantine talks was great.
        
           | zX41ZdbW wrote:
           | Thank you! This is an important milestone for ClickHouse and
           | will benefit the entire ecosystem.
        
           | tnolet wrote:
           | We are using Altinity too. Great support up to now. We are
           | about to go live with it. For us (see my bio for company
           | link) having a company manage the cluster was paramount. We
           | just want to use the data and API, not manage the
           | machines/VM's and k8s clustering stuff.
        
             | hodgesrm wrote:
             | Cool! Thank you so much for posting. We get a huge kick
             | when projects go live. (Being a manager has not beaten it
             | out of me.)
        
           | jordanthoms wrote:
           | We recently setup Clickhouse on GKE using the Altinity
           | operator (and signed up for Altinity support).
           | 
           | There's been so many queries where I've thought 'that's going
           | to need a join and aggregation across tens of billions of
           | rows, no way!' - and then Clickhouse spits back a query
           | result in 10 seconds...
        
         | mobileexpert wrote:
         | I think similar to other situations e.g Starburst with
         | Presto/Trino. There really are a limited number of devs pushing
         | a long the core projects and a lot of people needing support.
         | Each start up in the space can likely grow the pie for support
         | and adoption and a few big enterprises will still hire in house
         | devs.
        
       | mobileexpert wrote:
       | Awesome! Really excited for CH.
        
       | PeterZaitsev wrote:
       | Interesting news indeed! I very much wonder what it means long
       | term in terms of Licenses. I would imagine much better future if
       | Clickhouse would become Foundation driven process which gives
       | good protection from license change (through I'm biased here) -
       | Currently Clickhouse fully under Apache 2.0 license may look too
       | good to be true compared to where many successful VC funded
       | projects took licenses of their projects (think Elastc, MongoDB,
       | Redis)
       | 
       | In any case though I expect a lot of growth in Clickhouse
       | community now and investment both engineering and most
       | importantly Marketing - I think Clickhouse technology has a lot
       | more adoption potential than it currently has
        
         | puppet-master wrote:
         | I'd much rather they relicensed early if they can, to set
         | expectations and to ensure talented people actually get to
         | sustain its development, rather than parasitic jobsworth FAANG
         | types who will inevitably drive development at Amazon. Free
         | software in this context is very dead, let's not pretend
         | network and channel effects of AWS were ever envisaged in the
         | 1980s when most "contemporary" free software licensed were
         | designed.
        
       | legg0myegg0 wrote:
       | Does anyone happen to know which country the new company is
       | incorporated in? I'm still looking for a chance to use ClickHouse
       | because it sounds so excellent!
        
       | monstrado wrote:
       | I'm excited for the future of ClickHouse! I'm hopeful that this
       | move will help smooth out the rough-edges of ClickHouse, mainly
       | around clustering.
        
       | ricardobeat wrote:
       | Totally unrelated, but this part of the readme says
       | Yandex N.V. is the largest          internet company in Europe
       | and employs over 14,000 people
       | 
       | I'm quite sure there are larger "internet companies" in the EU
       | such as Booking, Zalando, etc.
        
         | discobot2 wrote:
         | Booking is 5K employees.
        
       | einpoklum wrote:
       | For those who don't know it:
       | 
       | ClickHouse is a columnar, analytic, close-to-a-DBMS, but not a
       | full-fledged one. The "100x-1000x faster" is compared to row
       | stores. Last time I checked it was mostly single-table-oriented.
        
       | ucarion wrote:
       | Might make more sense to link to the blog post, instead of its
       | underlying markdown in GitHub?
       | 
       | https://clickhouse.com/blog/en/2021/clickhouse-inc/
        
         | vxNsr wrote:
         | Most of the audience here blocks ads with either uBlock or
         | something else and they all include that domain in the
         | blacklist.
        
       | truthwhisperer wrote:
       | one questions is did you implement special algorithms to bias in
       | plus or minus against your president? This is a rumour. Just want
       | to know if it is true or false
        
       | chrismorgan wrote:
       | Canonical link: https://clickhouse.com/blog/en/2021/clickhouse-
       | inc/
       | 
       | But I presume the GitHub link
       | (https://github.com/ClickHouse/ClickHouse/blob/master/website...)
       | has been submitted because clickhouse.com is going to be blocked
       | for a large fraction of HN users ( _Peter Lowe's Ad and tracking
       | server list_ , which I think uBlock Origin has enabled by
       | default, includes ||clickhouse.com^). I'm actually a bit curious
       | why clickhouse.com (or more likely a subdomain?) would be being
       | used this way; I'd have thought that they'd separate any such
       | uses to a different domain so as not to hinder their main domain
       | which is about the software and nothing to do with ads or
       | tracking at all (even if that's probably the main end use of such
       | an OLAP DBMS).
        
         | pgl wrote:
         | Someone just reported this to me and I've removed the entry
         | from my blocklist.
         | 
         | This was a very old entry - it was added on Fri, 06 Jun 2003
         | 19:53:00. Back then it was a marketing company that served ads.
         | 
         | I pride myself on knowing the entries in my list very well, but
         | I have to admit I forgot about this one, which is ironic
         | because I use Clickhouse at my job these days.
        
           | chrismorgan wrote:
           | Great! That makes much more sense.
           | 
           | Thank you for maintaining such lists. You and a few others
           | like you save me much time and aggravation.
        
           | Maxburn wrote:
           | Thank you for this service!
        
           | wayneftw wrote:
           | Thanks! This worked instantly just now as I purged cache and
           | updated all lists in uBlock Origin.
           | 
           | I don't look forward to when Chrome enforces Manifest v3 when
           | I'll probably have to wait for a whole extension to be
           | updated instead of just a list file.
        
           | f0e4c2f7 wrote:
           | I'm surprised and impressed that you would remember what's on
           | the list.
           | 
           | Thank you for your hard work. Every day it makes my
           | experience of the internet 100x better.
        
           | sikhnerd wrote:
           | Thank you so much for the hard work you do! It makes the web
           | sooo much more usable.
        
         | newman314 wrote:
         | FWIW, clickhouse.com is also blocked by "Malvertising filter
         | list by Disconnect"
        
           | nacs wrote:
           | It's also blocked on my Pi-Hole install network-wide
           | apparently.
        
       | KitDuncan wrote:
       | Just got started with clickhouse. Super cool software.
        
       | jenny91 wrote:
       | These some really great technology coming out of Russia in the
       | information retrieval/database world: ClickHouse, a bunch of
       | Postgres stuff that Yandex is working on, 2gis.ru (a super
       | detailed vector map on a completely different stack to
       | Google/MapBox), etc.
        
       | nhoughto wrote:
       | We just did months of testing on a bunch of dbs for a time-series
       | workload, and whilst we really liked the story and devs behind
       | clickhouse. The ops burden of not separating storage and compute
       | ended up being a turn off. Good to see it is progressing and will
       | hopefully get more investment, although I wonder what this means
       | for companies like Altinity.
       | 
       | We compared Timescale, Clickhouse, Snowflake and Firebolt. Ended
       | up really liking Firebolt, some amazing tech with a few
       | roughedges (its pretty new), basically Clickhouse speed meets
       | Snowflake simplicity definitely one to watch.
       | 
       | https://www.firebolt.io/
        
         | qaq wrote:
         | reads like marketing copy
        
       ___________________________________________________________________
       (page generated 2021-09-20 23:00 UTC)