[HN Gopher] Apache Hop 2.0 ___________________________________________________________________ Apache Hop 2.0 Author : CharlesW Score : 52 points Date : 2022-06-08 18:58 UTC (4 hours ago) (HTM) web link (hop.apache.org) (TXT) w3m dump (hop.apache.org) | waynesonfire wrote: | I'd be curious how this contrasts with apache nifi. | mring33621 wrote: | potato, potahto | frellus wrote: | Aside from this platform, which I've never heard about until now, | I'm wondering what others are using in the workflow orchestration | space? | | I'd assume Airflow is the most prevalent, but there's also Argo | getting quite a bit of momentum lately. | yamrzou wrote: | In my previous job, I used Dagster. It has served us well. | | See my comment about it here: | https://news.ycombinator.com/item?id=28803117 | cmcconomy wrote: | "What is HOP?" | | https://hop.apache.org/manual/latest/getting-started/hop-wha... | chrisweekly wrote: | > "VISUAL DESIGN AND METADATA | | > Apache Hop, short for Hop Orchestration Platform, is a data | orchestration and data engineering platform that aims to | facillitate all aspects of data and metadata orchestration. Hop | lets you focus on the problem you're trying to solve without | technology getting in the way. Simple tasks should be easy, | complex tasks need to be possible. | | > Hop allows data professionals to work visually, using | metadata to describe how data should be processed. Visual | design enables data developers to focus on what they want to do | instead of how that task needs to be done. This focus on the | task at hand lets Hop developers be more productive than they | would be when writing code." | Simon_O_Rourke wrote: | Thanks for expanding that, it reads like it's some Airflow | competitor. Would be curious how it handles all the | authentication management for the various pipeline elements. | jjtheblunt wrote: | I'm misunderstanding how so many Apache hosted projects P let | someone focus on X without Y getting in the way, totally | ignoring the complexity of introducing P and altering | everything to align with P, thereby forbidding focus on X. | | Are these really often useful? | waynesonfire wrote: | "Hop initially (late 2019) started as a fork of the Kettle | (Pentaho Data Integration)." | mason55 wrote: | Wow. PDI is one of the worst pieces of software I've ever | used. Possibly only second to Pentaho Report Designer. | | From looking at the Apache HOP docs, it doesn't look like | they have changed the UI much (if at all). I wonder if they | at least made it less buggy. | arthurcolle wrote: | Wow I was expecting this to be the 463rd distributed computing | streaming framework since its Apache. Shocked that its not | mi_lk wrote: | tangent - what is it about Apache or big data that the associated | softwares are mostly written in Java? | oaiey wrote: | Java, like .NET, are just solid application platforms which are | statically typed and their performance is good. | | Java has a history in big systems for soon 30 years. | | Rust, Python and Go are just not there yet. Rust is too low | level, Python is not statically typed and will always suffer | performance wise and Go ... I is a youngster :). And .NET is | always not everyone's free choice. | | And Apache, well they just liked Java for their applications. | They started with some C/C++ code but then quickly aggregated a | lot of Java tech. | GordonS wrote: | > And .NET is always not everyone's free choice. | | Hey, sometimes it really is!! | ojhughes wrote: | MapReduce and HDFS were written in Java and they paved the way | for a lot of the other big data tools | manish_gill wrote: | Something I've been asking for a long time as well. Java/JVM | are great, but it would be great to see _some_ diversity in the | Big Data ecosystem when it comes to implementations. :) | mrtweetyhack wrote: | qbasic_forever wrote: | From like 2000 to 2010 or even 2015 either java or .NET was the | default choice for big enterprise companies. Nobody ever got | fired for picking Microsoft or Java (I would add), as they say. | A lot of these Apache projects have been donated from work at | big enterprises so it comes out of that background from | enterprise I imagine. | zmmmmm wrote: | Performance, portability, stability, scalability, concurrency, | ecosystem (libraries, etc) .... despite all the new languages | around, there actually still aren't many alternatives that give | you the same combination of all these to the same level as Java | does. | zekrioca wrote: | Good question, but maybe due to Java's stability and | portability. | rektide wrote: | Ooh Drools plugins, for rule based event-processing. Neat. Hope I | can find some examples! | | I havent used Airflow, but my impression is this fits a similar | role. That itcs built atop good tech like Apache Beam & can use | things like Flink is, in my book, a nice win. ___________________________________________________________________ (page generated 2022-06-08 23:00 UTC)