[HN Gopher] Apache Hop 2.0
       ___________________________________________________________________
        
       Apache Hop 2.0
        
       Author : CharlesW
       Score  : 52 points
       Date   : 2022-06-08 18:58 UTC (4 hours ago)
        
 (HTM) web link (hop.apache.org)
 (TXT) w3m dump (hop.apache.org)
        
       | waynesonfire wrote:
       | I'd be curious how this contrasts with apache nifi.
        
         | mring33621 wrote:
         | potato, potahto
        
       | frellus wrote:
       | Aside from this platform, which I've never heard about until now,
       | I'm wondering what others are using in the workflow orchestration
       | space?
       | 
       | I'd assume Airflow is the most prevalent, but there's also Argo
       | getting quite a bit of momentum lately.
        
         | yamrzou wrote:
         | In my previous job, I used Dagster. It has served us well.
         | 
         | See my comment about it here:
         | https://news.ycombinator.com/item?id=28803117
        
       | cmcconomy wrote:
       | "What is HOP?"
       | 
       | https://hop.apache.org/manual/latest/getting-started/hop-wha...
        
         | chrisweekly wrote:
         | > "VISUAL DESIGN AND METADATA
         | 
         | > Apache Hop, short for Hop Orchestration Platform, is a data
         | orchestration and data engineering platform that aims to
         | facillitate all aspects of data and metadata orchestration. Hop
         | lets you focus on the problem you're trying to solve without
         | technology getting in the way. Simple tasks should be easy,
         | complex tasks need to be possible.
         | 
         | > Hop allows data professionals to work visually, using
         | metadata to describe how data should be processed. Visual
         | design enables data developers to focus on what they want to do
         | instead of how that task needs to be done. This focus on the
         | task at hand lets Hop developers be more productive than they
         | would be when writing code."
        
           | Simon_O_Rourke wrote:
           | Thanks for expanding that, it reads like it's some Airflow
           | competitor. Would be curious how it handles all the
           | authentication management for the various pipeline elements.
        
           | jjtheblunt wrote:
           | I'm misunderstanding how so many Apache hosted projects P let
           | someone focus on X without Y getting in the way, totally
           | ignoring the complexity of introducing P and altering
           | everything to align with P, thereby forbidding focus on X.
           | 
           | Are these really often useful?
        
         | waynesonfire wrote:
         | "Hop initially (late 2019) started as a fork of the Kettle
         | (Pentaho Data Integration)."
        
           | mason55 wrote:
           | Wow. PDI is one of the worst pieces of software I've ever
           | used. Possibly only second to Pentaho Report Designer.
           | 
           | From looking at the Apache HOP docs, it doesn't look like
           | they have changed the UI much (if at all). I wonder if they
           | at least made it less buggy.
        
       | arthurcolle wrote:
       | Wow I was expecting this to be the 463rd distributed computing
       | streaming framework since its Apache. Shocked that its not
        
       | mi_lk wrote:
       | tangent - what is it about Apache or big data that the associated
       | softwares are mostly written in Java?
        
         | oaiey wrote:
         | Java, like .NET, are just solid application platforms which are
         | statically typed and their performance is good.
         | 
         | Java has a history in big systems for soon 30 years.
         | 
         | Rust, Python and Go are just not there yet. Rust is too low
         | level, Python is not statically typed and will always suffer
         | performance wise and Go ... I is a youngster :). And .NET is
         | always not everyone's free choice.
         | 
         | And Apache, well they just liked Java for their applications.
         | They started with some C/C++ code but then quickly aggregated a
         | lot of Java tech.
        
           | GordonS wrote:
           | > And .NET is always not everyone's free choice.
           | 
           | Hey, sometimes it really is!!
        
         | ojhughes wrote:
         | MapReduce and HDFS were written in Java and they paved the way
         | for a lot of the other big data tools
        
         | manish_gill wrote:
         | Something I've been asking for a long time as well. Java/JVM
         | are great, but it would be great to see _some_ diversity in the
         | Big Data ecosystem when it comes to implementations. :)
        
         | mrtweetyhack wrote:
        
         | qbasic_forever wrote:
         | From like 2000 to 2010 or even 2015 either java or .NET was the
         | default choice for big enterprise companies. Nobody ever got
         | fired for picking Microsoft or Java (I would add), as they say.
         | A lot of these Apache projects have been donated from work at
         | big enterprises so it comes out of that background from
         | enterprise I imagine.
        
         | zmmmmm wrote:
         | Performance, portability, stability, scalability, concurrency,
         | ecosystem (libraries, etc) .... despite all the new languages
         | around, there actually still aren't many alternatives that give
         | you the same combination of all these to the same level as Java
         | does.
        
         | zekrioca wrote:
         | Good question, but maybe due to Java's stability and
         | portability.
        
       | rektide wrote:
       | Ooh Drools plugins, for rule based event-processing. Neat. Hope I
       | can find some examples!
       | 
       | I havent used Airflow, but my impression is this fits a similar
       | role. That itcs built atop good tech like Apache Beam & can use
       | things like Flink is, in my book, a nice win.
        
       ___________________________________________________________________
       (page generated 2022-06-08 23:00 UTC)