[HN Gopher] Why isn't there a universal data format for resumes? ___________________________________________________________________ Why isn't there a universal data format for resumes? Author : ColinWright Score : 79 points Date : 2022-01-16 21:27 UTC (1 hours ago) (HTM) web link (toot.cat) (TXT) w3m dump (toot.cat) | forrestthewoods wrote: | There is. It's called LinkedIn. | alberto7 wrote: | That's like asking "Why isn't there a universal format for | dating?" | andy_ppp wrote: | I think the same thing about invoicing... especially given the | different hoops every company makes you jump through to make your | invoice compliant. | dqv wrote: | Don't they kind of do that at larger organizations with EDI? Or | is it still nonstandard between organizations? | wolfgang42 wrote: | EDI is only semi-standardized; in my experience usually each | pair of trading partners requires a separate effort (usually | the bigger partner imposes the requirement and the smaller | one has to make changes). Some difficulties include: | | * There are multiple protocols for exchanging EDI documents | (AS1, SMTP, FTPS, SFTP...) | | * There are multiple standards for the documents themselves | (EDIFACT, X12, GS1 XML...) | | * There are partner-specific business rules that need to be | set up (this partner needs an ASN, that partner requires | invoices to be for only one PO each, the other partner can | only accept invoices which use their ERP's internal codes...) | | Some of these problems can be papered over with a "VAN" that | can translate between standards, but I have yet to see one | business send a non-PDF invoice to another without a lot of | fuss. | frabert wrote: | Italy actually has a standard for electronic invoicing, and | most companies are required to use it ("Fattura Elettronica"). | It works by having a centralized portal to which invoices are | submitted. | | Of course, being a SERIOUS, CORPORATE standard it needs to be | overly complex, based on XML and SOAP and WebServices and | whatnot: | https://www.agenziaentrate.gov.it/portale/documents/20143/23... | jfengel wrote: | I used to work in ontologies, and what I learned is that people | would rather get an 80% heuristic solution for dirty data rather | than a 100% correct solution for data they have to clean. | | The big job sites have resume parsers that work well enough from | a PDF or Word doc, and then they don't have to worry about you | forgetting a close tag or a mandatory field. Sure, stuff gets | lost, but they get 80% of a billion resumes rather than 100% of a | million of them. They can't exchange data or even trust what they | have, but it's good enough for them to make money. Meanwhile, a | competitor demanding good data from its clients never gets off | the ground. | | Anyway, every data format for human information ends up being | either vague (to allow in everything) or impossible (see the | myths that programmers believe about names, time, addresses, | etc.) You end up giving a string for each field... Then give up, | just accept any string, and hope for the best. | | Everybody wants Google or some machine learning solution because | the formats never work for the information people want to convey. | Better solutions could exist but the hacky ones are first to | market, in a natural monopoly where there really only needs one | good enough product. | | If you think a lack of a good resume format is bad, look at | electronic health records. Those are far more important to be | correct and exchangeable, and even there the cleanup effort is | always enormous. | kaderno wrote: | Nobody? https://xkcd.com/927/ | wintorez wrote: | CTRL+F | JulianMorrison wrote: | Anti-competitive practise. If they make entering your resume | suck, maybe you will only do it on one site (theirs). | thaumasiotes wrote: | I've been hearing people react to current (American) college | admissions by saying the problem is the Common Application | allowing people to apply to many schools at once. | | I would be more likely to locate the problem in increasing | numbers of applicants rather than increasing numbers of | applications per applicant, but I wouldn't be opposed to seeing | a more centralized system go up. American medical schools | accept students through such a centralized system. Chinese | university admissions isn't centralized to the same degree as | American medical schools, but it's still quite centralized, and | _more_ formalized. | ianai wrote: | I'm pretty sure PDFs are the universal for resumes or close to | it. Don't send a word doc though. | netcan wrote: | Hiring companies might prefer to just have this as a "hoop." | | There are, inevitably, some people who apply to a _lot_ of jobs. | Jobs they aren 't qualified for. Low intent applications, where | the applicant isn't really that interested. Etc. | | Even if these are a minority, they apply to a lot of jobs. In any | case, standardizing job applications (OP seems to be talking | about application forms that are mostly resume-ish fields) just | means more of these. More volume, more noise, probably not many | more successful hires. | | There's kind of the same dynamic on the other side. Most workers | don't love the idea of submitting an indexable resume for | employers to leaf through. | | Sometimes a modicum of friction is helpful. | version_five wrote: | They should ask for a cover letter. | | I agree that a hoop is needed - I recently posted a job on | LinkedIn which has some kind of "click to apply" functionality, | and it was clear that most of the applicants were just lazy | clickers that probably hadn't even read the posting. On the | other hand, if you make people jump through pointless | procedural hoops, you're screening for people who are ok doing | that. | | Ask for a cover letter, you get people who actually want the | job and hear from them why they are interested. | aaron695 wrote: | ardel95 wrote: | Because humans are too complex to fit into schemas. | simonw wrote: | Because there's no demand for it. The vast majority of people who | read and write resumes have no idea what an "open standard" is, | and would be unable to create or read any resume that used such a | format. | | They need the software to be built first. But the software won't | be built unless there is demand for it. | FpUser wrote: | I very much like to write my resume the way I think represents me | the best. The last think I need is to tailor it so some | programmer who struggles with parsing. | | When hire I also like to receive custom resume that helps me to | understand better why hire. Again I am totally uninterested | tailoring my offering to formalized set of checkboxes concocted | by some "industry expert". | NKosmatos wrote: | Indeed there is a lack of a standard format for CVs, but this is | the same for many other things in computing/IT/software in | general. See the well known XKCD comic for standards [0] :-) I | don't know if it helps, but in EU we have a well known and | familiar format for the Europass CV [1] allowing someone to | create a CV and then store it in 29 languages and also download | it in various formats (xml included). | | [0] https://xkcd.com/927/ [1] | https://europa.eu/europass/en/create-europass-cv | nojito wrote: | Because a resume isn't a job application. It's a description of | who/what you think you are. | DamonHD wrote: | Could one use schema.org/Person to mark up HTML5 with appropriate | microdata? | ivan_gammel wrote: | schema.org/Person is a terribly designed format, that I would | not bring into new projects. Here are some of the reasons: | | 1. It lacks diversity (e.g. GenderType is incompatible with | German laws, requiring "diverse" gender in official forms, | which is definitely not "unisex" defined in the format). | | 2. It looks very much like a US-centric God Object. It is not | clear why DUNS, ISICv4 or NAICS are there, but other | identifiers like national ID or SSN are not. It would be better | to have a single "identifiers" key-value map instead of them, | that would be extensible. | | 3. Contacts would deserve a dedicated structure and key-value | map (why single telephone field? why messenger IDs not there?). | jeroenhd wrote: | In my opinion, there shouldn't be. | | Documents like these shouldn't be automatically processed, they | should be reviewed by humans. Reducing someone's life history to | a list of educational institutions and employers feels robotic | even for a software developer's mindset. | | I understand that there are real life problems because companies | do use automated processing on applications, but that kind of | behaviour shouldn't be encouraged. | ravenstine wrote: | I wonder if there's any advantage in using fonts and layout | that are adversarial to automated resume processing, as in if | something fails to scan then perhaps a human is more likely to | actually look at it. But perhaps HR just throws those out. | viraptor wrote: | Does anyone OCR? The systems I've heard of just extract the | text from PDFs/docs. Then if some bits cannot be extracted, I | was asked to type them myself. | brunellus wrote: | Indeed if there were a standard format, there would be | competitive advantages to using other channels to showcase your | experience | gingkoguy wrote: | Also having this it will give you the User to make 1 resume and | apply to millions of jobs. It's a win / win situation | drivingmenuts wrote: | Downside is when you have to make slight changes each time to | better emphasize some aspect of your experience, depending on | the job applied for. Recruiting agencies often do this to sex | up their candidates chance of winning the job lottery. | selimthegrim wrote: | Unless you're the employer being deluged | stemlord wrote: | There is maybe 1 in 50 jobs that I'd actually want, so I | imagine bulk applying to would guarantee I land something | awful. | lethologica wrote: | Not everyone has the luxury of choice though when it comes | to employment. | jbverschoor wrote: | .docx | vortico wrote: | Was going to say PDF but this is close enough :) | d--b wrote: | Well... Would you rather apply to a job by sending a pdf that you | carefully crafted, or by sending your linkedin profile? | Kiro wrote: | LinkedIn profile for sure. | grayclhn wrote: | LinkedIn profile | heikkilevanto wrote: | Why should there be? Resumes have to be read by humans to judge | if they match what is needed. Checking boxes and matching | buzzwords only gets you so far. | yjftsjthsd-h wrote: | Because many job applications make me check boxes already; I'd | rather automate filling the same details into forms over and | over again. | duxup wrote: | I think because some things like work history, education, they | lend themselves to pretty standardized fields. | | And we have to keep filling them out by hand ... | threeseed wrote: | Resumes go through many systems i.e. job sites, recruiters, HR, | hiring managers and are often poorly parsed by many of them. | | It's not like people are managing dozens/hundreds of candidates | with a pen/paper. | | There is definitely a need for a standard format. | scollet wrote: | I think the most charitable and equitable function would be | sorting and solving without filtering. | | It shouldn't take too long to parse the desirable skills from | the top after that. | | Maybe this standard can match against desired/offered | compensation brackets as well to get that sweet spot on the | bell curve. | complexworld wrote: | I don't think it's used very much but there is this one | http://xml.coverpages.org/HR-XML-ResumeSpecification200205.h... | | This page mentions a few more formats: | http://microformats.org/wiki/resume-formats | | It seems like the problem isn't a lack of standards, but rather a | lack of adoption and/or agreement on which standard to follow | AtlasBarfed wrote: | If companies wanted switching jobs or hiring to be streamlined | and simple, they wouldn't do the job games they play. | MattGaiser wrote: | I have had a lot of success with "September 2019 - September | 2020." Seems to work fine in nearly every case. | ghaff wrote: | Note that there is some number of people who feel pretty | strongly about obfuscating their age. I'm not sure what to feel | about it personally but it's understandable. | [deleted] | travisjungroth wrote: | I would have found it much more interesting and useful if the | author had tried to sincerely answer or even ask the question of | the title. Instead it's just a complaint. | bee_rider wrote: | I guess the answer must be: Companies have no incentive to make | it easy to apply to their job without manual intervention. | Which makes sense -- if it were easy enough to apply to | companies automatically, I guess some people would just write a | script to apply to every company in their field, right? | quickthrower2 wrote: | The underlying problem is you need a JSON format about the | company. But the things candidates care for generally are | easy to lie about (good culture, career progression, even | remote working options), so scatter guns are required. As | they say recruitment is broken - but its not easily fixable | because it's broken because of game theory and human | behaviour. | ghaff wrote: | This is something of an issue with conference proposals as | well. In fact, I've seen some conferences recently that put a | strict limit on number of submittals. As a sometimes | conference reviewer, I hate people throwing a bunch of | overlapping and often generic stuff at the wall. | yepthatsreality wrote: | Please insert girder. | dClauzel wrote: | There is, thanks to the semantic web (the real web 3 :) ). | | See for example the FoaF ontology | https://en.wikipedia.org/wiki/FOAF_(ontology) | Blikkentrekker wrote: | I do miss the glory days of when the W3C was still in control | and had some very good ideas that browsers did not like. | | Some of these old AAA XHTML 1.1+RDFa websites were well | engineered and a joy for both man and machine to behold. | Waterluvian wrote: | Is the "semantic web" used in sensible, productive ways these | days or does it remain an academic daydream? | | I'm highly skeptical of the whole concept and feel like I | should have organically come across it by now. | codingdave wrote: | Yeah, it is - not to the level people envisioned years ago | where people made up their own tags and found a blend of XML | and HTML, but it absolutely is in place for more meaningful | HTML. We have tags that tell you the purpose of content - | header, nav, section, article, footer, etc. Those tags are | read by screen readers and other accessibility tools, and do | bring semantic value to those readers. | ognarb wrote: | There is some neat application for example many tickets/hotel | reservation/restaurant do include them and this allow to make | some email clients displays these tickets in a standard way. | | KDE Itinerary and Apple Wallet make use of this. For the KDE | Itinerary part, you can read more on this here: | https://volkerkrause.eu/ (look at the KDE Itinerary posts) | gregoryl wrote: | Parsing CVs is awful. We outsource this entire feature to Daxtra, | who do a very average job (but, to be fair, better than anything | we have time to write!). | zxcvbn4038 wrote: | HR people don't like changing their procedures. | | I've done numerous interviews large corporations where the first | part of the interview process involved copying my printed resume | by hand onto sheets of paper so that someone could then type in | what I had written into a web browser. Why they couldn't just | copy from my printed resume or accept soft copy in word or ascii | I have no answers for. | | When I joined at Chase Manhattan it was very obvious that their | onboarding process is designed for large groups - many dozens of | people at a time - but the day I joined I was the only one being | hired. I spent a couple hours with just myself and a single HR | rep going through a half dozen rooms, in each room I had to sit | as far back and to the left as possible whereas she sat at the | front right of the room. She could not pass out forms until I was | seated, at which point I would have to come get the form from her | and return to the far side of the room to fill it out, then bring | it back to her, return to my seat, then she would announce we | were moving to the next room, and it would begin again. When I | tried to sit at the front of the room she became extremely | agitated and refused to continue until I returned to the back of | the room, when I tried to get a form from her without first | sitting in the back of the room same result. The rest of the | company is pretty much the same, it never got better. | csmcg wrote: | That sounds like an incredibly unsettling experience. | jay_kyburz wrote: | I'm for it. The resume _is_ just data and the least interesting | thing in an implication. The folio / work sample and the cover | letter is the interesting part that will get you an interview. | 6LLvveMx2koXfwn wrote: | Because individuality is as valuable a commodity in the workplace | as elsewhere. | Traster wrote: | If you really want a job at my company, and the level at which | you decide "this isn't worth my time" is 2 minutes to enter your | details then you weren't a good fit in the first place. There are | genuine reasons to turn down stupid requests in interview | (ridiculous coding tests) but "Re-enter all your details into | this form" isn't that hard, and if you really think you want the | job then it's a very small price to pay. What is does is force | unqualified candidates to distill their lack of qualifications. | So for the hiring company it let's them cut out that bottom 50% | of applications (not candidates, applications) that are just | absurd. If you standardize it all you're doing is giving the | mass-application candidates a cheat sheet. | version_five wrote: | There are two separate things going on in the linked post. One is | the absence of a standard format as identified. The second is | that HR software generally is enterprise software sold to | leadership where usability is an afterthought, and so it only | gets minimal development focus on having a quality resume | ingestion algorithm. So you get the terrible parsing the author | complains about. I'm confident that if any buyer really | prioritized this feature being better, it would be. | polote wrote: | There is one, it is just proprietary https://linkedin.com | CalRobert wrote: | There is! https://jsonresume.org/ | | But nobody really uses it for data interchange. I use it to | render my resume in new layouts now and then. | | Although, it does let style dictate content sometimes (some | templates force you to have dates down to the day for job start | and end dates, etc.) | spondyl wrote: | I used a JSON resume once and I like the idea. It didn't help | when I had to apply for an unemployment benefit (while still | trying to enter the industry) and they asked for my resume as a | word document. | | As you can imagine, I felt like a clown trying to explain that | I didn't have a word document because my resume was generated | from a JSON file. | | I did, however, have a PDF on a USB drive I always kept on me | but they refused to accept USBs out of fear that I was trying | to give them a virus. Eventually the lady processing my | application gave up and printed the PDF off that was hosted on | my website but also scolded me for not having a word doc. | | The whole ordeal was pointless anyway since they said they can | give me 70% of my rent. | Aeolun wrote: | My JSON resume is transformed to docx, and only latter to PDF | for exactly this reason. | ethbr0 wrote: | tl;dr - HR is underfunded / under-resourced / un- | aspirational. | | Even _if_ there were a turn-key software platform they all | used, which natively supported a standard data format, they | 'd still find a way to screw it up. | version_five wrote: | I have my resume in markdown so I can create a word/pdf/html | etc. with pandoc. It would be possible to write something | that parses a json resume into markdown so it could be | trivially format shifted | thomasfromcdnjs wrote: | Json Resume is still going strong. I am one of the founders, I | try to do a couple major maintenance periods per year. | Currently I am working on updating all the community projects | built, still got quite a few to add but currently there is -> | https://jsonresume.org/projects/ | | I think over 3k+ people use the new Gist hosting. (In our old | hosting we had around 10k resumes. Not including those who by | pass the free community hosting) | | === | | On a personal note, I've loved having my resume in a standard; | | - Depending on what type of company/person I am applying to I | will change my theme on the fly. (Startup vibes I will make it | look hipster, if it's a more formal role I will use a simple | black and white theme) | | - I use to lose my most recent resume constantly, having it in | a Gist called resume.json that I just edit seems to have solved | that for me. | | - Hopefully one day a standard will get integration adoption so | I can just upload my resume.json and not have to fill out the | same form fields a hundred times. | rospaya wrote: | I'm surprised nobody mentioned Europass. Personally, I'm not a | fan precisely because it only shows raw info and doesn't leave | anything for interpretation - it's cold and bureaucratic. But I | understand why people love to us it. Around half of the CVs I see | (in Europe) are in this format. | | https://en.wikipedia.org/wiki/Europass | jim-jim-jim wrote: | iirc there is a standard cv form in Japan. I never had to go | through the job search in this manner, but a colleague said he | just picked up a pack from a convenience store, filled them out, | and sent them to every company. | pestaa wrote: | I always liked the idea of Europass, but it's easy to understand | why it didn't get traction. | | https://europa.eu/europass/en/create-europass-cv | boomer918 wrote: | Let's make one. A universal format could help with bias and | resume anxiety. | ashtonkem wrote: | It's because the function of a resume is largely social | signaling, not a matter-of-fact reporting of career history. You | can't make a data format to encode that information, which is why | a lot of job platforms require a resume _and_ a list of your past | jobs in a form. | | Any solution for resumes is going to fall into the much harder | category of social change, rather than technical solutions. | kzrdude wrote: | resumes are an adversarial game; the author and the reader have | partly different goals. | | Does the author want the resume to be objectively judged? Maybe | not | awb wrote: | Exactly. The goal is to stand out from the crowd, not be part | of it. | renewiltord wrote: | I think that would be very cool. Coupled with a universal data- | format for jobs, one could imagine a much better job match engine | that doesn't rely on the right person finding the right job! | | Like a job market that matches employees to employers but in an | open database. I can't think of an easy way to monetize it in an | open data sense, sadly, but it seems like everyone would benefit | from it and so you could charge a small fee and kill recruiters | if your job match engine was good. | | The market being so lucrative, I imagine that someone has already | tried it and the devil is in the details. Perhaps if I cared more | about this and didn't have a great network, I might give it a | shot. Could bootstrap off Github profiles or something, and be | tech focused to start with. | stillicidious wrote: | Clearly written by an applicant rather than employer. Your resume | is your chance to shine, use the opportunity or pay a service to | do it for you. | | The reason the author is trapped long enough in meat grinder | hiring to notice this "problem" is likely precisely because of | some indistinguishable cookie cutter bullet point soup getting | them nowhere. If you can capture it in a data structure, it's not | a resume! | | See also: why isn't there a universal UI for web sites? | robertlagrant wrote: | > If you can capture it in a data structure, it's not a resume! | | Pretty sure a Word file is a data structure. | [deleted] | _jal wrote: | There is HRXML: | | https://workforce.com/news/what-hr-xml-means-and-why-youll-c... | | But I don't think anyone really cared. ___________________________________________________________________ (page generated 2022-01-16 23:00 UTC)