Site icon SAP Human Resources Experts And Solutions

Resume parsing

<p>Want to talk about&quest;<&sol;p>&NewLine;<p>It&&num;8217&semi;s a boring time for a recruiter&period; SAP knows well how to it his job&comma; there is nothing to do about&period; Position requests from managers are being processed themselves&comma; posted to internal and external web-portals or agencies&period; Feedbacks are being sent back to managers by themselves and automatically&period; Everything is integrated&period; CV is being read from email  hireme&commat;saphcmsolutions&period;com&comma; parsed to bones and stored in candidate database&period; Interviews are being initiated from a mobile phone&comma; rooms are reserved&period; Boring&comma; no fun at all&period;<&sol;p>&NewLine;<p>Everything is clear except CV&period; We know every resume is made of a typical skeleton&comma; where is personal info&comma; contacts&comma; work experience&period; Every part could be formalized&comma; parsed to its components and analyzed by a number of factors and variants of appearance&period;<&sol;p>&NewLine;<p>We understand that First and Last names could match file name&comma; never is written with punctuation characters&comma; always start with a capital letter or are all capital and resides in the top part of a doc&period;<&sol;p>&NewLine;<p>We also understand that contact phone number has fixed number of digits&comma; patterns are also well known and it&&num;8217&semi;s placed somewhere by name or e-mail address&period;<&sol;p>&NewLine;<p>We understand that work experience is a consequence of the same type blocks with company&comma; period&comma; position and job functions specification&period; It&&num;8217&semi;s just a table which can be retrieved from CV somehow&period; Let&&num;8217&semi;s say exported in XML format&comma; where we can easily find repeating elements that appear more than once&period;<&sol;p>&NewLine;<p><&excl;--more--><br &sol;>&NewLine;But how to find it in a plain text&quest; Elementary&excl; There are so-called regular expressions&period; <a href&equals;"https&colon;&sol;&sol;regexr&period;com&sol;" rel&equals;"noopener" target&equals;"&lowbar;blank">Google it<&sol;a>&period; Let&&num;8217&semi;s use them to do resume parsing&period;<&sol;p>&NewLine;<p>Here is the way to work with them in  ABAP&colon;<br &sol;>&NewLine;FIND REGEX &OpenCurlyQuote;A&ast;B’ IN &OpenCurlyQuote;ABAP’&period;<br &sol;>&NewLine;REPLACE ALL OCCURRENCES OF REGEX regpattern<br &sol;>&NewLine;IN text<br &sol;>&NewLine;WITH new<br &sol;>&NewLine;REPLACEMENT COUNT cnt&period;<&sol;p>&NewLine;<p>Or any email address could be found in a text with this code&colon;<&sol;p>&NewLine;<p>Data v&lowbar;pattern &equals; &OpenCurlyQuote;&Hat;&lpar;&lbrack;0-9a-zA-Z&rsqb;&lpar;&lbrack;-&period;&bsol;w&rsqb;&ast;&lbrack;0-9a-zA-Z&rsqb;&rpar;&ast;&commat;&lpar;&lbrack;0-9a-zA-Z&rsqb;&lbrack;-&bsol;w&rsqb;&ast;&lbrack;0-9a-zA-Z&rsqb;&bsol;&period;&rpar;&plus;&lbrack;a-zA-Z&rsqb;&lbrace;2&comma;9&rcub;&rpar;&dollar;’&period;<&sol;p>&NewLine;<p>lr&lowbar;matcher &equals; cl&lowbar;abap&lowbar;matcher&equals;&gt&semi; create &lpar;pattern &equals; v&lowbar;pattern Text &equals; ’test&commat;gmail&period;com’&rpar;&period;<&sol;p>&NewLine;<p>CALL METHOD lr&lowbar;matcher-&gt&semi;match RECEIVING success &equals; v&lowbar;sucess&period;<&sol;p>&NewLine;<p>IF v&lowbar;sucess &equals; abap&lowbar;false&period;<&sol;p>&NewLine;<p>Message &OpenCurlyQuote;Invalid email id’ TYPE &OpenCurlyQuote;I’&period;<&sol;p>&NewLine;<p>ENDIF&period;<&sol;p>&NewLine;<p>&lpar;&quest;&rpar; Code samples are from SDN&period; Author is&colon;<br &sol;>&NewLine;Author&colon; Shaira Madhu<br &sol;>&NewLine;Company&colon; Applexus Software Solutions &lpar;P&rpar; Ltd<br &sol;>&NewLine;Created on&colon; 25 October 2010<&sol;p>&NewLine;<p>&nbsp&semi;<&sol;p>&NewLine;

Exit mobile version