freeCodeCamp/curriculum/challenges/english/10-coding-interview-prep/rosetta-code/tokenize-a-string-with-esca...

---
title: Tokenize a string with escaping
id: 594faaab4e2a8626833e9c3d
challengeType: 5
forumTopicId: 302338
---

## Description
<section id='description'>
Write a function or program that can split a string at each non-escaped occurrence of a separator character.
It should accept three input parameters:
<ul>
  <li>The <strong>string</strong></li>
  <li>The <strong>separator character</strong></li>
  <li>The <strong>escape character</strong></li>
</ul>
It should output a list of strings.
Rules for splitting:
<ul>
  <li>The fields that were separated by the separators, become the elements of the output list.</li>
  <li>Empty fields should be preserved, even at the start and end.</li>
</ul>
Rules for escaping:
<ul>
  <li>"Escaped" means preceded by an occurrence of the escape character that is not already escaped itself.</li>
  <li>When the escape character precedes a character that has no special meaning, it still counts as an escape (but does not do anything special).</li>
  <li>Each occurrences of the escape character that was used to escape something, should not become part of the output.</li>
</ul>
Demonstrate that your function satisfies the following test-case:
Given the string
<pre>one^|uno||three^^^^|four^^^|^cuatro|</pre>
and using <code>|</code> as a separator and <code>^</code> as escape character, your function should output the following array:
<pre>
  ['one|uno', '', 'three^^', 'four^|cuatro', '']
</pre>
</section>

## Instructions
<section id='instructions'>

</section>

## Tests
<section id='tests'>

```yml
tests:
  - text: <code>tokenize</code> should be a function.
    testString: assert(typeof tokenize === 'function');
  - text: <code>tokenize</code> should return an array.
    testString: assert(typeof tokenize('a', 'b', 'c') === 'object');
  - text: <code>tokenize('one^|uno||three^^^^|four^^^|^cuatro|', '|', '^') </code> should return <code>['one|uno', '', 'three^^', 'four^|cuatro', '']</code>
    testString: assert.deepEqual(tokenize(testStr1, '|', '^'), res1);
  - text: <code>tokenize('a@&bcd&ef&&@@hi', '&', '@')</code> should return <code>['a&bcd', 'ef', '', '@hi']</code>
    testString: assert.deepEqual(tokenize(testStr2, '&', '@'), res2);

```

</section>

## Challenge Seed
<section id='challengeSeed'>

<div id='js-seed'>

```js
function tokenize(str, sep, esc) {
  return true;
}
```

</div>


### After Test
<div id='js-teardown'>

```js
const testStr1 = 'one^|uno||three^^^^|four^^^|^cuatro|';
const res1 = ['one|uno', '', 'three^^', 'four^|cuatro', ''];

// TODO add more tests
const testStr2 = 'a@&bcd&ef&&@@hi';
const res2 = ['a&bcd', 'ef', '', '@hi'];
```

</div>

</section>

## Solution
<section id='solution'>


```js
// tokenize :: String -> Character -> Character -> [String]
function tokenize(str, charDelim, charEsc) {
  const dctParse = str.split('')
    .reduce((a, x) => {
      const blnEsc = a.esc;
      const blnBreak = !blnEsc && x === charDelim;
      const blnEscChar = !blnEsc && x === charEsc;

      return {
        esc: blnEscChar,
        token: blnBreak ? '' : (
          a.token + (blnEscChar ? '' : x)
        ),
        list: a.list.concat(blnBreak ? a.token : [])
      };
    }, {
      esc: false,
      token: '',
      list: []
    });

  return dctParse.list.concat(
    dctParse.token
  );
}

```

</section>
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00			`---`
			`title: Tokenize a string with escaping`
			`id: 594faaab4e2a8626833e9c3d`
			`challengeType: 5`
fix(curriculum): Added forumTopicId to remaining 1200 challeng… (#36558) 2019-08-05 16:17:33 +00:00			`forumTopicId: 302338`
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00			`---`

			`## Description`
			`<section id='description'>`
			`Write a function or program that can split a string at each non-escaped occurrence of a separator character.`
			`It should accept three input parameters:`
fix(challenges): T problems 2019-03-10 13:12:52 +00:00			`<ul>`
Changed bold to strong or code tags where possible 2019-06-14 11:04:16 +00:00			`<li>The <strong>string</strong></li>`
			`<li>The <strong>separator character</strong></li>`
			`<li>The <strong>escape character</strong></li>`
fix(challenges): T problems 2019-03-10 13:12:52 +00:00			`</ul>`
			`It should output a list of strings.`
			`Rules for splitting:`
			`<ul>`
			`<li>The fields that were separated by the separators, become the elements of the output list.</li>`
			`<li>Empty fields should be preserved, even at the start and end.</li>`
			`</ul>`
			`Rules for escaping:`
			`<ul>`
			`<li>"Escaped" means preceded by an occurrence of the escape character that is not already escaped itself.</li>`
			`<li>When the escape character precedes a character that has no special meaning, it still counts as an escape (but does not do anything special).</li>`
			`<li>Each occurrences of the escape character that was used to escape something, should not become part of the output.</li>`
			`</ul>`
			`Demonstrate that your function satisfies the following test-case:`
			`Given the string`
			`<pre>one^\|uno\|\|three^^^^\|four^^^\|^cuatro\|</pre>`
			`and using <code>\|</code> as a separator and <code>^</code> as escape character, your function should output the following array:`
			`<pre>`
			`['one\|uno', '', 'three^^', 'four^\|cuatro', '']`
			`</pre>`
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00			`</section>`

			`## Instructions`
			`<section id='instructions'>`

			`</section>`

			`## Tests`
			`<section id='tests'>`

			```yml
chore(curriculum): Remove files in wrong format 2018-10-04 13:37:37 +00:00			`tests:`
fix(curriculum): changed test text to use should for Coding Interview Prep - part 2 of 2 (#37766) * fix: changed test text to use should * fix: corrected typo Co-Authored-By: Tom <20648924+moT01@users.noreply.github.com> * fix: removed extra period Co-Authored-By: Tom <20648924+moT01@users.noreply.github.com> * fix: removed extra period Co-Authored-By: Tom <20648924+moT01@users.noreply.github.com> * fix: removed extra period Co-Authored-By: Tom <20648924+moT01@users.noreply.github.com> * fix: removed extra period Co-Authored-By: Tom <20648924+moT01@users.noreply.github.com> * fix: corrected typo Co-Authored-By: Tom <20648924+moT01@users.noreply.github.com> 2019-11-20 15:01:31 +00:00			`- text: <code>tokenize</code> should be a function.`
fix(curriculum): Remove unnecessary assert message argument from English Coding Interview Prep challenges - 02 (#36412) * fix: removed assert msg argument * fix: removed msgs surrounded by 2 single quotes * fix: removed missing 2 assert msg arguments * fix: remove msg surrounded by two single quotes * fix: removed unnecessary assert msg args * fix; remove msgs surrounded by double quotes * fix: removed unnecessary assert msg args * fix: remove unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: Restore expected values to assertions * fix: remove assertion message Co-authored-by: Vivek Agrawal <vivekmittalagrawal@gmail.com> 2019-07-26 12:24:52 +00:00			`testString: assert(typeof tokenize === 'function');`
chore(curriculum): Remove files in wrong format 2018-10-04 13:37:37 +00:00			`- text: <code>tokenize</code> should return an array.`
fix(curriculum): Remove unnecessary assert message argument from English Coding Interview Prep challenges - 02 (#36412) * fix: removed assert msg argument * fix: removed msgs surrounded by 2 single quotes * fix: removed missing 2 assert msg arguments * fix: remove msg surrounded by two single quotes * fix: removed unnecessary assert msg args * fix; remove msgs surrounded by double quotes * fix: removed unnecessary assert msg args * fix: remove unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: Restore expected values to assertions * fix: remove assertion message Co-authored-by: Vivek Agrawal <vivekmittalagrawal@gmail.com> 2019-07-26 12:24:52 +00:00			`testString: assert(typeof tokenize('a', 'b', 'c') === 'object');`
Fix: remove quote from challenge where not needed [english] (#35493) 2019-03-19 09:34:03 +00:00			`- text: <code>tokenize('one^\|uno\|\|three^^^^\|four^^^\|^cuatro\|', '\|', '^') </code> should return <code>['one\|uno', '', 'three^^', 'four^\|cuatro', '']</code>`
fix(curriculum): Remove unnecessary assert message argument from English Coding Interview Prep challenges - 02 (#36412) * fix: removed assert msg argument * fix: removed msgs surrounded by 2 single quotes * fix: removed missing 2 assert msg arguments * fix: remove msg surrounded by two single quotes * fix: removed unnecessary assert msg args * fix; remove msgs surrounded by double quotes * fix: removed unnecessary assert msg args * fix: remove unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: Restore expected values to assertions * fix: remove assertion message Co-authored-by: Vivek Agrawal <vivekmittalagrawal@gmail.com> 2019-07-26 12:24:52 +00:00			`testString: assert.deepEqual(tokenize(testStr1, '\|', '^'), res1);`
fix(curriculum): quotes in tests (#18828) * fix(curriculum): tests quotes * fix(curriculum): fill seed-teardown * fix(curriculum): fix tests and remove unneeded seed-teardown 2018-10-20 18:02:47 +00:00			`- text: <code>tokenize('a@&bcd&ef&&@@hi', '&', '@')</code> should return <code>['a&bcd', 'ef', '', '@hi']</code>`
fix(curriculum): Remove unnecessary assert message argument from English Coding Interview Prep challenges - 02 (#36412) * fix: removed assert msg argument * fix: removed msgs surrounded by 2 single quotes * fix: removed missing 2 assert msg arguments * fix: remove msg surrounded by two single quotes * fix: removed unnecessary assert msg args * fix; remove msgs surrounded by double quotes * fix: removed unnecessary assert msg args * fix: remove unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg args * fix: removed unnecessary assert msg arg * fix: removed unnecessary assert msg args * fix: Restore expected values to assertions * fix: remove assertion message Co-authored-by: Vivek Agrawal <vivekmittalagrawal@gmail.com> 2019-07-26 12:24:52 +00:00			`testString: assert.deepEqual(tokenize(testStr2, '&', '@'), res2);`
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00
			```

			`</section>`

			`## Challenge Seed`
			`<section id='challengeSeed'>`

			`<div id='js-seed'>`

			```js
commit 7/8 Rosetta tokenize (#39213) 2020-07-08 22:21:01 +00:00			`function tokenize(str, sep, esc) {`
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00			`return true;`
			`}`
			```

			`</div>`


			`### After Test`
			`<div id='js-teardown'>`

			```js
fix(curriculum): quotes in tests (#18828) * fix(curriculum): tests quotes * fix(curriculum): fill seed-teardown * fix(curriculum): fix tests and remove unneeded seed-teardown 2018-10-20 18:02:47 +00:00			`const testStr1 = 'one^\|uno\|\|three^^^^\|four^^^\|^cuatro\|';`
			`const res1 = ['one\|uno', '', 'three^^', 'four^\|cuatro', ''];`

			`// TODO add more tests`
			`const testStr2 = 'a@&bcd&ef&&@@hi';`
			`const res2 = ['a&bcd', 'ef', '', '@hi'];`
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00			```

			`</div>`

			`</section>`

			`## Solution`
			`<section id='solution'>`


			```js
			`// tokenize :: String -> Character -> Character -> [String]`
			`function tokenize(str, charDelim, charEsc) {`
fix(challenge-md): Fix file names and preserve challenge order in meta.json 2018-10-02 14:02:53 +00:00			`const dctParse = str.split('')`
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00			`.reduce((a, x) => {`
			`const blnEsc = a.esc;`
			`const blnBreak = !blnEsc && x === charDelim;`
			`const blnEscChar = !blnEsc && x === charEsc;`

			`return {`
			`esc: blnEscChar,`
fix(curriculum): quotes in tests (#18828) * fix(curriculum): tests quotes * fix(curriculum): fill seed-teardown * fix(curriculum): fix tests and remove unneeded seed-teardown 2018-10-20 18:02:47 +00:00			`token: blnBreak ? '' : (`
			`a.token + (blnEscChar ? '' : x)`
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00			`),`
			`list: a.list.concat(blnBreak ? a.token : [])`
			`};`
			`}, {`
			`esc: false,`
fix(curriculum): quotes in tests (#18828) * fix(curriculum): tests quotes * fix(curriculum): fill seed-teardown * fix(curriculum): fix tests and remove unneeded seed-teardown 2018-10-20 18:02:47 +00:00			`token: '',`
feat(challenge-md): Add initial markdown challenge files 2018-09-30 22:01:58 +00:00			`list: []`
			`});`

			`return dctParse.list.concat(`
			`dctParse.token`
			`);`
			`}`

			```

			`</section>`