New command: `shell` #57

anatoly-scherbakov · 2020-09-13T08:49:41Z

columns:
  uuid:
    shell:
      command: uuid -v4
      pipe: false

ysv should leverage the ecosystem of UNIX command line tools. It should permit the user to process the values of a given column through an external program.

There are multiple use cases to this.

As we have seen in practice, sometimes ysv's built in filters are not enough. We have to write custom code in another language to do some sort of complex processing for particular columns.

With shell command, we could teach ysv to call our Python script in a separate process and feed the values, line by line, to that script. It will read the output from stdout of the script and incorporate the resulting values into the output CSV dataset.

This would make ysv enormously extensible. Moreover, we could allow it to run multiple instances of the external program and thus facilitate the multiprocessing capabilities of modern hardware (which, say, Python alone cannot easily do).

Even without custom code, the communication using UNIX pipes allows to use standard command line tools, for example awk.

In both of these cases, we will get substantial expansion in functionality by leveraging tools that already exist out there, – and we can do that with great efficiency.

The text was updated successfully, but these errors were encountered:

anatoly-scherbakov · 2020-09-13T08:53:12Z

More examples.

columns:
  number:
    input: number_plus_five
    shell: awk { $1 + 5 }

or

columns:
  phone_number:
    input: Phone
    shell: python run.py validate_united_states_phonenumbers

In each case, ysv runs the provided shell command as another process (or processes) and feeds the input values to the stdin of that command. It then reads the processed values from stdout and inserts them into the output CSV dataset.

anatoly-scherbakov · 2020-09-13T14:56:01Z

It seems jq team is working on something similar: jqlang/jq#147

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New command: `shell` #57

New command: `shell` #57

anatoly-scherbakov commented Sep 13, 2020

anatoly-scherbakov commented Sep 13, 2020 •

edited

Loading

anatoly-scherbakov commented Sep 13, 2020

New command: shell #57

New command: shell #57

Comments

anatoly-scherbakov commented Sep 13, 2020

anatoly-scherbakov commented Sep 13, 2020 • edited Loading

anatoly-scherbakov commented Sep 13, 2020

New command: `shell` #57

New command: `shell` #57

anatoly-scherbakov commented Sep 13, 2020 •

edited

Loading