
bquery

bQuery is a simple command-line tool for quickly parsing HTML streams with jQuery-like expressions.

Examples:

get h1 element text content

wget -qO- http://en.wikipedia.org/wiki/Secure_Shell |./bquery h1

multiple results => multiple lines

wget -qO- http://en.wikipedia.org/wiki/Secure_Shell |./bquery h2

get the first result

wget -qO- http://en.wikipedia.org/wiki/Secure_Shell |./bquery h2:first

=============================================================================

By default, the inner plain text of each matched element is output. If the matched element is an input, its value is output instead. To get the HTML content instead of plain text, pass the --html switch.
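The default output rule can be illustrated with a small standard-library sketch. This is not bquery's implementation (bquery relies on pyquery). The `TagOutput` class and `extract` function are hypothetical names, and the sketch matches by tag name only instead of full jQuery-like expressions.

```python
from html.parser import HTMLParser


class TagOutput(HTMLParser):
    """Collect one result per element with the given tag name.

    Hypothetical stand-in for the rule described above: plain text by
    default, the value attribute for input elements. Nested elements
    of the same tag are not handled.
    """

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.results = []
        self._buf = None  # text buffer while inside a matching element

    def handle_starttag(self, tag, attrs):
        if tag != self.tag:
            return
        if tag == "input":
            # input has no inner text, so output its value instead
            self.results.append(dict(attrs).get("value") or "")
        else:
            self._buf = []

    def handle_data(self, data):
        if self._buf is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == self.tag and self._buf is not None:
            self.results.append("".join(self._buf).strip())
            self._buf = None


def extract(html, tag):
    """Return the output lines for every element named `tag`."""
    parser = TagOutput(tag)
    parser.feed(html)
    parser.close()
    return parser.results


if __name__ == "__main__":
    page = '<h1>Secure <b>Shell</b></h1><input name="q" value="ssh">'
    for line in extract(page, "h1") + extract(page, "input"):
        print(line)
```

Markup inside a matched element is dropped and only its text is kept, which corresponds to the plain-text default; an --html style output would keep the inner markup instead.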

To extract a specific attribute instead of text, pass -a followed by the attribute name.

This example extracts all page links.

wget -qO- http://en.wikipedia.org/wiki/Secure_Shell |./bquery a -a href

Download all zip or exe files from the Audacity download page

wget -qO- http://audacity.sourceforge.net/download/windows |./bquery a -a href|grep -e zip -e exe|xargs -n1 wget
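For readers who want the same extract-then-filter step inside a Python script, the two pipelines above can be approximated as follows. This is a standard-library sketch, not bquery's own code; `AttrCollector`, `attr_values`, and `filter_downloads` are hypothetical helper names, and matching is by tag name only.

```python
from html.parser import HTMLParser


class AttrCollector(HTMLParser):
    """Collect the value of one attribute from every element of one tag."""

    def __init__(self, tag, attr):
        super().__init__()
        self.tag = tag
        self.attr = attr
        self.values = []

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            value = dict(attrs).get(self.attr)
            if value is not None:  # skip elements lacking the attribute
                self.values.append(value)


def attr_values(html, tag, attr):
    """Roughly what `bquery a -a href` prints, as a list of strings."""
    collector = AttrCollector(tag, attr)
    collector.feed(html)
    collector.close()
    return collector.values


def filter_downloads(links, needles=("zip", "exe")):
    """Keep links containing any needle, like `grep -e zip -e exe`."""
    return [link for link in links if any(n in link for n in needles)]


if __name__ == "__main__":
    page = '<a href="setup.exe">win</a> <a href="index.html">home</a>'
    for link in filter_downloads(attr_values(page, "a", "href")):
        print(link)
```

As with grep, the filter is a plain substring match, so a link such as "unzip-guide.html" would also pass; matching on the file extension would be stricter.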

=============================================================================

Installation

bquery requires python, python-pyquery, and python-argparse (python with the pyquery module).

With python setuptools installed, run "python setup.py install" to install it.
