Padre binaries: padre-sr (Padre show results)


What is it for?

padre-sr sequentially reads the index.results file, without reference to the doc table.

Usage

./padre-sr stem|results_file [-titleonly] [-unco] [-ifff|-embedded|-text|-html|-textsigs] [starting_doc|starting_url] [num_docs_to_show]

All or part of the file is written to stdout in one of the following formats:

  • html (default)
  • embedded (incomplete html suitable for embedding in another html document)
  • text
  • textsigs (generates a stem.textsigs file suitable for near-duplicate detection)

If -titleonly is given, only the document titles are output (not applicable to textsigs).

Use -unco to specify that the input file is in the old uncompressed format.

If a starting document number or URL is given, output commences only when that point in the file is reached. Output continues to the end of the file unless num_docs_to_show is given, as in the last example below, where the second number limits output to one document.

Examples

$ ./padre-sr /opt/funnelback/...../index
<html>
<head><meta http-equiv="content-type" content="text/html; charset=UTF-8">
</head>
<h3>Document 1:
<a href='http://test-data.funnelback.com/Shakespeare/lear/lear.1.2.html'>test-data.funnelback.com/Shakespeare/lear/lear.1.2.html</a></h3>
<table border=1>
<tr><td>56</td><td>101</td><td>102</td><td>4358</td><td>At offset 0</td></tr></table
<b>Title: </b>SCENE II. The Earl of Gloucester's castle. <br>
<b>Description: </b></br>
<b>Body: </b><i>Enter EDMUND, with a letter\ 01 EDMUND.<br>Thou, nature, art my goddess; to thy
law.<br>My services are bound\ 01 Wherefore
.....
$ ./padre-sr /opt/funnelback/...../index -text
DOCUMENT 1: http://test-data.funnelback.com/Shakespeare/lear/lear.1.2.html
TITLE: SCENE II. The Earl of Gloucester's castle.
DESC:
Enter EDMUND, with a letter\ 01 EDMUND.
Thou, nature, art my goddess; to thy law.
My services are bound\ 01 Wherefore should I.
Stand in the plague of custom, and permit.
The curiosity of nations to deprive me,.
For that I am some twelve or fourteen moon-shines.
....
$ ./padre-sr /opt/funnelback/...../index -titleonly
<html>
<head><meta http-equiv="content-type" content="text/html; charset=UTF-8">
</head>
<h3>Document 1:
<a href='http://test-data.funnelback.com/Shakespeare/lear/lear.1.2.html'>test-data.funnelback.com/Shakespeare/lear/lear.1.2.html</a></h3>
<table border=1>
<tr><td>56</td><td>101</td><td>102</td><td>4358</td><td>At offset 0</td></tr></table
<b>Title: </b>SCENE II. The Earl of Gloucester's castle. <br>
<h3>Document 2:
<a href='http://test-data.funnelback.com/Shakespeare/lear/lear.2.2.html'>test-data.funnelback.com/Shakespeare/lear/lear.2.2.html</a></h3>
...
$ ./padre-sr /opt/funnelback/...../index -titleonly 1 1
<html>
<head><meta http-equiv="content-type" content="text/html; charset=UTF-8">
</head>
<h3>Document 2:
<a href='http://test-data.funnelback.com/Shakespeare/lear/lear.2.2.html'>test-data.funnelback.com/Shakespeare/lear/lear.2.2.html</a></h3>
<table border=1>
<tr><td>56</td><td>96</td><td>97</td><td>3971</td><td>At offset 0</td></tr></table
<b>Title: </b>SCENE II. Before Gloucester's castle. <br>
<hr>
<b>Documents Actually Read: 1</b>
</html>
$
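
In the -text output shown above, each document begins with a DOCUMENT line followed by a TITLE line, so a small filter can turn that output into a list of URLs and titles. The following is a minimal sketch rather than part of padre-sr: the function name list_titles is hypothetical, and it assumes every document follows the line layout seen in the example.

```shell
# list_titles (hypothetical helper): read padre-sr -text output on stdin
# and print one "URL<TAB>title" line per document.
# Assumption: each document starts with "DOCUMENT n: URL" and is followed
# by a "TITLE: ..." line, as in the example output above.
list_titles() {
  awk '
    # Remember the URL from the most recent DOCUMENT line.
    /^DOCUMENT [0-9]+: / { url = $3; next }

    # Only the first TITLE line after a DOCUMENT line is used, so body
    # text that happens to start with "TITLE: " is ignored.
    /^TITLE: / && url != "" {
      sub(/^TITLE: /, "")      # drop the label
      sub(/[ \t]+$/, "")       # drop trailing whitespace
      printf "%s\t%s\n", url, $0
      url = ""
    }
  '
}
```

It would be used by piping the text format into it, for example ./padre-sr /opt/funnelback/...../index -text | list_titles. Working from the text format avoids parsing the HTML output.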