xml2tsv.1 - xml2tsv - a simple xml-to-tsv converter, based on xmlparser

.TH XML2TSV 1 "04/01/2020" "" ""
.SH NAME
xml2tsv \- simple xml-to-tsv converter
.SH SYNOPSIS
.PP
xml2tsv
.PP
.SH DESCRIPTION
.PP
xml2tsv is a simple tool to convert XML given on standard input to a list of
tab-separated-values records. Each XML entry is converted to a line
like:
.EX

/full/path/to/current/entry[TAB]attr1=value1[TAB]attr2=value2[TAB]...[TAB]data[NEWLINE]

.EE
where
.I "/full/path/to/current/entry"
represents the full hierarchy of entries down to the current one. For
instance, the XML snippet:
.EX

<html>
  <head>
    <title>This is a title</title>
  </head>
  <body>
    <h1>It works!</h1>
    <a href="https://my.wonderful.website.net">Click here</a>
  </body>
</html>

.EE
will produce the output:
.EX

/html
/html/head
/html/head/title	This is a title
/html/body
/html/body/h1	It works!
/html/body/a	href=https://my.wonderful.website.net	Click here

.EE
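The mapping from nested entries to path-prefixed records can be illustrated with a short Python sketch. It is not xml2tsv's implementation: it relies on Python's standard xml.etree.ElementTree module instead of xmlparser, the function name flatten is hypothetical, and it assumes that whitespace around text data is trimmed and that no escaping of special characters is needed.

```python
import xml.etree.ElementTree as ET


def flatten(xml_text, sep="\t", sattr="="):
    """Return one record per element: path, name=value pairs, then text data.

    Illustrative only: whitespace trimming is an assumption, and the
    quoting of special characters done by xml2tsv is not reproduced here.
    """
    records = []

    def walk(elem, parent_path):
        # The path is the chain of entry names from the root to this element.
        path = parent_path + "/" + elem.tag
        fields = [path]
        # Attributes come next, each rendered as name=value.
        fields += [name + sattr + value for name, value in elem.attrib.items()]
        # Text data, if any, is the last field.
        text = (elem.text or "").strip()
        if text:
            fields.append(text)
        records.append(sep.join(fields))
        for child in elem:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return records
```

Applied to the XML snippet above, flatten returns six records whose paths run from /html to /html/body/a, in document order.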
By default, xml2tsv quotes '\\n', '\\t', and '\\', and strips other
control characters.
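A program consuming xml2tsv output has to split each record and undo the quoting. The Python sketch below shows one way to do that. The names unescape and parse_record are hypothetical, and two things are assumed rather than documented here: that quoted characters appear as two-character backslash sequences, and that a trailing field without an attribute separator is the data field. Under the second assumption, data that itself contains '=' would be misread as an attribute.

```python
def unescape(field):
    """Undo the assumed backslash quoting of newline, tab and backslash."""
    mapping = {"n": "\n", "t": "\t", "\\": "\\"}
    out = []
    i = 0
    while i < len(field):
        ch = field[i]
        if ch == "\\" and i + 1 < len(field):
            nxt = field[i + 1]
            # Unknown sequences are kept verbatim rather than guessed at.
            out.append(mapping.get(nxt, "\\" + nxt))
            i += 2
        else:
            out.append(ch)
            i += 1
    return "".join(out)


def parse_record(line, sep="\t", sattr="="):
    """Split one record into (path, attributes, data).

    Heuristic (an assumption): the last field is data only if it does
    not contain the attribute separator.
    """
    fields = line.rstrip("\n").split(sep)
    path = fields[0]
    rest = fields[1:]
    data = ""
    if rest and sattr not in rest[-1]:
        data = unescape(rest.pop())
    attrs = {}
    for field in rest:
        name, _, value = field.partition(sattr)
        attrs[name] = unescape(value)
    return path, attrs, data
```

The sep and sattr parameters default to a tab and '=', so that output produced with a different separator or attribute delimiter can be parsed by passing other values.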
.SH CONFIGURATION
The maximum length of an entry name and the maximum depth of an entry
are fixed to STR_MAX and DEPTH_MAX, respectively, and can be changed by
editing the file
.BI config.h
and rebuilding xml2tsv. It is also possible to change the separator
used on output (SEP, by default set to '\\t'), and the character used to
separate the name of an attribute from its value (SATTR, by default set
to '=').
.SH BUGS

xml2tsv currently lacks an option to force printing control characters
on output, if desired.

.SH AUTHORS
xml2tsv is written and maintained by Vincenzo "KatolaZ" Nicosia
<katolaz@freaknet.org>. The code is based on
.BI xmlparser
by Hiltjo Posthuma <hiltjo@codemadness.org>. You can use, distribute,
modify, and redistribute xml2tsv under the terms of the ISC License.