Structured text document parser
Calling a class instance (e.g. x()) requires a structured text structure. Doc then parses each paragraph in the structure and finds the special structures within each paragraph. Each special structure is stored as an instance. Special structures nested inside another special structure are stored within the 'top' structure. Example: '-underline this-' would be turned into an underline instance. '-underline **this**' would be stored as an underline instance with a strong instance stored in its string.
Search the paragraph for each special structure
SGML text is ignored and output as-is.
Parse accepts a raw_string, an expr to test the raw_string, and the raw_string’s subparagraphs.
Parse will continue to search through raw_string until all instances of expr in raw_string are found.
If no instances of expr are found, raw_string is returned. Otherwise a list of substrings and instances is returned
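The return convention described above (the raw string when nothing matches, otherwise a list mixing substrings and instances) can be illustrated with a minimal sketch. It is not the package's implementation: the `Strong` class, the `make_instance` parameter, and the `strong_expr` pattern are hypothetical, and the subparagraphs argument is left out for brevity.

```python
import re


class Strong:
    """Hypothetical stand-in for a special-structure instance."""

    def __init__(self, text):
        self.text = text

    def __repr__(self):
        return f"Strong({self.text!r})"


def parse(raw_string, expr, make_instance):
    """Split raw_string into plain substrings and instances.

    Returns raw_string unchanged when expr never matches.
    """
    result = []
    position = 0
    for match in expr.finditer(raw_string):
        # Keep the plain text that precedes this match, if any.
        if match.start() > position:
            result.append(raw_string[position:match.start()])
        result.append(make_instance(match.group(1)))
        position = match.end()
    if not result:
        return raw_string
    # Keep the plain text that trails the last match, if any.
    if position < len(raw_string):
        result.append(raw_string[position:])
    return result


# Assumed pattern for strong text; the real expressions may differ.
strong_expr = re.compile(r"\*\*(.+?)\*\*")

mixed = parse("a **b** c", strong_expr, Strong)
untouched = parse("plain text", strong_expr, Strong)
```

Here `mixed` is a three-item list whose middle item is a `Strong` instance, while `untouched` is the original string.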
Document with images
Structured text character classes
Core document model.
Represents a section of a document with a title and a body
StructuredTextColumn is a cell/column in a table. A cell can hold multiple paragraphs. The cell is either classified as a StructuredTextTableHeader or StructuredTextTableData.
Represents a section of a document with a title and a body
Represents a section of a document with a title and a body
Represents a section of a document with a title and a body
A StructuredTextDocument holds StructuredTextParagraphs as its subparagraphs.
Represents a section of a document containing literal text, such as an example.
A simple embedded image
Represents a section of a document with a title and a body
Represents a section of a document with a title and a body
Represents a section of a document with a title and a body
rows is a list of lists containing tuples, which represent the columns/cells in each row. EX rows = [[('row 1:column1',1)],[('row2:column1',1)]]
Return a tuple where each item is a column/cell's contents. The tuple, result, will be of this format: ("r1 col1", "r1 col2", "r2 col1", "r2 col2")
texts is a tuple where each item is the result of applying the colortext function to a cell's contents. The results need to be inserted back into the individual columns/cells.
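The flatten, transform, and re-insert cycle described above might look like the following sketch. The function names `cell_texts` and `insert_texts` are hypothetical, the second item of each cell tuple is treated as opaque, and `str.upper` merely stands in for the colortext function.

```python
def cell_texts(rows):
    """Flatten rows of (text, extra) tuples into a tuple of cell texts."""
    return tuple(text for row in rows for text, _extra in row)


def insert_texts(rows, texts):
    """Rebuild rows, replacing each cell's text with the next item of texts."""
    remaining = iter(texts)
    return [[(next(remaining), extra) for _text, extra in row] for row in rows]


rows = [[("r1 col1", 1), ("r1 col2", 1)], [("r2 col1", 1), ("r2 col2", 1)]]
texts = cell_texts(rows)
# str.upper is only a placeholder for the real per-cell transformation.
colored = tuple(text.upper() for text in texts)
new_rows = insert_texts(rows, colored)
```

Because both helpers walk the rows in the same order, each transformed text lands back in the cell it came from.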
Runs through the structure and prints out the paragraphs. If the insertion works correctly, display's results should mimic the original paragraphs.
Runs through the structure and prints out the paragraphs. If the insertion works correctly, display's results should mimic the original paragraphs.
Remove the level information for all levels with greater indentation, then return the level under which this paragraph should be inserted.
Find the number of leading spaces. If none, return 0.
Find the paragraph that will be the parent of a sentence and return that paragraph's sub-paragraphs. The new paragraph will be appended to those sub-paragraphs.
Accepts paragraphs, which is a list of lines to be parsed. structurize creates a structure which mimics the structure of the paragraphs. Structure => [paragraph,[sub-paragraphs]]
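As a rough illustration of how indentation can drive the [paragraph, [sub-paragraphs]] nesting, here is a minimal sketch. It is not the package's structurize: `leading_spaces` and `structurize_sketch` are hypothetical names, and the input is assumed to be one string per paragraph.

```python
def leading_spaces(text):
    """Return the number of leading spaces in text, or 0 if there are none."""
    return len(text) - len(text.lstrip(" "))


def structurize_sketch(paragraphs):
    """Nest paragraphs by indentation into [text, [sub-paragraphs]] items."""
    top = []
    # Stack of (indent, children list); the sentinel level accepts anything.
    stack = [(-1, top)]
    for paragraph in paragraphs:
        indent = leading_spaces(paragraph)
        # Discard levels indented at least as deeply as this paragraph,
        # so the top of the stack is the parent's sub-paragraph list.
        while stack[-1][0] >= indent:
            stack.pop()
        node = [paragraph.strip(), []]
        stack[-1][1].append(node)
        stack.append((indent, node[1]))
    return top


structure = structurize_sketch(
    ["Title", "  Body one", "  Body two", "    Nested", "Next"]
)
```

The stack plays the role of the level information: popping removes deeper levels, and the remaining top entry identifies the parent whose sub-paragraphs receive the new paragraph.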
DOM implementation in StructuredText: read-only methods
Attr interface - The Attr interface represents an attribute in an Element object. Attr objects inherit the Node interface.
Returns the name of this attribute.
The name of this node, depending on its type
A code representing the type of the node.
The value of this node, depending on its type
If this attribute was explicitly given a value in the original document, this is true; otherwise, it is false.
Element interface
Retrieves an attribute value by name.
Retrieves an Attr node by name or None if there is no such attribute.
Returns a NodeList of all the Elements with a given tag name in the order in which they would be encountered in a preorder traversal of the Document tree. Parameter: tagname The name of the tag to match (* = all tags). Return Value: A new NodeList object containing all the matched Elements.
The name of this node, depending on its type
A code representing the type of the node.
The parent of this node. All nodes except Document, DocumentFragment, and Attr may have a parent.
The name of the element
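The preorder tag-name lookup described for the Element interface can be sketched as below. The `Element` class and the function name are hypothetical, a plain list stands in for a NodeList, and the sketch assumes that only descendants of the starting element are searched.

```python
class Element:
    """Hypothetical minimal element with a tag name and child elements."""

    def __init__(self, tag, children=()):
        self.tag = tag
        self.children = list(children)


def get_elements_by_tag_name(root, tagname):
    """Collect descendants of root matching tagname in preorder.

    A tagname of '*' matches every element.
    """
    matches = []
    for child in root.children:
        # Visit the child itself before its own descendants (preorder).
        if tagname == "*" or child.tag == tagname:
            matches.append(child)
        matches.extend(get_elements_by_tag_name(child, tagname))
    return matches


tree = Element("doc", [Element("p", [Element("em")]), Element("p")])
all_tags = [element.tag for element in get_elements_by_tag_name(tree, "*")]
paragraphs = get_elements_by_tag_name(tree, "p")
```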
NamedNodeMap interface - Is used to represent collections of nodes that can be accessed by name. NamedNodeMaps are not maintained in any particular order.
Python extensions: can use sequence-style ‘len’, ‘getitem’, and ‘for..in’ constructs, and mapping-style ‘getitem’.
The length of the NodeList
Retrieves a node specified by name. Parameters: name Name of a node to retrieve. Return Value A Node (of any type) with the specified name, or None if the specified name did not identify any node in the map.
Returns the index-th item in the map
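One way to support both access styles mentioned above is to dispatch on the key type, as in this sketch. `NamedNodeMapSketch` is a hypothetical class, the method names follow the W3C DOM naming, and returning None from `item` for an out-of-range index is an assumption. The dict's insertion order is an artifact of the sketch, not a guarantee of the interface.

```python
class NamedNodeMapSketch:
    """Hypothetical map of nodes addressable by name or by position."""

    def __init__(self, nodes_by_name):
        self._nodes = dict(nodes_by_name)

    def __len__(self):
        return len(self._nodes)

    def getNamedItem(self, name):
        """Return the node with the given name, or None if there is none."""
        return self._nodes.get(name)

    def item(self, index):
        """Return the index-th node, or None when index is out of range."""
        values = list(self._nodes.values())
        return values[index] if 0 <= index < len(values) else None

    def __getitem__(self, key):
        # Integers use sequence-style access; other keys are treated as names.
        if isinstance(key, int):
            return list(self._nodes.values())[key]
        return self._nodes[key]


attributes = NamedNodeMapSketch({"href": "node-a", "title": "node-b"})
by_name = attributes["href"]
by_index = attributes[1]
# Iteration falls back to integer indexing until IndexError is raised.
iterated = [node for node in attributes]
```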
Node Interface
Returns a NamedNodeMap containing the attributes of this node (if it is an element) or None otherwise.
Get a Python sequence of children
The node immediately following this node. If there is no such node, this returns None.
The name of this node, depending on its type
The value of this node, depending on its type
The Document object associated with this node, if any.
The parent of this node. All nodes except Document, DocumentFragment, and Attr may have a parent.
The node immediately preceding this node. If there is no such node, this returns None.
Returns true if the node has any children, false if it doesn’t.
NodeList interface - Provides the abstraction of an ordered collection of nodes.
Python extensions: can use sequence-style ‘len’, ‘getitem’, and ‘for..in’ constructs.
The length of the NodeList
Returns the index-th item in the collection
This is an acquisition-like wrapper that provides parent access for DOM sans circular references!
The node immediately following this node. If there is no such node, this returns None.
The Document object associated with this node, if any.
The parent of this node. All nodes except Document, DocumentFragment, and Attr may have a parent.
The node immediately preceding this node. If there is no such node, this returns None.
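The idea of parent access without circular references can be illustrated by wrapping each node together with the wrapper it was reached through, so that the tree itself never stores a back-pointer. This is only a sketch of the general technique: `PlainNode` and `ParentAccessSketch` are hypothetical classes, and the real wrapper may work differently.

```python
class PlainNode:
    """Hypothetical tree node that stores children only, never its parent."""

    def __init__(self, name, children=()):
        self.name = name
        self.children = list(children)


class ParentAccessSketch:
    """Pairs a node with the wrapper it was reached through."""

    def __init__(self, node, parent=None):
        self.node = node
        self.parent = parent

    def getParentNode(self):
        return self.parent

    def getChildNodes(self):
        # Wrappers are built on demand; the wrapped tree holds no back-references.
        return [ParentAccessSketch(child, self) for child in self.node.children]

    def _sibling(self, offset):
        if self.parent is None:
            return None
        siblings = self.parent.node.children
        index = next(i for i, s in enumerate(siblings) if s is self.node) + offset
        if 0 <= index < len(siblings):
            return ParentAccessSketch(siblings[index], self.parent)
        return None

    def getPreviousSibling(self):
        return self._sibling(-1)

    def getNextSibling(self):
        return self._sibling(+1)


tree = PlainNode("doc", [PlainNode("a"), PlainNode("b")])
root = ParentAccessSketch(tree)
first, second = root.getChildNodes()
```

Since the parent link lives only in the short-lived wrapper, discarding the wrappers leaves a plain, cycle-free tree.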
A node that can have children, or, more precisely, that implements the child access methods of the DOM.
Returns a NodeList that contains all children of this node. If there are no children, this is an empty NodeList.
The first child of this node. If there is no such node, this returns None.
The last child of this node. If there is no such node this returns None.
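The child access behaviour described above, including the None results for a childless node, could be sketched as follows. `ParentNodeSketch` is a hypothetical class and a plain list stands in for a NodeList.

```python
class ParentNodeSketch:
    """Hypothetical node implementing the child access methods."""

    def __init__(self, children=()):
        self._children = list(children)

    def getChildNodes(self):
        # A copy, so callers cannot mutate the node's own child list.
        return list(self._children)

    def getFirstChild(self):
        return self._children[0] if self._children else None

    def getLastChild(self):
        return self._children[-1] if self._children else None

    def hasChildNodes(self):
        return bool(self._children)


leaf = ParentNodeSketch()
branch = ParentNodeSketch(["first", "middle", "last"])
```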
HTML renderer for STX documents.
Render STX document as docbook.
Structured text document renderer for Docbook.
Zope structured text markup
Consider the following example:
>>> from zope.structuredtext.stng import structurize
>>> from zope.structuredtext.document import DocumentWithImages
>>> from zope.structuredtext.html import HTMLWithImages
>>> from zope.structuredtext.docbook import DocBook
We first need to structurize the string (structured_string below stands for
the raw structured text) and make a full-blown document out of it:
>>> struct = structurize(structured_string)
>>> doc = DocumentWithImages()(struct)
Now feed it to some output generator, in this case HTML or DocBook:
>>> output = HTMLWithImages()(doc, level=1)
>>> output = DocBook()(doc, level=1)