Push Button Paradise
Micah Dubinko
Wed, 17 Aug 2005
TextPath opened
I have released my work on TextPath as part of the pyxmlwiki project, under the same Apache 2.0 license.
(As you will recall, my initial strategy was to use the code as 'advertising' for my consulting business. Since that business is closing down, the new strategy is to open it up for whoever can find it useful. Enjoy!)
Like the rest of the code there, it is available under CVS. I'm also just now playing around with the SourceForge File Release System, so there's a downloadable package. If you spot any problems, let me know.
Code notes: features a hand-crafted tokenizer with strategic lookaheads to really figure out what the current token is without a lot of futz. Includes unit tests inline (that is, separate from the rest of the PyXMLWiki tests).
-m
posted at: 22:02 | under: 2005-08 | 4 comment(s)
No code up on CVS to show how I might use it?
'd appreciate it if you
could provide some outline code?
Posted by Dave Pawson at Fri Aug 19 05:31:17 2005
Posted by Micah Dubinko at Fri Aug 19 06:39:14 2005
tp = TextPath.TextPathEngine()
parseDictionary ={
'TOPIC:':'topic',
'DATE:':'date',
'KEYWORDS:':'kwds'
}
def parsefile(filename):
"""Parse a file for parseDictionary terms, process using prLine"""
tp = TextPath.TextPathEngine()
for line in open(filename).readlines():
for patt,tag in parseDictionary.iteritems():
prLine(line,patt,tag)
def prLine(line,patt,tag):
"""Process the line, looking for patt.
Mark up with tag on matches"""
res = getLineContent(patt, line)
if (res):
print mu(res,tag)
def getLineContent(pattern, string):
"""Use the textPath to seek matches on pattern in string"""
patt="(%s)...eol()" % pattern
res=tp.selectStrings(patt,string)
if len(res) > 0:
return res
else:
return 0
def mu(line,tag):
"""Mark up the line with the given tags, stripping whitespace"""
retval=""
for i in line:
retval = retval + "<%s>%s</%s>"% (tag,i.strip(),tag)
return retval
Posted by Dave Pawson at Tue Aug 23 05:35:37 2005