from a Yahoo! and XML geek

Quick Links: Consulting | Book info | Brain Attic | Home

Push Button Paradise

Micah Dubinko

Wed, 17 Aug 2005

TextPath opened

I have released my work on TextPath as part of the pyxmlwiki project, under the same Apache 2.0 license.

(As you will recall, my initial strategy was to use the code as 'advertising' for my consulting business. Since that business is closing down, the new strategy is to open it up for whoever can find it useful. Enjoy!)

Like the rest of the code there, it is available under CVS. I'm also just now playing around with the SourceForge File Release System, so there's a downloadable package. If you spot any problems, let me know.

Code notes: features a hand-crafted tokenizer with strategic lookaheads to really figure out what the current token is without a lot of futz. Includes unit tests inline (that is, separate from the rest of the PyXMLWiki tests).

-m

posted at: 22:02 | under: 2005-08 | 4 comment(s)



I like the idea Micah. Makes lots of sense.
No code up on CVS to show how I might use it?

'd appreciate it if you
could provide some outline code?
Posted by Dave Pawson at Fri Aug 19 05:31:17 2005

Once you get the hang of it, the unit tests make pretty good samples. -m
Posted by Micah Dubinko at Fri Aug 19 06:39:14 2005

Block level markup is a breeze Micah :-)


tp = TextPath.TextPathEngine()

parseDictionary ={
'TOPIC:':'topic',
'DATE:':'date',
'KEYWORDS:':'kwds'
}



def parsefile(filename):
  """Parse a file for parseDictionary terms, process using prLine"""
  tp = TextPath.TextPathEngine()
  for line in open(filename).readlines():
  for patt,tag in parseDictionary.iteritems():
  prLine(line,patt,tag)

def prLine(line,patt,tag):
  """Process the line, looking for patt.
  Mark up with tag on matches"""
  res = getLineContent(patt, line)
  if (res):
  print mu(res,tag)



def getLineContent(pattern, string):
  """Use the textPath to seek matches on pattern in string"""
  patt="(%s)...eol()" % pattern
  res=tp.selectStrings(patt,string)
  if len(res) > 0:
  return res
  else:
  return 0

def mu(line,tag):
  """Mark up the line with the given tags, stripping whitespace"""
  retval=""
  for i in line:
  retval = retval + "<%s>%s</%s>"% (tag,i.strip(),tag)
  return retval
Posted by Dave Pawson at Tue Aug 23 05:35:37 2005


Syndicate: RSS feed

What am I reading?
Don Quixote
Self-Editing for Fiction Writers
The Complete Joy of Homebrewing
Analog magazine
Compilers
TAOCP


What am I browsing?
BlogFour
Blake Ross
Brianstorms
Caveat Lector
Claus Wahlers
Copia
Cringely
David Temkin
Dave Hyatt
Groklaw
Mark Birbeck
M.David
Miguel de Icaza
Mitch Kapor
Norm Walsh
Omar Tazi
Sean McGrath
Sjoerd Visscher
Ted Leung
Tom Bradford
Wil Wheaton


Archives:
Link

Powered by PyBlosxom