Post by patteoks »

I have a list, for example:

1 Corinthians
2 Corinthians
1 Timothy
1 John
2 John
1 Kings

I want to turn it into


The regex filter I tried to use, built in RegexBuddy with the Perl flavour, was:

Find Pattern (Perl Style)
([123])\ ([SKCTPJ][aihoe]\w*\ ?\d{0,3}:?)|([12])\ (Thessalonians|THESSALONIANS ?\d{0,3})

Replace with:
\1\2\3\4

TextPipe gave me an error, although the same pattern worked in RegexBuddy and EditPad Pro.

It only worked after I read this forum and discovered that I had to use

Replace with:
$1$$2$$3$$4$

What puzzles me is that I couldn't find an example or documentation (Extract below) indicating that I had to use $1$$2$$3$$4$.

What am I missing?

Does anyone in this forum also use RegexBuddy with TextPipe, and can you offer any insight into why \1\2\3\4 worked in RegexBuddy but not in TextPipe?

Thanks for your help.
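
For comparison, here is a minimal sketch of the same find pattern in Python's `re` module, which accepts `\1`-style references in the replacement string, as RegexBuddy's Perl flavour does. Python is only a stand-in here and says nothing about TextPipe's own syntax; the function name `join_book_numbers` is hypothetical.

```python
import re

# The find pattern from the post, split across two lines for readability.
PATTERN = re.compile(
    r"([123])\ ([SKCTPJ][aihoe]\w*\ ?\d{0,3}:?)"
    r"|([12])\ (Thessalonians|THESSALONIANS ?\d{0,3})"
)

def join_book_numbers(text):
    # Groups that did not take part in the match expand to an empty
    # string (Python 3.5+), so listing all four groups is harmless.
    return PATTERN.sub(r"\1\2\3\4", text)

for name in ["1 Corinthians", "2 John", "1 Kings"]:
    print(join_book_numbers(name))
```

Each match is rewritten as the captured number followed directly by the captured name, so the space between them is dropped.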

Re: Back References

Post by dfhtextpipe »

I use TextPipe regularly in connection with work on Biblical texts. I have 9 years' experience in this field.

You didn't indicate whether the English Bible book names you gave as examples are part of free text or part of a structured document.

If they are in a structured document, it would be much simpler to use a restrict filter to govern the replacements.

The actual replacement then becomes much simpler.

Perl pattern [(\d) (\w+)] with [$1$$2]
   [X] Match case
   [X] Whole words only
   [ ] Case sensitive replace
   [ ] Prompt on replace
   [ ] Skip prompt if identical
   [ ] First only
   [ ] Extract matches   Maximum text buffer size 4096
   [X] Maximum match (greedy)
   [ ] Allow comments
   [ ] '.' matches newline
   [X] UTF-8 Support

   [ ] Process longest strings first
   [ ] Simultaneous search

So the more important question is: what kind of structure does your input file have?
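
As a rough illustration of the simpler pattern above, here is a sketch in Python rather than TextPipe. It assumes that "Whole words only" behaves roughly like wrapping the pattern in `\b` word boundaries, which is a guess and not taken from the TextPipe documentation; the function name `simplify` is hypothetical.

```python
import re

def simplify(text):
    # (\d) captures the book number, (\w+) the book name.
    # The \b anchors stand in for "Whole words only" (an assumption).
    return re.sub(r"\b(\d) (\w+)\b", r"\1\2", text)

print(simplify("1 Corinthians"))
print(simplify("1 John and 2 John"))
```

Because this pattern matches any digit followed by any word, it is only safe on text already restricted to book names, which is why the restrict filter matters.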

Best regards,

David Haslam
An active volunteer for the CrossWire Bible Society

PS. I don't use RegexBuddy.
My two favourite Unicode text editors are Notepad++ and BabelPad.
On rare occasions I have used EditPad Lite for file format conversions.

btw. When quoting from something such as the TextPipe Help file, it's sensible to use the Quote feature of phpBB.

Re: Back References

Post by DataMystic Support »

Back references are used inside the search pattern, not inside the replace pattern.

Different tools encode this in different ways: some use %, some $, some \, some @, some [ and so on. There is no standard.

We use \ for regex escape sequences (\r\n\t etc), $ for captured variables ($1, $2 etc, or $1$ when it is hard-up against the next variable as in $1$$2$), @ for macros (e.g. @fullInputFilename) or named captured variables (@phonenumber), and % for environment variables (e.g. %PATH).
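
The need for a closing delimiter when a variable sits hard up against other text is not unique to TextPipe. As an analogy only, here is how the same ambiguity shows up in Python's `re` module, where `\g<1>` plays a role similar to `$1$`. The function names are hypothetical, and none of this is TextPipe syntax.

```python
import re

def append_zero(text):
    # \g<1> delimits the group number, so the "0" after it is a literal.
    return re.sub(r"(\d)", r"\g<1>0", text)

def undelimited_fails():
    # Without a delimiter, \10 is read as a reference to group 10,
    # which does not exist in this pattern, so Python raises re.error.
    try:
        re.sub(r"(\d)", r"\10", "7")
    except re.error:
        return True
    return False

print(append_zero("7"))
print(undelimited_fails())
```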

The help file is quite clear on this!
