Extracting regex from multiple pdfs and lines that surrounds
Posted: Thu Feb 15, 2007 7:38 am
Hello.
I've got a certain problem. I need to find a specific data using the regular expression and save everything that surround it,
from multiple pdfs to one file:
For exalmple I've got:
and i need to find ex. Forty-Niners using regex and get 1 surrounding line each side (or perhaps 50 surrounding characters) to get
How to do this, should I use find/replace or sth else?
Which type of regex will fit here best?
thanks
I've got a certain problem. I need to find a specific data using the regular expression and save everything that surround it,
from multiple pdfs to one file:
For exalmple I've got:
Code: Select all
The California Gold Rush started in January 1848, when gold was
discovered at Sutter's Mill. As news of the discovery spread, some
300,000 people came to California from the rest of the United States and
abroad. These early gold-seekers, called "Forty-Niners," traveled to
California by sailing ship and in covered wagons across the continent,
often facing substantial hardship on the trip
Code: Select all
300,000 people came to California from the rest of the United States and
abroad. These early gold-seekers, called "Forty-Niners," traveled to
California by sailing ship and in covered wagons across the continent,
Which type of regex will fit here best?
thanks