Created Summary Author Referred Summary Tags Select 2020-06-26 09:59:15.233000 Without the r the string is co...
Without the r the string is cooked and the backslashes are eaten; the string is consistent but the regex fails. I generally use UltraEdit, which includes syntax colouring, and it parses the r. But expl
syntax, 2020-06-26 09:59:14.876000 The lexical whirlpool of keeping...
The lexical whirlpool of keeping syntax colouring sane. A regex for extracting the DOI body is \b(10[.][0-9]{4,}(?:[.][0-9]+)*/(?:(?![\"&\'])\S)+)\b including escaped quotes. For Python regex it'
syntax, 2020-06-26 09:59:15.531000 So the escaped ' turns off the co...
So the escaped ' turns off the colouring for strings and the rest of the file is coloured as a string. The file itself is fine, but unusable in that state. Python treats the r prefix as case-insensitive, so R is perfectly fine. Ultr
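A minimal Python sketch of the raw-string point in the notes above. The pattern is copied from the note; the helper name find_dois and the sample text are made up for illustration.

```python
import re

# Pattern copied from the note, written as a raw string so that \b and \S
# reach the regex engine intact. The \' inside the character class is legal
# in a raw string: the backslash stays in the string and the quote does not
# terminate it, which is the construct that can confuse syntax colouring.
DOI_PATTERN = re.compile(r'\b(10[.][0-9]{4,}(?:[.][0-9]+)*/(?:(?![\"&\'])\S)+)\b')


def find_dois(text):
    """Return every DOI body found in text (hypothetical helper)."""
    return DOI_PATTERN.findall(text)


# Cooked versus raw: without the prefix, \b becomes one backspace character.
cooked = '\b'
raw = r'\b'
upper = R'\b'  # the prefix is case-insensitive, so R behaves like r

# Illustrative sample text, not taken from the notes.
sample = 'See doi:10.1000/xyz123 and "10.5555/abc.def" for details.'
print(find_dois(sample))
print(len(cooked), len(raw), raw == upper)
```

With the cooked form the regex engine receives a backspace where a word boundary was intended, so the pattern compiles but does not match a DOI.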
syntax, 2020-07-01 12:01:24 Emailing: Phoenicia_Identity_and_Geopolitics_in_th.pdf
This is a tricky article and has been reprocessed to see if it's now straightened out. Phoenicia_Identity_and_Geopolitics_in_th.pdf
2020-06-24 17:07:40 First test of Apache Tika, fourth attempt
Using the marmite server to extract metadata and text from a PDF as a first step towards getting the abstract, if possible. The first time, the metadata was returned but not the text. Now using PUT fo
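A rough sketch of what PUT requests to a Tika server could look like from Python, using only the standard library. The base address is an assumption (the notes do not give the marmite host name or port in full; 9998 is Tika server's default port), and the function names are hypothetical. The /tika endpoint returns extracted text and /meta returns metadata.

```python
import urllib.request

# Assumed address: the real host and port of the marmite server are not
# given in the notes. 9998 is the Tika server default.
TIKA_BASE = "http://marmite:9998"


def build_tika_request(pdf_bytes, endpoint="/tika", accept="text/plain",
                       base=TIKA_BASE):
    """Build (but do not send) a PUT request carrying the PDF bytes."""
    return urllib.request.Request(
        base + endpoint,
        data=pdf_bytes,
        method="PUT",
        headers={"Accept": accept, "Content-Type": "application/pdf"},
    )


def extract(pdf_path):
    """Send the PDF to /tika for text and to /meta for metadata."""
    with open(pdf_path, "rb") as fh:
        pdf_bytes = fh.read()
    results = {}
    targets = (
        ("text", "/tika", "text/plain"),
        ("meta", "/meta", "application/json"),
    )
    for name, endpoint, accept in targets:
        req = build_tika_request(pdf_bytes, endpoint, accept)
        with urllib.request.urlopen(req, timeout=60) as resp:
            results[name] = resp.read().decode("utf-8", errors="replace")
    return results
```

Keeping the two calls separate makes it easier to see which half fails when metadata comes back but text does not.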