Toobib another case Zotero may not work and email interface

Mike Marchywka marchywka at hotmail.com
Fri Feb 4 14:09:00 CET 2022



I'm trying to clean up Toobib to the point where it is safe to
make available by email. In some quick tests, the email interface is nice
if you can wait a few minutes for the results and want a record
in email. I also realized that with the log file it is possible
to generate suggestions and commentary about other users as well as include
text ads :)

Right now, the handler reacts to "toobib toobib" in the subject
line and finds bibtex for each line in the message
body of the form,
toobib [clip,all] url
where "clip" returns the first reasonable bibtex
and "all" returns an entry from each candidate source
(and these can differ a lot). Eventually there may be an option
for a high-quality automated merge from the different sources.
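
For illustration only, here is a minimal Python sketch of how such a request
could be recognized. It is not the actual Toobib handler. The names
REQUEST_RE, is_toobib_subject, and parse_requests are hypothetical, and the
sketch assumes the mode word ("clip" or "all") is required and that the url
contains no whitespace.

```python
import re

# Hypothetical pattern for the body-line format described above:
#   toobib clip <url>   or   toobib all <url>
# Assumption: the mode word is mandatory and the url has no spaces.
REQUEST_RE = re.compile(r"^\s*toobib\s+(clip|all)\s+(\S+)\s*$", re.IGNORECASE)


def is_toobib_subject(subject):
    """Return True when the subject contains the trigger 'toobib toobib'."""
    return "toobib toobib" in subject.lower()


def parse_requests(body):
    """Return a list of (mode, url) pairs, one per matching body line."""
    requests = []
    for line in body.splitlines():
        match = REQUEST_RE.match(line)
        if match:
            # Normalize the mode to lower case; leave the url untouched.
            requests.append((match.group(1).lower(), match.group(2)))
    return requests
```

Lines that do not match are silently skipped, so ordinary text and signatures
in the message body would not trigger a lookup.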

But the reason I'm posting is to document another link that
fails on the Zotero webform,


https://journals.lww.com/soilsci/Abstract/1974/02000/Effect_of_Pore_Size_on_Diffusion_Coefficients_in.2.aspx


Looking at my old bash script, the LWW handler used to look for a doi
but this entry from 1974 does not have one. The html, however,
contains a bunch of entries that start with "wkhealth_", using a motif
similar to the "citation_" entries. Once found, it was easy to add a handler,
and maybe a prefix finder could be devised (this is somewhere between
a bunch of hacks and AI lol).
I'm not sure how Zotero is organized but there are probably two
issues: robot exclusion and quirky biblio info. Headless chrome
in emulation mode probably gets around most detectors
that don't want to slow down humans. The quirks generally
fall into patterns that are easy to add once you are familiar with this
mess.
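
As a rough sketch of the prefix idea, the Python below collects meta-style
entries from a page and counts name prefixes. It is not the Toobib code, and
it assumes the "wkhealth_" entries appear as <meta name="..." content="...">
tags, the way "citation_" entries usually do; I have not shown that here. The
names MetaCollector, collect_meta, fields_with_prefix, and guess_prefixes are
made up for the example.

```python
from collections import Counter
from html.parser import HTMLParser


class MetaCollector(HTMLParser):
    """Collect (name, content) pairs from <meta name=... content=...> tags."""

    def __init__(self):
        super().__init__()
        self.pairs = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name, content = attr.get("name"), attr.get("content")
        if name and content is not None:
            self.pairs.append((name, content))


def collect_meta(html):
    """Parse an html string and return its list of (name, content) pairs."""
    parser = MetaCollector()
    parser.feed(html)
    return parser.pairs


def fields_with_prefix(pairs, prefix):
    """Return {name-without-prefix: [values]} for names starting with prefix."""
    fields = {}
    for name, content in pairs:
        if name.startswith(prefix):
            # Keep every value, since a name (e.g. an author) may repeat.
            fields.setdefault(name[len(prefix):], []).append(content)
    return fields


def guess_prefixes(pairs, min_count=3):
    """Return 'xxx_' prefixes that occur at least min_count times."""
    counts = Counter(
        name.split("_", 1)[0] + "_" for name, _ in pairs if "_" in name
    )
    return [prefix for prefix, n in counts.most_common() if n >= min_count]
```

A prefix that guess_prefixes reports and that no existing handler knows about
would be a candidate for a new handler. The min_count threshold is arbitrary.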

Further, their "cite" button is limited to a specific rendering,
RIS, and EndNote or similar. This kind of junk is ideal for automation
or gasp standards gasp lol.


Adding this, Toobib came up with,

% mjmhandler: toobib handleadhochtml<-wkhealth
% date 2022-02-04:06:45:42 Fri Feb 4 06:45:42 EST 2022
% srcurl: https://journals.lww.com/soilsci/Abstract/1974/02000/Effect_of_Pore_Size_on_Diffusion_Coefficients_in.2.aspx
% citeurl: https://journals.lww.com/soilsci/Abstract/1974/02000/Effect_of_Pore_Size_on_Diffusion_Coefficients_in.2.aspx
@article{EFFECTPORESIZESAXENA1974,
X_TooBib = {publisher: ReWriteParse be.get(s)= be.get(dest)=},
abstract_html_url = {https://journals.lww.com/soilsci/Abstract/1974/02000/EFFECT_OF_PORE_SIZE_ON_DIFFUSION_COEFFICIENTS_IN.2.aspx},
article_publication_date = {February 01, 1974},
author = {SAXENA, S. K. and BOERSMA, L. and LINDSTROM, F. T. and YOUNG, J. L.},
doi = {},
ejpid = {soilsci:1974:02000:00002},
firstpage = {80},
fulltext_html_url = {https://journals.lww.com/soilsci/Fulltext/1974/02000/EFFECT_OF_PORE_SIZE_ON_DIFFUSION_COEFFICIENTS_IN.2.aspx},
issn = {0038-075X},
issue = {2},
journal = {Soil Science},
journal_title = {Soil Science},
journal_title_legacy = {Soil Science},
month = {2},
ovid_accession_number = {00010694-197402000-00002},
pagetitle = {EFFECT OF PORE SIZE ON DIFFUSION COEFFICIENTS IN POROUS MEDI... : Soil Science},
pdf_url = {https://journals.lww.com/soilsci/_layouts/15/oaks.journals/downloadpdf.aspx?an=00010694-197402000-00002},
title = {EFFECT OF PORE SIZE ON DIFFUSION COEFFICIENTS IN POROUS MEDIA},
toc_section = {ARTICLE},
toc_sub_section = {PDF Only},
volume = {117},
x_toobib_mutatemissing = {ds=journal (*ii).second=Soil Science ss=journal_title},
year = {1974},
url={https://journals.lww.com/soilsci/Abstract/1974/02000/Effect_of_Pore_Size_on_Diffusion_Coefficients_in.2.aspx},
srcurl={https://journals.lww.com/soilsci/Abstract/1974/02000/Effect_of_Pore_Size_on_Diffusion_Coefficients_in.2.aspx},
xsrcurl={https://journals.lww.com/soilsci/Abstract/1974/02000/Effect_of_Pore_Size_on_Diffusion_Coefficients_in.2.aspx},
citeurl={https://journals.lww.com/soilsci/Abstract/1974/02000/Effect_of_Pore_Size_on_Diffusion_Coefficients_in.2.aspx}

}

now back to actual research and writing ...


-- 

mike marchywka
306 charles cox
canton GA 30115
USA, Earth 
marchywka at hotmail.com
404-788-1216
ORCID: 0000-0001-9237-455X


More information about the texhax mailing list.