<div dir="ltr"><div dir="ltr"><br></div><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Oct 19, 2021 at 4:55 PM Mike Marchywka <<a href="mailto:marchywka@hotmail.com">marchywka@hotmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Here is a link that turns up on google scholar, <br>
<br>
<a href="https://www.ijvsbt.org/index.php/journal/article/download/1386/1058" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/download/1386/1058</a><br>
<br>
It has a doi in it but a scraper would be hard pressed to find it. Zotero did<br>
not find it and I did not find it.</blockquote><div><br></div><div>The link is to a PDF file that absolutely does NOT contain any DOI numbers.</div><div><br></div><div>The scrapper I use which is a half-an-hour job on BeatifulSoup finds a candidate</div><div>to DOI in the string:</div><div><br></div><div> 10.21887/ijvsbt.17.1.24</div><div><br></div><div>by validation marks it as non-valid DOI on 2021-10-20-03:03:28 UTC.</div><div><br></div><div>I was wondering what do you think is a DOI in this document. Our script thinks</div><div>there are none and a quick check confirms that.</div><div><br></div><div>We all know that if you start with a false-premise in math you can prove anything</div><div>you want, so it is essential to start with something that is valid.</div><div> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"> It turns out however, that the url<br>
can be modified to find the bibtex but this is harder with a local file<br>
and no URL info. I got my code to work as another special<br>
case. However, it would be nice if there was some simplicity and<br>
uniformity to the process especially for works with no DOI.<br></blockquote><div><br></div><div>Our scrapper knows the rules for finding DOIs in some 500 math journals. We used to </div><div>have that many rules and they were numbered rule-1, rule-2, .... we have now merged </div><div>them into about 50, and they are named: rule-springer, rule-elsevier, rule-ams, ... and</div><div>it is becoming a bit more manageable. </div><div><br></div><div>It is very hard to even come up with a rule -- for a journal -- since there are journals</div><div>with certain rules for years under JStor and another set of rules for years-published</div><div>under somebody else.</div><div><br></div><div>The only solution here will be to associate a unique identifier (ISSN + Year) to a set </div><div>of well-defined rules.... but we need first to define the language that describes these rules.</div><div><br></div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
What objections would there be to just including machine readable <br>
citation info in a PDF file? Absent that, a domain specific document<br>
number and look up facility? lol. <br></blockquote><div><br></div><div>Try! I'll give you the database of the managers of some 2000 math journals and you </div><div>can try asking them ...</div><div><br></div><div>Paulo Ney</div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<br>
<br>
% mjmhandler: toobib guessijvsbt<-guessijvsbt<-handleadhochtml<-citation<br>
% date 2021-10-19:19:39:12 Tue Oct 19 19:39:12 EDT 2021<br>
% srcurl: <a href="https://www.ijvsbt.org/index.php/journal/article/view/1386" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/view/1386</a> <a href="https://www.ijvsbt.org/index.php/journal/article/download/1386/1058" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/download/1386/1058</a><br>
% citeurl: <a href="https://www.ijvsbt.org/index.php/journal/article/view/1386" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/view/1386</a><br>
@article{ClinicalManagementHypothyroidismGunajitPubaleem2021,<br>
X_TooBib = {publisher: ReWriteParse be.get(s)= be.get(dest)=},<br>
abstract_html_url = {<a href="https://www.ijvsbt.org/index.php/journal/article/view/1386" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/view/1386</a>},<br>
author = {Gunajit Das and Pubaleem Deka and Kongkon Jyoti Dutta},<br>
author_institution = {Department of Veterinary Medicine, Lakhimpur College of Veterinary Science, Assam Agricultural University, Joyhing, Assam, India and Department of Veterinary Epidemiology and Preventive Medicine, College of Veterinary Science, Assam Agricultural University, Khanapara, Assam, India and Department of Veterinary Pathology, Lakhimpur College of Veterinary Science, Assam Agricultural University, Joyhing, Assam, India},<br>
date = {2021/01/25},<br>
day = {25},<br>
doi = {10.21887/ijvsbt.17.1.24},<br>
firstpage = {91},<br>
issn = {2395-1176},<br>
issue = {01},<br>
journal = {THE INDIAN JOURNAL OF VETERINARY SCIENCES AND BIOTECHNOLOGY},<br>
journal_abbrev = {IJ Vet Sci \& Bio},<br>
journal_title = {THE INDIAN JOURNAL OF VETERINARY SCIENCES AND BIOTECHNOLOGY},<br>
keywords = {.},<br>
language = {en},<br>
lastpage = {92},<br>
month = {01},<br>
pagetitle = {Clinical Management of Hypothyroidism Associated Dermatological Signs in a Labrador: A Case Report | THE INDIAN JOURNAL OF VETERINARY SCIENCES AND BIOTECHNOLOGY},<br>
pdf_url = {<a href="https://www.ijvsbt.org/index.php/journal/article/download/1386/1058" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/download/1386/1058</a>},<br>
title = {Clinical Management of Hypothyroidism Associated Dermatological Signs in a Labrador: A Case Report},<br>
volume = {17},<br>
year = {2021},<br>
url={<a href="https://www.ijvsbt.org/index.php/journal/article/download/1386/1058" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/download/1386/1058</a>},<br>
srcurl={<a href="https://www.ijvsbt.org/index.php/journal/article/download/1386/1058" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/download/1386/1058</a>},<br>
xsrcurl={<a href="https://www.ijvsbt.org/index.php/journal/article/view/1386" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/view/1386</a>},<br>
citeurl={<a href="https://www.ijvsbt.org/index.php/journal/article/view/1386" rel="noreferrer" target="_blank">https://www.ijvsbt.org/index.php/journal/article/view/1386</a>}<br>
<br>
}<br>
<br>
<br>
<br>
<br>
<br>
Mike Marchywka <br>
306 Charles Cox Drive <br>
Canton, GA 30115<br>
470-758-0799<br>
404-788-1216 <br>
<br>
<br>
<br>
</blockquote></div></div>