[biblio] cdc web page bibliography info and Zotero output / sort of becoming a newsletter or report on cite bib performance :)

Mike Marchywka marchywka at hotmail.com
Fri Aug 20 23:08:45 CEST 2021


I've had mixed luck with the US gov sites and was just surprised at the CDC
page below for having a bunch of stuff.  

For this link, 
https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w

Zotero webform at zbib.org returned  this,

@article{olsen_changes_2021,
	title = {Changes in influenza and other respiratory virus activity during the covid-19 pandemic — united states, 2020–2021},
	volume = {70},
	issn = {0149-21951545-861X},
	url = {https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm},
	doi = {10.15585/mmwr.mm7029a1},
	abstract = {This report describes the circulation of influenza and other ...},
	language = {en-us},
	urldate = {2021-08-20},
	journal = {MMWR. Morbidity and Mortality Weekly Report},
	author = {Olsen, Sonja J.},
	year = {2021},
}

Which is perfectly usable but it only got one author although it did pick up the issn which I missed with TooBib
in default mode. However, once again, it looks like the publisher ends up with different info on different 
bibtex paths. If you really want a good bib, it is probably best to look at all of these and integrate. I don't do that,
just take the first one that works, but you can see there is a lot of info here. 
I found 7 entries ( some redundant sources - handlegsmeta-xxx all scrape the dirty html in different ways )  
and you can see how the publisher supplies different info in the different sources
 ( some of these were modified by my code, they are not verbatim from the source )
Also, it may be possible for researchers and authors to complain to sites and get
better citation uniformity :) My "thing" now is retractions and getting them noted
in automated bibtex. 


 % mjmhandler: toobib handledoi
% date 2021-08-20:16:55:14 Fri Aug 20 16:55:14 EDT 2021
% srcurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
% citeurl: http://api.crossref.org/works/10.15585/mmwr.mm7029a1/transform/application/x-bibtex
@article{2021_Sonja_Olsen_Amber_Winn_Alicia_Budd,
author = {Sonja J. Olsen and Amber K. Winn and Alicia P. Budd and Mila M. Prill and John Steel and Claire M. Midgley and Krista Kniss and Erin Burns and Thomas Rowe and Angela Foust and Gabriela Jasso and Angiezel Merced-Morales and C. Todd Davis and Yunho Jang and Joyce Jones and Peter Daly and Larisa Gubareva and John Barnes and Rebecca Kondor and Wendy Sessions and Catherine Smith and David E. Wentworth and Shikha Garg and Fiona P. Havers and Alicia M. Fry and Aron J. Hall and Lynnette Brammer and Benjamin J. Silk},
doi = {10.15585/mmwr.mm7029a1},
month = {jul},
number = {29},
pages = {1013--1019},
publisher = {Centers for Disease Control {MMWR} Office},
title = {Changes in Influenza and Other Respiratory Virus Activity During the {COVID}-19 Pandemic {\textemdash} United States, 2020{\textendash}2021},
url = {https://doi.org/10.15585%2Fmmwr.mm7029a1},
volume = {70},
year = {2021},
srcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
xsrcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
citeurl={http://api.crossref.org/works/10.15585/mmwr.mm7029a1/transform/application/x-bibtex}

}

% mjmhandler: toobib handleadhochtml<-citation
% date 2021-08-20:16:55:16 Fri Aug 20 16:55:16 EDT 2021
% srcurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
% citeurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
@article{ChangesInfluenzaSonjaOlsen,
X_TooBib = {date: FixBeKvp s=2021 cmd=date -f -  "+%Y-%m-%d" d=2021-08-20 dn=date},
X_TooBib = {publisher: ReWriteParse be.get(s)= be.get(dest)=},
author = {Sonja J. Olsen},
categories = {Full Report},
date = {2021-08-20},
day = {20},
doi = {10.15585/mmwr.mm7029a1},
issn = {0149-21951545-861X},
journal = {MMWR. Morbidity and Mortality Weekly Report},
journal_abbrev = {MMWR Morb Mortal Wkly Rep},
journal_title = {MMWR. Morbidity and Mortality Weekly Report},
month = {08},
pagetitle = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic � United States, 2020�2021  | MMWR},
publication_date = {2021},
title = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic � United States, 2020�2021},
volume = {70},
year = {2021},
url={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
srcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
xsrcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
citeurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w}

}

% mjmhandler: toobib handleadhochtml<-DC
% date 2021-08-20:16:55:16 Fri Aug 20 16:55:16 EDT 2021
% srcurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
% citeurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
@article{ChangesInfluenza2021,
X_TooBib = {author: ReWriteParse be.get("author")=},
X_TooBib = {publisher: ReWriteParse be.get(s)= be.get(dest)=},
date = {2021-07-22T05:08:16Z},
day = {22},
month = {07},
pagetitle = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic � United States, 2020��2021  | MMWR},
title = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic � United States, 2020��2021  | MMWR},
year = {2021},
url={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
srcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
xsrcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
citeurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w}

}

% mjmhandler: toobib handleadhochtml<-og
% date 2021-08-20:16:55:16 Fri Aug 20 16:55:16 EDT 2021
% srcurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
% citeurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
@article{ChangesInfluenzacdc,
X_TooBib = {publisher: ReWriteParse be.get(s)=Centers for Disease Control and Prevention be.get(dest)=},
abstract = {This report describes the circulation of influenza and other ...},
author = {CDC},
day = {},
description = {This report describes the circulation of influenza and other ...},
image = {https://www.cdc.gov/mmwr/volumes/70/wr/social-media/mm7029a1_RespiratoryViralActivity_IMAGE_23July2021_1200x627.jpg},
image:type = {image/jpeg},
month = {},
pagetitle = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic �� United States, 2020�2021  | MMWR},
publisher = {Centers for Disease Control and Prevention},
title = {Changes in Influenza and Other Respiratory Virus ...},
type = {article},
url = {https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm},
srcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
xsrcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
citeurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w}

}

% mjmhandler: toobib handleadhochtml<-cdc
% date 2021-08-20:16:55:16 Fri Aug 20 16:55:16 EDT 2021
% srcurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
% citeurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
@article{ChangesInfluenzaDEPUTYDIRECTORPUBLIC,
X_TooBib = {publisher: ReWriteParse be.get(s)= be.get(dest)=},
author = {DEPUTY DIRECTOR FOR PUBLIC HEALTH SCIENCE AND SURVEILLANCE},
content_id = {81515},
content_source = {DEPUTY DIRECTOR FOR PUBLIC HEALTH SCIENCE AND SURVEILLANCE},
date = {July 22, 2021, 01:00 PM},
day = {},
last_published = {2021-07-22T17:08:47Z},
last_reviewed = {July 22, 2021},
last_updated = {July 22, 2021, 01:00 PM},
maintained_by = {DEPUTY DIRECTOR FOR PUBLIC HEALTH SCIENCE AND SURVEILLANCE},
month = {},
pagetitle = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic  United States, 2020�2021  | MMWR},
template_version = {4.0},
title = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic  United States, 2020�2021  | MMWR},
wcms_build = {4.9.15 - b.1044},
year = {},
url={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
srcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
xsrcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
citeurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w}

}

% mjmhandler: toobib handlegsmeta(html)
% date 2021-08-20:16:55:16 Fri Aug 20 16:55:16 EDT 2021
% srcurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
% citeurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
@article{ChangesInfluenzaSonjaOlsen,
X_TooBib = {author: ReWriteParse be.get("author")=},
X_TooBib = {publisher: ReWriteParse be.get(s)= be.get(dest)=},
author = {Sonja J. Olsen},
categories = {Full Report},
date = {2021},
doi = {10.15585/mmwr.mm7029a1},
firstpage = {},
issn = {0149-21951545-861X},
journal = {MMWR. Morbidity and Mortality Weekly Report},
journal_abbrev = {MMWR Morb Mortal Wkly Rep},
lastpage = {},
title = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic � United States, 2020��2021},
volume = {70},
url={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
srcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
xsrcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
citeurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w}

}

% mjmhandler: toobib handlegsmeta(scraper)
% date 2021-08-20:16:55:16 Fri Aug 20 16:55:16 EDT 2021
% srcurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
% citeurl: https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w
@article{ChangesInfluenzaSonjaOlsen,
X_TooBib = {author: ReWriteParse be.get("author")=},
X_TooBib = {publisher: ReWriteParse be.get(s)= be.get(dest)=},
author = {Sonja J. Olsen},
categories = {Full Report},
date = {2021},
doi = {10.15585/mmwr.mm7029a1},
firstpage = {},
issn = {0149-21951545-861X},
journal = {MMWR. Morbidity and Mortality Weekly Report},
journal_abbrev = {MMWR Morb Mortal Wkly Rep},
lastpage = {},
title = {Changes in Influenza and Other Respiratory Virus Activity During the COVID-19 Pandemic �� United States, 2020��2021},
volume = {70},
url={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
srcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
xsrcurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w},
citeurl={https://www.cdc.gov/mmwr/volumes/70/wr/mm7029a1.htm?s_cid=mm7029a1_w}

}


./mjm_med2bib_guesses.h1013  saving to  df=xxxx
./mjm_med2bib_guesses.h1027  have citation   nfound=7 cite=\cite{2021_Sonja_Olsen_Amber_Winn_Alicia_Budd} something=1 paste_citation=0
mjm>






note new address
 Mike Marchywka 306 Charles Cox Drive Canton, GA 30115
470-758-0799
404-788-1216




More information about the biblio mailing list.