The ACIC Project

Independent Study Opportunity for INFO and MSIM Students—2 to 3 Credits—Spring of 2022

This is an independent study opportunity for both INFO and MSIM students with a strong technical and programming background (object-oriented programming, MySql experience, maybe some php programming—familiarity with AWS and git will also be beneficial).

Context

As part of the ongoing twin project of bibliometric research and curation of the Digital Government Reference Library (DGRL) and the Disaster Information Reference Library (DIRL), which are  geared at systematically collecting peer-reviewed academic publications in the two separate subject domains of (a) Digital Government and (b) Disaster Information Management, we intend to expand the data collection and curation efforts from a manual and human-expert-only process to an ICT-supported screening, collecting, and sorting process, which creates an initial pool of preselected and narrowed-down candidate records, which can be vetted by human experts for ultimate eligibility, completeness, and inclusion in one of the two reference libraries.

Significance

The DGRL with thoroughly curated reference entries to some 17,000 English-language peer-reviewed academic publications in Digital Government represents the body of knowledge in this study domain and has become a major source for researchers when starting a research project, when reviewing a research manuscript, or when analyzing research trends in the study domain. Ever since its inception in 2005, the DGRL has been updated on a semiannual basis. Over the years, peer-reviewed research articles on topics surrounding Digital Government has increased in numbers from an annual going rate of 400+ to over 2,000. At this increased volume, the continuation  of manual curation becomes more and more unfeasible. Automated and ICT-supported curation appears as the timely and appropriate avenue to secure this highly valued and significant resource to the Digital Government research community.
In parallel, the yet smaller DIRL, which was only launched in 2018, appears to follow the trajectory of the DGRL with a current going rate of 500+ new entries per annum. While the DIRL has not yet assumed the widespread recognition, which the DGRL is already enjoying, it seems to be only a matter of time, when a similar impact in the domain of disaster-related information management will be seen. In both cases, the move from manual curation to automated and ICT-supported curation has become a necessity.

Automated Collection and ICT-supported Curation (ACIC)—The Project

The envisioned automated collection and ICT-supported curation (ACIC) project would be informed by and most likely follow the current manual approach. ACIC shall be programmed and documented as open source code.http://faculty.washington.edu/jscholl/

The manual curation process unfolds like this: 

“First, we pull keywords from a set list. The DGRL includes 51 keywords, while the smaller DIRL includes only 10. These keywords can be phrases, such as “digital democracy,” or include Boolean operators, such as “disaster OR crisis OR emergency AND information management.” In Google Scholar, we then use advanced search features and settings to filter results. First, under Settings, we limit the results to pages written in English. Second, we enter our keywords into the Advanced Search page and further limit the results to only return articles dated within a certain range. 

Unfortunately, Google Scholar only allows for limiting the date range by year, making it difficult to narrow the results to the true window of interest, usually 6 months since the last iteration of the database. After hitting search and being navigated to the results page, we sort the results using the “sort by date” feature, which brings the newest results to the top and works backward chronologically. With our limited (manual) capacity, we look through results going back 10 pages (some 100 entries); however, this often does not bring us sufficiently back in time to the last database iteration, meaning that we are possibly missing relevant resources published in the few months after the last iteration.

Finally, we check relevant sources for relevancy and quality.
Based on an understanding of the criteria for inclusion in the database, we open potentially relevant results and determine whether the source is credible. Because the results from Google Scholar are so varied, this understanding is necessary to know to exclude something like “Travelling While Black: Essays Inspired by a Life on the Move,” but to include “Digital Democracy, Social Media, and Disinformation,” results that appear next to each other in a keyword search of “digital democracy.” 
If the title seems relevant, we open the source, but if it is clearly an un-credited PDF, a dissertation, or not written in English, we do not include it. If it appears to be from a journal that we are not familiar with, we run the journal title through Ulrichsweb (available from the UW Library system: https://ulrichsweb-serialssolutions-com.offcampus.lib.washington.edu) to determine whether the journal is legitimate. If it is, we save the source to a Zotero library (https://www.zotero.org/), and if not, it is excluded.”
The open-source ACIC code to be developed for, stored in, and executed from the Cloud would crawl the scholar.google.com site in the same fashion human expert curators would do. The crawler finds potential new records for a given timeframe along a pre-specified list of keywords, and it deposits the full reference record in RIS format on a Cloud-based stack for further inspection. The crawler suspends operation once the hit rate for potentially eligible records has fallen to less than 1 in 10 inspected records for a given annual timeframe.

Once the initial crawl has produced the stack of candidate records, the collected records need to be inspected for completeness. Incomplete records need to be flagged. The list of keywords found in candidate records needs to be inspected. New keywords provided by the candidate records need to be added into a monitor stack. New keywords with high frequency counts need to be flagged and considered for inclusion in the respective keyword registry. Likewise, (old) keywords that produced very low or no numbers of candidate records need to be flagged in the keyword registry and considered for future exclusion in searches.

The ACIC algorithm performs further checks of eligibility along the pre-specified list of criteria and eliminates records, which would not qualify for inclusion in the respective reference library. The code also marks records, which it found eligible for inclusion and complete.
Since the candidate records cannot fully automatically be curated, the final list of candidates including the incomplete records need to be inspected by human subject matter experts.
In order to make that happen, ACIC must provide a web-based user interface, which lets the human expert inspect, add, edit, and delete records; it also needs to let the human expert inspect, add, edit, and delete keywords.

ACIC must also provide an administrative function, by which it can be set up and maintained. The administrative function needs to allow for the bulk export of records in RIS format, which were marked eligible for inclusion by the human expert curators.
Integral part of the project is the detailed documentation of functionality (annotated code) and an online editable user manual (for laymen users).

Academic Impact and Recognition

There will be the opportunity to be a contributing author on a paper from this work. Once ACIC has been successfully developed and tested we will craft and submit a written academic report on the approach and the test results in a technical journal. 
The independent study covering the programming and testing of the above extensions is worth 2 to 3 credits. It is ideally suited for students who want to work in a team of two.

Registering

For registering, please contact Student Services for the Independent Study Form (INFO499 or IMT600, respectively). In the description field, you can use the contents of this announcement.

Digital Government Reference Library (DGRL) Version 17.5 Released

Now Listing 16,531 references of Peer-reviewed Research Articles in the English Language

Version 17.5 of the Digital Government Reference Library (DGRL) has been published as of December 15, 2021. The library now contains 16,531 references of predominantly English-language, peer-reviewed work in the study domains of digital government, digital governance, and digital democracy.

This marks a 5.1% increase in references from version 16.5 (December of 2020) and a 12.3% increase from version 16.5 (December of 2020). This past publication period has yet been another good one for Digital Government- related publishing adding another 4-digit number (2,004) of new peer-reviewed academic references within the past 12 months.

The DGRL has become an indispensable tool for Digital Government scholars. In particular, reviewers of paper submissions are reported to rely heavily on this reference library. Packaged in a 32.1.7 MB zip file, bibTeX, RIS, and Endnote (package) versions are available. Mendeley or Zotero versions can easily be created by importing from RIS or bibTeX files. Please get back to us in case of any errors or omissions. Next scheduled update: 06/15/2022.

Thank you for your interest and cooperation.

Please also note: The DGRL is provided on basis of self- service. Do not request any support.

No curator can do her work alone. Under the curator and editorship of Hans Jochen Scholl, the DGRL has been maintained and expanded over the years with the help of teams led by Jan Boyd and Galen Guffy and graduate student team members Colin Anderson, Andrea Berg, Emily Cunningham, Erika Deal, Gary Gao, Kreg Hasegawa, Jackie Holmes, Julia Hon, Christine Lee, Andrew Mckenna-Foster, Jessie Novotny, Marie Peeples, Hannah Robinson, Richard Robohm, Kelle Rose, Stephanie Rossi, Christopher Setzer, and Daniel Wilson.

Citation: Scholl, H. J. (2021). The Digital Government Reference Library (DGRL). Versions 17.0—17.5. Retrieved from http://faculty.washington.edu/jscholl/dgrl/

The DGRL can be downloaded following this link: http://faculty.washington.edu/jscholl/dgrl/

PDF version of this announcement

Academic Research on Disaster Information Management is Growing Rapidly and Steadily

As per November 1, 2021, version 4.0 of the Disaster Information Reference Library (DIRL) has been released, which is about two weeks ahead of the original schedule (November 15). The library now contains 3,933 references of predominantly English-language, peer-reviewed work in the study domains of disaster information and information technologies and their uses in the context of disasters. This represents an increase over the previous version of 423 references, or 11.4%. The DIRL release history reveals that from the inaugural DIRL version 1.0 to this version (DIRL v. 4.0), the peer-reviewed academic literature has almost quadrupled in the course of little over four years. This is a remarkable increase of the body of academic knowledge in the particular area of disaster information management and disaster information technology within a relatively short period of time.

The DIRL is intended to become an indispensable tool for Disaster Information and Technology-interested scholars. In particular, reviewers of paper submissions may want to rely on this reference library.

Packaged in a zip file, bibTeXRIS as well as an Endnote package (enlp) versions are available. Mendeley or Zotero versions can easily be created by importing from RIS or bibTeX files. Please get back to us in case of any errors or omissions. Thank you for your interest and cooperation. The DIRL can be downloaded from the DIRL website.

Please also note: The DIRL is provided on basis of self-service. Do not request any support.

Recipient of IFIP Service Award

On September 22, by decision of the the General Assembly, the International Federation for Information Processing (IFIP) presented the IFIP Service Award to Prof. Hans Jochen Scholl. The honor was awarded in recognition of “outstanding contributions to IFIP and the Informatics Community.”

IFIP was established in 1960 under the auspices of UNESCO. The federation’s activities are coordinated by 13 Technical Committees (TCs) which are organized into more than 100 Working Groups (WGs), bringing together over 3,500 ICT professionals and researchers from around the world to conduct research, develop standards, and promote information sharing. Each TC covers a particular aspect of computing and related disciplines.

Wrote IFIP President Mike Hinchey, “This reward is in recognition of your considerable and sustained contributions to IFIP both technically and in volunteer and support capacities. We are grateful for what you have done for IFIP, and this is a token of our appreciation.”

Upon reception of the award, Scholl stated, “Over the past two decades it has been my honor and also my obligation to help advance information and information-systems-related knowledge in academia and practice. Meeting and working with high-caliber colleagues from around the world on a number of important subjects, projects, workshops, and major conferences has always been my pleasure. I feel humbled by the award, and I thank my colleagues in the General Assembly for their kind recognition of my work.”

Scholl is member of two IFIP working groups (WG 8.5, TC8—Information Systems in Public Administration and WG 5.15, TC5—Information Technology in Disaster Risk Reduction (ITDRR)).

2021 Granada Keynote Calls for Focus on Existential Threats to Humanity and Dangerous Successes in Digital Government Research

On Thursday, September 9, 2021,  Hans Jochen Scholl gave the conference keynote speech (47 min including Q&A, mp4 format) entitled “Digital Government Research — Then, Now, and in Years to Come,” at EGOV-CeDEM-ePart. The conference was held at the University of Granada, Spain, in a hybrid format with about on-site 40 attendees and some 60 attendees online. EGOV-CeDEM-ePart, organized by the IFIP Working Group 8.5 (Information Systems in Public Administration), is the top-rated Digital Government Conference in Europe, which attracts submissions from across the globe.

Within the Digital Transformation track of the conference, Jochen also presented a paper co-authored with Erich E. Holdeman under the title “Practitioners’ Perceptions of Fitness to Task of a Leading Disaster Response Management Tool,” which will appear in the conference proceedings published within the Springer Lecture Notes in Computer Science (LNCS) series.