Specialist expertise & interdisciplinary communication

2010 August 11
by Shane Landrum

The Society of American Archivists is meeting this week in Washington, DC. Last week they had a pre-conference about EAC-CPF, which they twittered under the hashtag #eacdc. I’m not formally trained as an archivist, but I’ve used a lot of different manuscript/papers collections and am really interested in digital tools for archival researchers, so I was following their reportage.

I hadn’t ever given a thought to the technical and ethical issues involved in authority databases until a few months ago, when I heard Simon Wiles speak at Yale about his Buddhist authority databases project. Among other interesting technical challenges, it handles records for texts in multiple Asian languages (including non-Unicode ones) and multiple time systems used in various Buddhist cultures.

With the approval of his supervisors at Dharma Drum College, Wiles publishes all his project’s databases under an open-source license, encouraging them to be copied in full to any server that will host them. This is a preservation and access strategy designed to work around issues of political and religious freedom in various Asian countries; Wiles also talked better than I can summarize about how this plan flows from the college’s Buddhist principles.

EAC-CPF (Encoded Archival Context for Corporate Bodies, Persons, and Families) is an XML format that makes it possible for archivists to mark off when items in their collections relate to particular named people or groups. It’s related to what archivists call authority control,” the practice of standardizing headings in a cataloging system so that users can use a single heading to find what they’re looking for. (Don’t look up “Grace Abbott,” look up “Abbott, Grace, 1878-1939.”) More important in an old-style card catalog, but not unimportant in a computer-based system either.) From what I could tell, #eacdc was talking about the relative merits of establishing a single authority file or setting up a distributed system. This is an arcane topic for general readers, but a specialist topic for library professionals.

Fortunately, when I tweeted a question about #eacdc, Maureen Callahan at The Patriarchive gave a good explanation on EAC and whether we should care. More digital-humanities people should be writing pieces like this: explanations of technical issues in their field for others who may need to know. Here’s why.

Although I’m trained as a historian, in the professional field of archives-management I’m a curious amateur. Authority control is interesting to me because I use a number of under-processed women’s history collections, and I’d like to experiment with crowdsourcing approaches for generating finding aids.

RG 102, the Papers of the United States Children’s Bureau, is an important collection for historians who study the early 20th century; women, gender, and family; the emergence of the federal welfare state; and the history of public health. I use it a lot in my dissertation.

I and the +/- 5 other historians who use any particular chunk of NARA RG 102 ought to be able to generate a pretty good collaborative finding aid if we have the right tools to make it easy enough. After all, we index our own digital photos of materials as part of our working notes; why should anyone else ever have to make a list of the items in that box and folder (or, for that matter, photograph them in person)? And if I’ve got subject, date, and topic notes on a folder of letters sent by Grace Abbott, why shouldn’t anyone who’s looking for material on Abbott be able to know what’s in that folder without having to travel to Maryland to do it? (Yes, there’s a microfilm edition, but it’s selected.)

This toolsmithing work and information compilation is an important project, and I think that EAC-CPF may be an important bit of the infrastructure we need for making usable, collaboratively-created finding aids. But without reading more work on emerging technical standards for digital archives, I can’t know for sure. To find other people (historians, programmers, archivists) who’d be interested in collaborating on this toolsmithing project, I have to know who can explain authority control (or linked data, or GIS correspondence mapping) to people outside their own professional specialty.

Explaining what you know for people outside your discipline— particularly specialist methods or theory— is a vital part of building meaningfully interdisciplinary communities. Given that many of the best digital-humanities projects are collaborative team efforts, these explanations might also help potential future collaborators understand why they want to work with you.

Happy July, especially if you were born in Puerto Rico.

2010 July 7

Due to Puerto Rico‘s history as a Spanish colony (1493-1898) and a US colony/possession (since 1898), it has some unusual legal traditions and practices. Until very recently, common practice on the island was that if you needed to present a birth certificate as a proof of identity, you ordered extra copies of that document, and the (school/city/etc) kept that copy on file for its records.

That practice led to lots of extra birth certificates floating around, which happen to have a high resale value. There’s been at least one criminal operation busted for stealing birth certificate copies and reselling them.

Puerto Ricans have been US citizens since 1917, when the Jones Act took effect, but the territorial government’s been making it very difficult to prove one’s citizenship lately.

Because some people were using Puerto Rican birth certificates to evade US immigration-restriction policies, Puerto Rico’s territorial government was taking a lot of heat from the federal government for not doing anything to prevent identity fraud. The legislature decided to implement new, “more secure” birth certificate documents, and so it declared that as of July 1, all Puerto Rican birth certificates ever issued would be invalid.

This decision affects an estimated 4-5 million US citizens, most of whom happen to have Spanish last names and/or brown skin. Furthermore, because US economic policies have hobbled Puerto Rico’s economy, many people born there now live on the US mainland, particularly in cities like New York, Atlanta, and Chicago. If they want to travel back to the island, they have to go by air, which means they need photo ID— which they can’t acquire or renew unless they present a birth certificate or passport. Moreover, they have to know about the invalidation of their existing birth certificates in order to know that they need a new one, and the news hasn’t been particularly widely disseminated.

read more…

Digital humanities summer listening

2010 June 22
by Shane Landrum

I’ve been working on some thorny research problems lately which are, mostly, about handling large volumes of archival data. When I get stuck, I try to go for a bike ride. Lately, following Ryan Cordell’s advice on podcasts as a way to broaden your intellectual horizons, I’ve been listening to scholarly talks on my rides. Here are 2 lectures I’ve particularly enjoyed:

Jeffrey McClurken on Confederate veteran families and database methods

Jeff McClurken’s recent talk at the Virginia Historical Society, “Take Care of the Living: Reconstructing Confederate Veteran Families in Virginia” is based on his newish book of the same name. He talks about the Civil War’s impact on Confederate veteran families after the war, including families whose their military relatives died in the conflict, were physically wounded, or were mentally traumatized.

McClurken’s conclusions are interesting, but what really got my attention was his use of a database to build his conclusions. For reasons he describes in the talk, he focuses on soldiers from a particular town in Virginia– Danville– and Pittsylvania County, which surrounds it. He put each veteran’s name into a database, then reconstructed how they and their families appear in the written record over time as they made claims on relief systems: local elites, churches, state military pensions, state asylums, and such.

On one hand, this is a method we don’t see a lot of anymore, reminiscent of the early 1970s New England community studies– but he has a database, rather than punched cards, which makes his analysis more flexible. He can do numerical analysis, talking (for example) about the differences in family finances caused by having a relative killed or having a relative wounded in Confederate service— but he can also track individuals and their families over time. I’m very interested in this, since it’s similar to some methods I’d like to use; I haven’t read the book yet, but I’m told that the appendix contains more technical details, and I’m looking forward to it.

Julie Meloni et al. on annotatable digital archives

Julie Meloni’s talk at the University of Virginia’s Scholars Lab, “N-Dimensional Archives,” is more conceptual and theoretical. She builds on the work of Johanna Drucker, Bethany Nowviskie, and others at UVa’s Speculative Computing Lab— particularly their concept of a ‘Patacritical Demon. Nowviskie’s introduction gives good context for why this is important. Meloni describes a system which allows multiple readers to annotate an electronic text and to see one another’s annotations.

Although Meloni’s trained in English and approaches much of this problem as a literary-studies scholar, I can imagine this system’s use for historians. (Note that she hasn’t actually built it, but she describes a little bit about the technologies which would enable such a system. The Open Annotation Collaboration is currently building some systems which might head towards this goal, and they’re worth keeping an eye on.) Jerome McGann also follows with some remarks. Although the Q&A session at the end didn’t come through the recording system well, I found the whole session good for sparking my thoughts.

If you’re interested in digital humanities and want to know more about the field, you could do far worse than to subscribe to the podcasts from the Scholars’ Lab and from the Maryland Institute for Technology in the Humanities. Every time I listen to one of them, I get more excited about the new interdisciplinary work being done at the intersection of computing and the humanities.

If you have other suggestions for podcasts on history or digital humanities that you’ve enjoyed, please share!

Digital methods, history dissertations, and scholarly careers

2010 June 2

Good morning, readers. It’s been a while since I wrote here, for one reason and another; those of you who’ve written dissertations may understand. This piece, inspired by some of the essays published in Hacking the Academy, tackles several different threads I see emerging out of early-career scholars’ work in and around “digital humanities,” particularly in history.

In the humanities at least, the dissertation is a process of learning what a fully-formed, well-supported, authoritative book-length work looks and feels like. Before you start it, you should have already practiced your scholarly skills (research methods, analysis, writing, revision), usually with the close advising of a faculty member. By the end of writing the dissertation, you’ve presumably demonstrated that you’ve mastered the set of scholarly skills necessary for your discipline— or at least for the topic you’ve chosen.

I see something different happening among my peers: early-career historians, those of us who finished dissertations in the last year or 2 or who will finish in the next year or two. We did our coursework and methods training, as Tanya Roth points out, with scholars who relied primarily on “scrawled ideas, transcriptions, and neatly-ordered index cards about sources.” (They also produced amazing work that way, or we wouldn’t have wanted to study with them!) And then we went to archives with laptops and digital cameras to shoot our archives— a relatively uncommon practice even a few years ago, but which promised to stretch our meager travel budgets— and we came back with laptops full of photos that we had to invent organizational systems for.

We had this great new tool (affordable digital cameras), immensely exciting possibilities for what we could accomplish, and very few mentors who could coach us on how to use it best.

I distinctly remember approaching an academic librarian sometime in 2007-8 asking about tools for organizing gigabytes of images. I’d just returned from the first research trip I’d done with my camera. I wasn’t asking for OCR or crowdsourcing; I just wanted to know if there was a more sophisticated, flexible option than expensive commercial software, possibly one that would help me with subject categories and citations for a large number of documents. 15-20 minutes of conversation boiled down to “You know more about this problem than we do; let us know what you figure out.”

Well, that was almost like help… but totally not.

If you’d asked me what I do (as a scholar) the day I had that frustrating conversation, I’d have told you something like this:

I study the history of women, gender, sexuality, law, and government in the United States, mostly since the mid-19th century. I also have a magpie-like fascination with histories of science, medicine, and technology. My dissertation’s about how Americans developed birth certificate systems for demographic and identification purposes, how ordinary people learned to use birth certificates, how these documents categorized American identities, and how American law changed once people started to assume that anyone born here can prove it with a birth certificate.

2-3 years later, my sense of what I do as a historian is pretty much the same. But as I’ve gotten deeper into the writing process, I’ve discovered that the analytical tools I want are fundamentally digitally-enabled, and that I have to return to my computer-nerd roots to get what I want as a humanities scholar. This means that my methods are looking, in some ways, less and less like those I was trained with (although in other ways, they’re remarkably similar.) And so I’d have to add a follow-up paragraph to that statement:

As part of my dissertation research, I’ve been designing and building digital tools for mapping and analyzing correspondence between ordinary Americans and government agencies during the early 20th century. I’ve spent more time writing code than most historians, and more time thinking about the politics of race, gender, and categorization than most computer nerds. I’m increasingly interested in research tools that help humanities scholars visualize uncertainty and gaps in knowledge while retaining the citational rigor of their disciplines.

That’s not the traditional intellectual profile of a historian, but it’s a lot of fun. It gives me a sense that I’m working on one of the biggest problems in my field, methodologically, as well as the big problem of my topic: how to write a social history of documents (birth certificates) that I can’t, in most cases, actually get access to because of confidentiality laws passed in the 1940s? (The answer to that second question is, in short, digital archival methods.)

And so when I read Josh Greenberg’s essay on neglecting his academic blog because he’s been busy writing code, I understand all too well, and worry a little about how to make sure that I keep engaged both with my historian colleagues and with my tech-focused communities. When I read Lee Ann Ghajar’s essay on starting her dissertation research with digital methods, I feel a glimmer of recognition. I’ve sat in that same research room at the Library of Virginia with my camera setup, answering the same kinds of curiosity-seeker questions, and I hope that her dissertation process results in new answers to the kinds of methods questions I’ve been asking since that day several years ago.

These days, when I’m not trying to get that next chapter written, I spend time thinking about whether the traditional tenure-track faculty path in history makes sense for me; experimental software-hacking isn’t something most history departments know how to recognize as scholarly work. I tell potential employers that I’m looking for postdoctoral opportunities that “leave my options open” as to whether I’ll head for the tenure track later on. (Bethany Nowviskie’s persistent advocacy for #alt-ac careers has shaped how I think about this, as did a long-ago conversation with some feminist adjunct professors about the too-often-intellectually-conservative effects of the tenure process.)

And when I tell my department’s cohort of new ABDs what I’m working on as I push my dissertation towards completion— about the fact that this correspondence-mapping-and-analysis tool I’m working on has been cutting into my prose-writing time— some of them say, jokingly, “Hey, wanna work on my research?”

Digital humanities in New England

2010 March 26
by Shane Landrum

As I’ve become more and more interested in the emerging interdisciplinary field sometimes called “digital humanities” (DH) or (especially in Europe) “humanities computing,” I’ve also noticed that it’s hard to find DH people in Boston or New England, other than at MIT’s HyperStudio. Mark Sample’s post on the death of the digital humanities center gives examples of some of these elsewhere in the US. Sample points out that, although many scholars might wish to have institutional infrastructure/support for our DH work, budget cuts happen first in interdisciplinary programs. Here’s his call for action:

So if you’re interested in the transformative power of technology upon your teaching and research, don’t sit around waiting for a digital humanities center to pop up on your campus or make you a primary investigator on a grant.

Act as if there’s no such thing as a digital humanities center.

Instead, create your own network of possible collaborators. Don’t hope for or rely upon institutional support or recognition. To survive and thrive, digital humanists must be agile, mobile, insurgent. Decentralized and nonhierarchical.

Stop forming committees and begin creating coalitions. Seek affinities over affiliations, networks over institutes.

Centers, no. Camps, yes.

To that end, and inspired by the new Digital Humanities Southern California blog, I’m trying to help build more local and regional community among DH practitioners. I’ve started a new group blog, Digital Humanities in Boston and Beyond, which will (as it develops) feature posts about local DH specialists, what we do, and why it’s exciting. Please join us there.