16 May 2008

Do You del.icio.us?


Today I am going to start a two-part posting on using del.icio.us in genealogy research.

If you already use del.icio.us as one of your online tools, you know what I mean when I say that it really revolutionizes the way in which you access, bookmark and organize sites of interest you find on the web. Established users will be more interested in the second post of this series, when I'll talk about some features of del.icio.us aimed at the intermediate user.

If you don't already use del.icio.us, or are still stuck on wondering what in the world I am talking about, then this first post is for you!

GETTING STARTED
Learning about del.icio.us isn't too hard; they bill themselves as a social bookmarking site, taking all those links that people used to keep to themselves within the confines of their browser, and aggregating them together to create a user-defined bookmark (or favorite) list of the entire web. del.icio.us uses the tagging features which are one of the hallmarks of web 2.0 (more on tags below).

The nitty-gritty is simple: instead of bookmarking (or favoriting) web sites you find along your online travels using your browser, you post them to your account in del.icio.us. When you post those links, you add descriptive tags. Those tags, and all your links (unless they are specifically marked private) are then available to be searched and browsed by anyone else using del.icio.us. Posting to del.icio.us is made easy (especially for Internet Explorer and Firefox users) by the addition of a button to your browser's toolbar (more on that later).

Let's learn by example here:

1. The first thing you'll want to do is sign up for an account at del.icio.us! Install the browser extension for your particular browser, or drag the bookmarklets into your toolbar.

2. Next, save your current bookmarks as a file on your computer, so that you can import them into del.icio.us:

In Firefox: Bookmarks > Organize Bookmarks > File (In the File Menu on the pop-up window) > Export.

In Internet Explorer
: File > Import and Export > Click the "Next" Button > Export Favorites > Export to File or Address > Choose a location to save your file > Next > Finish. You should get a confirmation message that your favorites were exported.

3. Upload your bookmarks to del.icio.us. Once you are logged in to your account, you will see a link to your "settings" page. On that page, under the "Bookmarks" heading, choose "Import/Upload" and follow the instructions to upload your file. Once you're done, you should see your bookmarks on your page!

WHAT'S IN A LINK?
Let's look a little closer at the anatomy of a del.icio.us link:



First you see the title of the bookmark (which is also the link); this title is generally taken from the title of the html page bookmarked. (This is editable when you post new links). Next to the title of the bookmark are links which allow you to edit or delete the bookmark you have posted.

Below the bookmark link you see the tags which have been attributed to this bookmark (in this case, genealogy and research).

Clicking on one of those tags will take you to a page which displays all of the other bookmarks you have posted and tagged with the same term. For instance, clicking on genealogy beneath the link above, takes me to a page (actually 1 page out of 17) with all links also tagged genealogy:



All items with the word "genealogy" as one of their tags appear here, regardless of what other terms they are tagged with. Now, back to our link:



Next to the tags, beneath the bookmark title, you will be able to see how many other people on del.icio.us have also posted the same bookmark. In this case, del.icio.us displays a link which tells me that 238 other people have also posted this link. This is where the "social" comes into social bookmarking!

Clicking on the "saved by # other people" link, we are brought to a rundown of all of the people who have bookmarked the site, any notes they have made regarding the link, and the common tags they have used to tag the site:



GETTING SOCIAL
Now that we've covered the rudimentary basics of del.icio.us, let's take just a moment to talk about what really makes del.icio.us useful, unique, and a standard bearer of web 2.0: the interactivity and the ability to obtain and share information in non-linear ways.

What do I mean by non-linear? Traditionally, if I wanted to share a set of links with you, I would write up something, perhaps a list, which would be ordered (and therefore prioritized) in some manner. Web 2.0 (and its various hallmarks) is sort of like that fuzzy strange school that gives out stickers instead of grades and has the students grade the teachers: it shuns hierarchy and calls for the user to generate his or her own path through any particular set of information. This is exemplified by the tag cloud:



Each user has his or her own tag cloud. If you were to come to my profile on del.icio.us, this is part of the tag cloud you would see. Clicking on any one of these links would take you to a page of the bookmarks I have posted using that tag. For instance, clicking on the tag "louisiana" would take you to a page with all of my links using the tag "louisiana". On that page, del.icio.us will also show you, on the right sidebar, a list of "related tags", tags which I have used often in conjunction with the "louisiana" tag:




From these tags, you can explore whole hosts of other links, while also exploring the subjects which I am interested in, and for which I often find and bookmark sites. Assuming you are interested in the same topics, chances are you will discover one or two sites that you never knew existed! When you multiply the amount of information and different ways of browsing through a user's tags by the number of del.icio.us users (over 2 million) you can imagine the sheer volume of information and links you can find.

ONE OTHER WAY TO SEARCH

You can also follow a more direct path to finding other links, by using the del.icio.us search bar, which you can find at the top of every page:



A search for genealogy brings up nearly 45,000 different links from all users!

I hope this post inspires you to give del.icio.us a try, if you haven't already. In the second post in this series, I will cover some of the features available through del.icio.us that will take your usage of the site to the next level.

14 May 2008

Finding Images in the Deep Web

[Via ResourceShelf]: The Association of College and Research Libraries presents a listing of some prominent online image repositories.

This is a perfect example of what lurks within the deep web!

12 May 2008

Reorder Your iGoogle Page Tabs

[Via GOS]: You can now reorder the positioning of tabs on your iGoogle page... a heretofore un-understandable and frustrating limitation of iGoogle. Useful knowledge if you are a type-A like me!

11 May 2008

Extra! Extra! Rethinking Newspaper Research

In my work over the past two years reading thousands of issues of 19th-century newspapers, I have become fairly well-acquainted with the typical layout and content of such, so I wanted to share some thoughts and tips with you today.

This post may be somewhat anti-technology for this blog, in that I think it admits a deficiency in today's OCR technology as applied to historic newspapers. Take it from me... there are prints, scans, fonts, fadings, scratches, lines, creases and tears lurking in all of those spiffy online digitized newspapers that are obliterating the abilities of even the best OCR programs. I would hazard to say that a huge percentage of names in any given issue of a historic newspaper are not properly read and indexed by OCR and search software. What does this mean? In short: if you've been relying on search functions in newspaper databases to do your research in newspapers, you haven't done your research at all.

Why It's Worth It, and What It Takes

If you haven't taken the time to really dive into the local papers available for the community of your ancestor, you are missing out on a fabulous opportunity to obtain a rich and textured understanding of your ancestor's world. This holds true in particular for newspapers from the mid- to late-1800's, when the era of local and community news was really at its prime, and papers seemed to follow very predictable ways of presenting and publishing their rags.

That said, if you're like me (and many others [see comments]) who are frustrated by the inadequacies of some digitized collections' search functions (and the haphazard OCR renderings which exacerbate such problems), going issue by issue through a particular paper in a given time-range may be your only option to locate information regarding individuals.

Know The Bones

This site has some good information on the basic structures of 19th-century newspapers, extrapolated out from a typified example (unfortunately this page seems to be on its way out of maintanence). I have found this structure typical of the four-page dailies or weeklies in a number of states, including California and Louisiana. A basic understanding of what you are likely (and unlikely) to find in a newspaper from a given time-frame is important. Styles of journalism and what is considered "news worthy" has changed over time and with massive population growth in many areas. Over one hundred years ago, news about divorce cases, drunk-in-public charges, and trips abroad made the newspaper. Today, unless they involve murder, incest or embezzlement, we are unlikely to hear about such things. On the flip-side, extended obituaries for individuals were much rarer in those days. Understanding the differences between old newspapers and modern newspapers can keep you from goose-chasing.

Items typically found in 19th-century newspapers include:

  • Births, marriages, deaths
  • Obituaries of more prominent people or people from "old" families
  • Obituaries or write-ups on deaths of the young, the old, or those who died due to accident or sudden/strange illness
  • Divorce cases
  • Spousal desertions or elopements
  • Murders, Suicides (Actuated or attempted)
  • Comings and Goings. Info on who is traveling where, who is in town visiting.
  • Local reports. Small correspondent columns from towns or areas outlying the town in which the paper is printed. Typically covered the major gossip in the town, including who died, was born and got married or divorced.
  • Court reports. Who got arrested, why, who's in jail, who got drunk and fined, etc. Also, information from probate and civil court cases.
  • Accidents and injuries. Horse runaways, train deaths and injuries, and gun accidents are always favorites.
  • Illness reports. Typical during the months when la grippe would be rampant, some papers would list who was sick with what level of severity.


Names and even biographical information show up in the strangest contexts, and in the weirdest ways. Probate case write-ups can mention birth dates and places, as well as death dates and places. Accident stories can include information on when a person moved to a particular town or county. Personal or comings-and-goings reports can include information on family relationships like the married names of daughters and the places of residence for relatives. The amount of information and the intimacy with which you can come to know the people of a particular town is really amazing. All it takes is a little background. Oh, and a whole lot of patience.

Have Patience, Young Jedi

Going through a newspaper issue by issue can be fascinating, but also exhausting. If you're looking for a particular item in the paper, your eyes can glaze over and your brain can sputter out. Keep the following in mind as you dig for specks of gold in tons of stone:

  • Things change. Just because the death notices, for instance, have been printed on the 2d page of a publication for the past 3 months' worth of papers you have scanned, doesn't mean you can be lazy and look only at the second page of all the papers for the next 3 months. Papers then, like papers today, moved things around to suit special features, special coverage, and ads ads ads. It's unfortunate for the weary researcher, but every page of a paper needs to looked at if you're searching for a particular item.

  • Places change. As towns grew along with the population, the papers often expanded to more pages, usually from four to eight, then with periodic twelve-page issues (usually Sunday). In consequence, the amount of information available in the paper doubles or triples. A typical case can be seen in my indexing of The Oakland Tribune. For 1875 I found about 650 instances of names with genealogically-significant information, from the entirety of each paper, for the whole year. Just fifteen years later, I indexed close to 6,000 names, just from the standard vital records notices. Oakland had grown, and its development as a "bedroom community" to San Francisco also meant that it received more cross-published notices from San Francisco. In fifteen years the amount of information of interest to a genealogist had grown at least ten-fold.

  • People change. Following from the above, you should be aware of changes in ownership or editorship at a paper. Changes of this sort are almost always followed by changes in format, typefaces, or coverage of the news. Papers often posted this information under the masthead for a period of time before such changes took effect, but not always. It helps to be armed with this information so that layout and type changes don't make you miss what you're looking for. This is particularly important as you get more acquainted with a particular newspaper and your scanning (naturally) gets more cursory. The font one editor loved to use for advertisements can be the font the next editor loves to use for the local probate and police court write-ups. If you haven't been aware of an editor change, you may scan over the court write-ups believing them to be the same old ads for dyspeptic syrups!

  • The only consistent thing is change. Sometimes changes just happen and are reflected in the newspapers for reasons we may not understand. A paper that one year prints hundreds of vital record notices may post less than a hundred the next. A paper may begin to print articles with the names of all individuals to whom marriage licenses were issued, only to stop a week later. Why? As your Mom always said, "Because."


It is said often, but newspapers really are a very under-utilized resource. As I have gone through issue after issue creating indices for various newspapers, I often marvel at the amount of information available on some people in different articles, and I get excited at the thought that the information printed in a few inches in a paper 150 years ago could solve the mysteries or brick walls of one of their ancestors today.

I also spend about 4 hours a day indexing and transcribing newspapers in my area because I believe, or rather KNOW, that people are missing out on valuable information because they are relying too much on the accuracy of OCR, a technology that is currently somewhat sub-standard to the task of rendering microfilmed newspapers. I hope this post encourages you somewhat to invest the time to really take advantage of newspapers as a resource.

If you're interested in reading more about digitization of newspapers, the LOC Newspaper Digitization Project has some interesting articles and links.

09 May 2008

How Did That Civil War Soldier Really Die?

Part of the ongoing Old School theme, taking it back to the paperbound resource! Here's a tidbit of interest from Life of Billy Yank: The Common Soldier of the Union, by Bell Irvin Wiley... something to contemplate when considering how your Civil War-fighting ancestor actually died:

"It is a sad fact of Civil War history that more men died of looseness of the bowels than fell on the field of combat. The best available figures show 57,265 deaths from diarrhea and dysentary as against 44,238 killed in battle." (Page 124)

07 May 2008

Firefox 3 and Your Online Research

If you're considering making the move to Firefox 3 (currently still in beta, but nearing full release), be aware that there are still lingering issues with Ancestry and the Enhanced Image Viewer. Currently, users are having difficulty with the plug-in, or, as in my case, are unable to install the plugin at all and are forced to use the Standard Viewer (which, in case you don't remember, really leaves alot to be desired).

Zotero is also not currently functioning in the 3.0b5 version, (though it did function in earlier versions). Zotero says it will be fully compatible with Firefox 3 when it is finally released.

Google Notebook remains buggy and unstable in 3.0b5, (and virtually unusable) though it looks to be an issue for Google to resolve.

I'll update as more issues come to light.

06 May 2008

Trawling the Deep Web

When it comes to research, I am a firm believer that whatever I am looking for exists, and if it exists it is probably online (or will be at some point). Needless to say, I have alot of unbridled optimism, and infinite patience for the blossoming of the internet.

Along those lines, I was thinking about the "deep web" or "invisible web" which alludes to the gazillions of bytes of information out there on the internet that are not accessible via traditional search engine results. This includes, of course, things like information stored in databases... the life blood of the online genealogy researcher.

Some Resources

I found a great article from OEdb which provided some interesting links for exploring the deep web. Of special note are the following sites:


Some Rethinking

Ultimately, of course, we can't believe that a link to some deep, as-of-yet-undiscovered database will solve all of our research problems. The truth is that the deep web is not so much invisible as it is demanding... demanding that what can be wandering, aimless time spent on the internet take the same disciplined, goal-oriented approaches that most research resembles. The deep web reminds us that we must reassess and re-evaluate the ways in which we work online, in order to maximize both the efficiency and integrity of our internet research.

This article, from UC Berkeley, says it all, and provides a fabulous matrix for rethinking your approach to online research. My goal is to follow the guidelines set out here and re-research a current brick wall. Perhaps the issue is not that the information is not out there, but that only that I am unable to find it. Optimistic? Perhaps, but the only down-side is a more thorough understanding of what is and is not currently available online on a given topic. And that's not much of a down-side, now is it?