You are viewing lcwk86

Final Thoughts...

Tuesday, May. 9th, 2006 | 10:51 pm

I am finishing my paper now (proofreading!) and tomorrow I will reindex with SearchMaker Pro and do some testing on a high speed connection. When this is finished, I will email the site to General Mills.

After this, I have two summer classes,practicum and comps. My practicum is to be at the St. Louis Art Museum Library digitizing their bulletins and annual reports. A huge project that I will not be able to finish, but other students can continue.

I am very grateful to Paul Turner for this learning experience! I have so much more to learn, but at least I tipped the iceberg.

Link | Leave a comment | Add to Memories | Share

ReIndexing

Monday, May. 8th, 2006 | 10:27 pm

I have reindexed PicoSearch (you have to go to the site, log in, and reindex). To reindex the SearchMaker Pro, I will have to go back to Dad's. So tomorrow night...

But check out the new title tags--
Search by page number: 103, 104, 105 and 110.

Link | Leave a comment | Add to Memories | Share

New Count

Monday, May. 8th, 2006 | 10:02 pm

Since I am revising the site, I need a recount of the digital objects:

4 page images
4 text pages
3 Illustrations
2 color photos (including cover)
3 black and white photos
11 recipes (or recipe suggestions)
--------
27 digital objects

I am updating the pages and rewriting the title tags now....
Obviously, it is important to know your intent for the material before requesting copyright. But you also need to be able to rethink the project to fit your copyright permission. A chicken-egg-chicken situation, if you get my drift...

Link | Leave a comment | Add to Memories | Share

Copyright Permission

Monday, May. 8th, 2006 | 09:12 pm

It has arrived, not exactly as hoped, but I have permission for some of the materials:

TO: Lisa Wolfe
RE: Copyright Permission

Dear Lisa:

Thank you for your inquiry requesting permission to use Betty Crocker Cookbook content for your digitizing project. We will be able to grant you partial permission, to use a limited number of cookbook pages from the 1956 edition of Betty Crocker's Picture Cook Book. The pages you have permission to use are:

The Cookbook cover
The Yeast Breads intro page
The Ways with Bread Photo Page
Page 105
Page 110
If you choose to use these, we ask that you abide by the following guidelines:

1. The recipes and other content must be reprinted exactly as it appears in the original source;
2. The original source must be identified (i.e., Betty Crocker’s Picture Cook Book);
3. The following acknowledgment should be included, “Reprinted with the permission of General Mills, Inc.”;
4. Permission for reprint of recipes does not include the use of “BETTY CROCKER” or “General Mills” in any advertising or promotional materials; and
5. A copy of the finished publication or link to website should be forwarded to my attention.

Thank you for the courtesy of recognizing our rights to this material by requesting permission for its use. Best of luck with your project.

Sincerely,

Sxxx Sxxxxx
VP, Equity Enterprises

A re-design is now in order!

Link | Leave a comment | Add to Memories | Share

Arrrgggh!

Thursday, May. 4th, 2006 | 10:09 pm

It's the last crunch week in other classes... so this is on hold for a few days.

That's ok, save the best for last...

Link | Leave a comment | Add to Memories | Share

Blog Tagging

Wednesday, May. 3rd, 2006 | 09:52 pm

When I started this blog, I did not know what tags were (as in tags for the blog--well other types as well but that's what I am talking about here). Since then, I have been reading other blogs, using Bloglines, searching Technorati and learning more about it. Funny, but I wasn't thinking of the blog tags as metatags or a controlled vocabulary, but it really seems to be... Here are some blogs entries about this topic:

Tame the Web
Metablog
Information on Demand
Tags:

Link | Leave a comment | Add to Memories | Share

Refining Titles

Tuesday, May. 2nd, 2006 | 07:45 pm

Here are Paul's comments:
You could refine the title tags for each page, image, etc and reindex one last time using searchengines... advantage of Picosearch is it should do this automatically....  

Back to tweaking...
Tags:

Link | Leave a comment | Add to Memories | Share

Up and Running...

Monday, May. 1st, 2006 | 08:15 pm

While I had access to a high speed connection today, I tested all the pages. At this point, I only have a PicoSearch field on the home page. Once the site is complete, with all the kinks worked out, I will trek over to Dad's and set up the SearchmakerPro script to add to the page.

Link | Leave a comment | Add to Memories | Share

Introducing... Betty!

Sunday, Apr. 30th, 2006 | 11:45 pm

Here it is! Take a look...
Still a lot of work to do.

Link | Leave a comment | Add to Memories | Share

DC-Dot

Saturday, Apr. 29th, 2006 | 10:47 pm

I spent the better part of yesterday adding the dc metatags to my pages. I still have a lot to go, but I am well on the way. It consists of entering the URL at DC-Dot and then supplementing with my database entries. I made mistakes before I got the hang of it and made it into a workflow sort of thing. When that happened, it was BBEdit again correcting the code on several pages. There are several workflow sections of doing a digital project. If you work until it's finished, you discover what you should have done differently and have to go back and re-do. In that way, it's better to work on a very small part of the project to establish what works best before making major decisions.

Link | Leave a comment | Add to Memories | Share

No Chat today

Friday, Apr. 28th, 2006 | 02:45 pm

Since I was knee-deep in html, we did not chat today. That's ok because I plan to have the website up and functional before the weekend is out. Today I am building pages. I keep having to redo html because I am find something to change. Thanks to BBEdit, I can do that with multiple files at once. Adobe GoLive does it as well, but not as efficiently.

Link | Leave a comment | Add to Memories | Share

Hope for the best, I guess

Friday, Apr. 28th, 2006 | 10:58 am



Lisa:
I can help you with this, but need to know a bit more. Believe it or not the answer differs with the content you intend to use and how you intend to use it. Is it recipes & photos, meat roasting charts, ingredient measuring charts, etc? Also can you tell more about how you intend to use the information? What is the subject you are addressing in your project? If you can get back to me with these answers, I will get you an answer within a day. (FYI, one guideline we use it that we limit the free use of recipes and photos to five per year, for distribution to less than 500 people.)
Thanks,
Lxxxx Oxxxxx
General Mills, Inc.


Ummm... am I in trouble?

My counter:
Lxxxx,

I will try to answer each question and if you need more than I have provided, please let me know.

The purpose of the project is to learn how to manage and complete a digitizing project. My proposal for the class is attached.

The ten pages I am digitizing consist of:
6 Illustrations
17 black and white photographs
3 color photographs
19 recipes (out of those, some are suggested uses of bread, not a traditional recipe.
Some pages such as 103 include no recipes, just the introduction to the Yeast Bread section.
In addition, page 112 is used as an example and the front and back cover of the cookbook.
Page List:
103, 104, 105, 106, 107, 109, 110, 113, 123


Here is the introduction I want to use for the collection:
“As a graduate student in the Library Science Program at SISLT, I decided to explore the interesting topic of digitization for my independent study class. For the project, I selected 10 pages from the "Yeast Breads" section of old cookbook that belonged to my mother. The actual book resides in my house, as mom doesn't typically use cookbooks these days. The collection contains 65 digital objects, including page images, text, illustrations, and photographs.”

NOTE: Each page has to also be available as a complete image and a text page. So it all adds up to 65 digital objects.

As you can tell the project is well underway, as it took me a while to figure out who to contact about copyright (part of the learning process!). My instructor and I intended to use the project an exercise, even if the copyright did not work out. If it does work out, the information will be used for future students in the Library Science program as an learning tool. However, I would think that time would be fairly limited, as the student space on the server exists while you are a student (not sure how long after you graduate). I should be completing my degree the end of 2006.

Thank you for your consideration.
Lisa Wolfe

I sincerely hope I get permission. I have done a lot of coding! It has been really good, yes, really, in spite of my obsessing and drinking lots of caffiene. But it will be a lot better if I am able to use the information, at least for a little while...

Link | Leave a comment | Add to Memories | Share

Investigating other Search possibilities...

Wednesday, Apr. 26th, 2006 | 07:34 pm

Since the free version of SearchMaker Pro will only index 6 pages, I asked Paul if either FreeFind or PicoSearch would be workable...

Lisa, of the two free search engines you mention neither appears to search your DC metatags... Searchmaker Pro claims they do, but there appears to be no way to do a "fielded" search against specific fields like "author".

Of the 2, I would test Picosearch against your site, and specific directories within your site than contain your project file(s)....

Picosearch seems to have rich customization features, and will do many things searchmaker Pro will not.... because it is a real search engine, not a hack like searchmaker is...



TEST Result: PicoSearch -- as least the free version -- does not seem to search your Dublin Core metatags...I tried to search for "McGraw-Hill" and some other meta tag descriptors and did not yield any results.

PicoSearch DOES do a very nice job of doing basic search against the text in your pages--and it makes it even more critical that you get your "title" page tags to be fully descriptive of what each page is about. I think this is a great result though and glad you pushed on to find a search engine that would work for your particular situation.

It's hilarious how you get ads for yeast within your results.....
Paul T.

Link | Leave a comment | Add to Memories | Share

Ultimatum

Tuesday, Apr. 25th, 2006 | 10:53 pm

As per Paul's request, I will be writing a summary of the project (2-3 pages tops...) for submission to both Paul and Dr. Adkins. Everything has to be finished by May 10th. Yikes!

Link | Leave a comment | Add to Memories | Share

Gather Resources

Sunday, Apr. 23rd, 2006 | 03:25 pm

Note from chat:
I would recommend organizing your references in a web page would be helpful to other students interested in this area of study/research, something you can build on as you learn more, something for Denise and other LIS Faculty to have access to for themselves and other students...

Going back through all the blog entries, emails, bookmark manager and chat transcripts is proving to be a large task.

Link | Leave a comment | Add to Memories | Share

Temporary problem

Sunday, Apr. 23rd, 2006 | 01:51 pm

I made some changes in the directories at bengal, so some of the links won't be working today. Will be fixing it tomorrow...

Link | Leave a comment | Add to Memories | Share

Preparing Content

Sunday, Apr. 23rd, 2006 | 12:25 am

As I said, I need a home page and "search" on every page, but also a way to navigate the objects without using search. All object need to be available on static pages. With 65 objects, that means a lot of pages.

So I have been preparing content for some time. The pages of the cookbook have been scanned and saved in a TIFF file format. I have chosen pages 103-107, 109, 113, 118, and 123. In addition, there are a few others that will make an appearance, such as the cover.
My plan:
  • Pages will be available as page images, in other words an exact online image of the actual page.
  • Pages will be available in text form.
  • Pages will be recreated using html for the digital collection website.
  • Recipes, photos, and illustrations will be available separately so they can be searchable.
  • Each of these "objects" will have their own page on the website.


Page Images
Take the scanned page, use the "Auto Color Correction" and resize to a width of 500.
(this sounds simple, but it was actually a process of trial and error before I came to that conclusion)
Name each file according to identifiers set up in the database, for a total of ten pages.
Page Text
Both to prepare for the text pages and the html pages, the scanned text pages needed correction. The OCR did a reasonable job, but certainly not entirely readable. So after going through and comparing the actual pages of the cookbook to the scanned text, the text pages were corrected and named with the identifiers according to the database, of course a total of ten pages saved in rtf.
Build the Html pages
I wanted each page to replicate the original as closely as possible. So I had to start building and making design choices as I went to mimic the style of the cookbook. Then compose a CSS stylesheet.
Each recipe, image, and cookbook page had to have a separate html page. This made a total of 59 pages, once again named according to the identifiers in the database.
Organize the website
Home page, and sections of the site such as:
Search
Page Images
Page Text (html pages included)
Recipes
Images
Resources

Link | Leave a comment | Add to Memories | Share

SearchMaker Pro!

Saturday, Apr. 22nd, 2006 | 07:15 pm

SEARCH

[Middle English serchen, from Anglo-Norman sercher, variant of Old French cerchier, from Latin circre, to go around, from Latin circus, circle, from Greek krikos, kirkos; see sker-2 in Indo-European roots.]

Yes, it feels like I am going in a circle. The search problem is maddening!
Discussed in chat: what we really want is a public (and free) search engine that handles metadata (dublin core) searches... - or alternatively a way to build a search page "front end" like the example above for just indexing and searching page(s), images, etc on your web site.

And then the answer came in an email from my collaborator!

I found what we were looking for -- a simple search engine that can index and search dublin core metatags without having to work w/ web server issues.  it is what we call browser-side and using javascript to accomplish searching web site(s).

This is a simple approach but allows you to build a search.html page for any web site, or section(s) of web sites, that can then be used to search a controlled list of content on your web site (specific pages, file types, images, etc).

It's not perfect but it is certainly powerful enough for your needs for this semester.. 

And it will be perfect learning tool to show you a bit of how search works "under the hood" -- its more complex than you think, and this is an easy approach :)  I think a week of playing around with it will solve most of your search issues....

The only drawback. It only works on PC/Windows machines -- the creation part that is -- once you create a search.html page using the software you can upload this page to ANY website and use it in most any browser on Mac/PC etc.


So, having a Mac, I went to my dad's and downloaded SearchMaker Pro. Now to keep dad from uninstalling it accidently. Yes, it could happen. I did a trial run to see how it worked. This is a great solution and I look forward to using it to search my collection.

But the pages have to be built and uploaded before I can create and upload the search page. Back to work...

Link | Leave a comment | Add to Memories | Share

Sidetracks

Friday, Apr. 21st, 2006 | 05:32 pm

Very helpful chat today. It's confirmed. Google is not going to work. So we are still searching for a solution.
From Old Books offered some great information on how setting up this sort of search works, if you have access to your own server and don't want to pay big bucks. It's all open source stuff.

Another interesting sidetrack that won't really help my search problem, but could prove useful to demonstrate the use of Dublin Core metatags. Firefox has a plug-in that allows you to view the metadata for a webpage. It's a very tiny download, browser restart, and wella! You have this little icon in the lower right hand corner of your browser window. If you visit a page that contains DC metatags, the icon will turn orange. Click on it, and a window will open with a list of the tags. You can click on each tag to view the "details", in other words, the full text of the tag as if it is lengthy the window cuts it off. Fun and geeky.

Now back to my digital problem. Not only do I have to be concerned with searching the collection, but I need to provide an easy user interface. Here's my first source for this topic.
Some other print sources that I reviewed at the start of the project:

Managing a Digitization Project. Visual Resources Association Bulletin v. 31 no. 2 (Winter 2005) p. 50-65
Mathias, E. Anatomy of a Digitization Project. Library Journal (1976) part Net Connect (Winter 2004) p. 2-7
Arms, William Y. Digital Libraries. Cambridge, Mass.: MIT Press, 2000. : available on through WorldCat


And yes, in the meantime I have been ekeing out online content pages... More on that later.

Link | Leave a comment | Add to Memories | Share

More from General Mills

Friday, Apr. 21st, 2006 | 01:55 pm


Lisa,

I talked to my boss about your request this morning. He asked me to send the request to Lxxxx Oxxxx. She is the person who handles all requests related to recipes and cookbooks. I have done so. I believe she is going to have to get direction from a couple of people in her area, so please give her a few days to respond. If you haven’t heard from her, then feel free to contact her at her email address: xxx@xxyyyy

Good luck with your class.

Dxxx Hxxxx
Senior Paralegal

Link | Leave a comment | Add to Memories | Share