Pages

Categories

Blogroll

Archives

Meta

JEFFREY BEALL: OCLC: A Review p. 85-93

From the opening paragraph: “…I aspire to the high road: objective analysis, keeping in mind that the word radical is in this book’s title.” (p. 85). While I’m not a huge fan of OCLC myself these days, this chapter offers very little by way of objective analysis. It is more of a curmudgeonly rant. Offering little or no compelling evidence, Beall alleges that:

  1. OCLC is more interested in hiring MBAs that MLSs
  2. OCLC’s primary mission is to “separate libraries from their money” (p. 87)
  3. The launch of the Connexion Client software was a complete fiasco that lasted several years
  4. OCLC does not provide sufficient incentive (or support) for libraries to upgrade records in WorldCat, leading to a serious degradation of quality in the database
  5. OCLC Research is little more than a propaganda machine.

The only one of his allegations that I tend to agree with is 4. The lack of credible evidence and the overall belligerent tone of this essay makes any criticisms Beall levels, regardless of their validity, excessively easy to dismiss. How does this help solve the very real problems that catalogers face today?

May 27th, 2010 | Tags: | Category: ReadingNotes,Sabbatical | http://shirley.alptown.com/blog/archives/624

BETH THORNTON: The Existential Crisis of a Cataloger p. 13-17

Presents a summary of challenges facing cataloging, supported by analysis and quotes from reports by Calhoun (2006) and the UC Libraries Bibliographic Services Task Force (2005), among others.  She describes her personal response to the current situation, including conflicted feelings. Some of the criticisms leveled at the cataloging establishment in these reports are valid, so what can librarians do to address them and remain relevant? Thornton cites a number of available resources:

  1. Libraries continue to advertise cataloging positions, and they are hiring young, intelligent, energetic professionals for these positions.
  2. At least some people within the library community continue to recognize the value of traditional cataloging standards and vocabularies that have evolved over many years and continue to do so.
  3. Cooperative cataloging organizations seek to provide training and opportunities for innovation through collaboration. She cites CONSER efforts to simplify serials cataloging, while continuing to create records that serve user needs.
  4. Catalog advocates like Thomas Mann.

Thornton concludes by saying that librarians should bring valuable cataloging traditions with us as we “chase after the shiny things” that are paving the way to the future.

May 27th, 2010 | Tags: | Category: ReadingNotes,Sabbatical | http://shirley.alptown.com/blog/archives/614

Overall impression

Provides valuable insight into the contemporary library cataloging environment. Concludes that the Library of Congress (LC) currently shoulders a disproportionate share of the cost of producing catalog records, contributing to an overall impression among librarians and vendors that the cost of such production is substantially lower than it actually is. The authors argue that this has contributed to the development of a situation in which neither libraries nor vendors are willing to invest the resources required to do original cataloging of many mainstream materials, leading to the continued accumulation of backlogs and hindering the efficacy of cooperative cataloging programs. Alongside the helpful data and analysis, the report also includes some highly questionable estimates, assumptions and assertions. Excellent bibliography featuring many recent reports and articles analyzing cataloging practice and the future of cataloging.

Opportunities for additional research

  • To what extent do the requirements of complying with national/LC standards currently contribute to the cost associated with original cataloging, especially for organizations that are much smaller than LC, where economies of scale limit development of human and technical resources to support cataloging operations?
  • The methodology utilized to estimate the current number of catalogers in North America is highly questionable. The admittedly rough estimates presented in this report need to be subjected to additional scrutiny and research.  Desperately!
  • This report repeatedly asserts that substantial resources are currently wasted by catalogers who insist upon editing records of already acceptable quality for local use without valid reason or sufficient benefit. The report does not present sufficient evidence to support this assertion.  More focused study is needed to examine this issue, so that we can understand the extent to which local edits are made and why. The report describes this type of editing as entirely unnecessary and “redundant.”  To what extent is this true? What is the true cost-benefit of this type of editing? To what extent is this “redundant” effort the result of current systems limitations and/or architecture, in which copies of the same record are stored in isolation from one another and catalogers have very limited ability to make useful edits to “master” records for a variety of technical and not-so-technical reasons?

Notable quotes

p. 4: “Conclusions are based on careful consideration of survey results; interviews and conversations with practicing librarians and vendors; discussion among members of the Bibliographic Record Production social network; extensive reading; participation in the OCLC/NISO Metadata Symposium (April 2009); and our own direct experience with cataloging production and distribution.”

p. 14: “In response to the query about what else we should know about libraries’ MARC records environments, there was great appreciation for LC’s work as well as pleas that it continue. There is general dismay about the quality of non-LC vendor provided records. There were several negative remarks about LC’s cessation of Series Authority work. Many libraries stated that free records are essential to their operations. And many reported difficulty managing vendor supplied record sets for eBook collections and serials packages.”

p. 25: “This tension — between community values and commercial values, between idealism and pragmatism, between social responsibility and private benefit – has deeply affected some aspects of the library market. Cataloging, regarded by many as the heart of librarianship, is one of those areas.”

p. 26: “In general, libraries understate (or simply don’t recognize) the full costs associated with cataloging. This renders questionable any comparison with stated prices from vendors, who typically do have a good handle on costs, since their continued operation depends upon it.”

p. 27:” [due to the contributions of LC]…an entire industry has developed around free (or at least very cheap) MARC records. …many libraries and vendors benefit from a product for which production costs are not recovered.”

p. 27: “Over the past five years, LC has absorbed significant budget cuts, and faces continuing pressure. It has undertaken major staff reductions, especially in its cataloging operations. CIP, as a program that is not directly related to LC’s mission, and for which the costs of production divert staff resources from other programs, must obviously be considered for adjustment. Given the existing level of dependency, such a change would affect the entire profession and the industry based on that profession.”

p. 27: “The market for cataloging records is in some important respects dysfunctional. In our view, the biggest issue is that the market lacks sufficient incentives to stimulate the production of new cataloging records. Obviously, many books, journals, electronic resources and other items are being cataloged, so some elements of the market are working. But structurally, it seems clear that something is amiss.”

p. 27: “Because of their own staffing constraints, or unwillingness to bear the cost of original record creation, many libraries simply wait for another library to catalog an item they have already received. On average those items are held for three to six months, with periodic searches of OCLC to determine whether another library has blinked. While this makes sense as a way of controlling costs, it does not provide optimal service for users.”

p. 29: “Why do so few libraries join BIBCO or CONSER (which also relies on fewer than 50 members)? And what factors determine which records are contributed? It seems likely that titles most needed and valued locally by the contributing library would take priority. Because most of the big PCC contributors are research libraries, it’s also likely that many titles are specialized, and therefore may not be widely held. Therefore, fewer libraries would benefit from that contribution, as opposed to, say, DVDs, which are cited as a major problem by most libraries in the survey. Regardless, these libraries are committed to contributing to the community. Which titles they catalog would matter less if participation were broader. A cooperative system only works well if everyone participates.”

p. 29: “But somehow the incentives to produce new cataloging records are insufficient, from both the commercial and community viewpoints. Otherwise, there would be greater participation in cooperative programs, and/or more vendors seeking to become cataloging agencies. At bottom, we believe this is because cataloging costs and therefore prices are understated and artificially depressed. An even more sobering possibility is that the profession does not believe that cataloging is worth what it costs to create it; that will be quickly determined once all production costs are factored into the price. In the meantime, this is a market that appears to require adjustment.”

p. 33-34: “While major economies of scale have been realized, troublesome issues remain. First, there are thousands of small libraries that operate outside of the shared cataloging infrastructure. Most of these lack the capacity to produce MARC records—they have no catalogers. Second, catalogers have an almost unstoppable urge to improve, tweak, customize and “localize” national-level records; redundant work is still widespread. Third, despite efficiencies, cataloging backlogs continue to grow, not only for audio-visual materials, rare books, and non-Roman languages, but even for the most commonly-held materials.”

p.34: “We now operate in a context where questions about the efficacy of the MARC record and the centrality of the OPAC are continually posed. We wrestle with keeping libraries relevant, and assuring their participation at the network level. We have also entered an era where questions about cost and return on investment are routinely asked of non?profit entities such as libraries. In the long run, there may be better and cheaper alternatives than MARC. In the short run, there may be ways to reduce the cost of producing MARC records.”

p. 34: “LC’s own ILS cannot, for instance, accept ONIX records directly – they must first be converted to MARC21. This will undoubtedly change over time, but for now, most libraries will continue to need cataloging records delivered in MARC format—it is the only usable solution.”

p. 34: “There remain strong arguments for use of standard cataloging principles–controlled vocabulary, classification, subject analysis, and authority control—packaged and delivered in a consistent format. While MARC records may need to be extended, embellished (supplemented with full text, flap copy, excerpts, user tags), for now they provide a common standard and a cooperative infrastructure that controls costs.”

p. 35:”Once records have been distributed to vendors, most seek to add value. In general, this means matching records to a group of titles being shipped, and adding fund, location and electronic invoicing data, updating or adding proxy prefixes to URLs. These value adds are important to workflows, but do not necessarily change the bibliographic data (although some vendors do perform CIP upgrades if they are needed). Once libraries receive the records, via OCLC, the vendor, or another source, many also seek to add value in other ways. As noted in the survey results, 80% of libraries perform some degree of local editing on the records, to customize them for their own constituency. Increasingly, libraries are adding or linking to table of contents, Amazon, and other external sources in order to enrich the bibliographic description.”

p.36: “Another factor affecting shared capacity is the increasing emphasis on making accessible material that is unique locally. This shifts priorities within a given library, and makes it more difficult to share capacity, and to dedicate hours to work that might benefit the community as a whole over one’s own institution.”

p.36:”While it is true that the records produced by LC need to be better supported, it is difficult to imagine the profession and the industry without them. They provide enormous value, to a degree that is difficult to calculate.”

May 24th, 2010 | Tags: | Category: Metadata,ReadingNotes,Sabbatical | http://shirley.alptown.com/blog/archives/585

Chapter 11 – Using Tools to Create Mashups

Google Mashup Editor (GME) (now defunct, see Google App Engine)

Yahoo!Pipes

Creating mashups is difficult due to the need to learn how to use multiple APIs and knit together information from divergent sources. A framework such as RDF promises to make this easier to accomplish, due to standardization. Cites http://simile.mit.edu/ as an example of a project that provides a practical way to look at Semantic Web technology. Tools facilitate reuse of code, “simplify the routine stuff,” allowing the developer to focus on solving problems at a more abstract level.

The example in this chapter involves building a mashup that shows geotagged photos retrieved via the Flickr API on a Google map. It uses GME and Yahoo!Pipes to extend a mashup created in Chapter 10 using PHP and JavaScript. Now that GME is dead, it appears that you can’t actually run the code developed in this chapter, which is unfortunate. Yee provides an example of iterative app building, however, and it is interesting to note how GME tags, HTML and JavaScript all get jumbled together in a single file.

GME was a text-based programming environment; Yahoo!Pipes is visual. Yahoo!Pipes can convert XML to RSS 2.0.  Chapter 4 includes a tutorial on using Yahoo!Pipes.

Google Code provides hosted Subversion services for software development. Appears to be free.

Advantages of using mashup tools like GME and Yahoo!Pipes:

  • Server is hosted, no need to run you own
  • Hosting server takes care of server-side (i.e. PHP) functionality, so no knowledge of PHP is required; only have to know some JavaScript to handle client-side processing
  • Access to Subversion services
  • Access to apps/code snippets/ideas/tips from other developers working with tools
  • Makes it easier for developers to quickly implement/test ideas, esp. in situations where they don’t have a long-term stake in a particular application.

Disadvantages of using mashup tools:

  • Each tool requires you to learn a new framework
  • Sometimes tools don’t give you exactly what you want/need
  • Must reveal code to host, and your code is “branded” to a particular host
  • Application is dependent upon host; problematic if host discontinues service (as GME already has)
  • Allows non-programmers to create mashups, but leaves them vulnerable if they are dependent upon the underlying services for production applications.

Yee’s overall assessment of mashup tools like GME and Yahoo!Pipes: “I think that the GME and Yahoo! Pipes makes it easier for programmers to create certain types of mashups, though it’s not so clear whether they open up mashup development for a nonprogramming audience.”

Useful links from this chapter:

Yee provides a useful table of mashup tools near the end of the chapter. I created an annotated version of this table and published it via Google Docs.

April 12th, 2010 | Tags: | Category: mashups,ReadingNotes,Sabbatical | http://shirley.alptown.com/blog/archives/539

Biskup, Thomas, Heyer, Nils, and Marx Gómez, Jorge. (2007). Building Sound Semantic Web Frameworks for Scalable and Fault-Tolerant Systems. In: Sugumaran, V. (ed.): Application of Agents and Intelligent Information Technologies. Hershey (PA): Idea Group Publishing, pp. 153-181.

Summary
Describes a theoretical framework for implementing the Semantic Web. Core technologies: agents, ontologies, Web services, and personalization. Provides a useful overview explaining the roles that existing technologies, including: XML, RDF, HTTP and SOAP, and ontology languages like SHOE and DAML, can play. Defines the WASP (Web services, Agents, Semantic Web, Personalization) model and a related system architecture named HIVE. WASP will utilize Web services for communication, Agents for modeling typical tasks and solutions, Semantic Web technologies (e.g. XML, RDF) as “a means to provide data and information in a consistent manner that allows retrieval and reasoning”, and Personalization technologies to configure processes to meet needs expressed by individual users. Agents are the central component in this framework because they implement the business logic. Introduces the concept of hyperservices, which are derived from integrating the core WASP technologies into a new type of service infrastructure. Analyzes and presents functional requirements for Semantic Web architecture.

My impressions
This article helped me get a better understanding of the big picture and how all of the various technological pieces (e.g. XML, RDF, SOAP, etc.) could fit together to realize the Semantic Web. I am intrigued by the central role that agents play in this model, and I could see a role for people (possibly librarians, catalogers in particular) in helping to define and develop agents and metadata schema, ontologies, etc. to provide an increasingly structured environment for agents to operate in.

July 01st, 2009 | Tags: | Category: ReadingNotes,Sabbatical,SemanticWeb | http://shirley.alptown.com/blog/archives/184