ILRT logo Institute for Learning and Research Technology *
*

Review of Personal Bibliographic Systems

ILRT Research Report Number: 1032

Publication Date: 2003/06/09
Last Modified : 2003/06/09 17:32

Author: Paul Shabajee


Abstract
This report is a review of personal bibliographic management software and systems – it provides a fairly comprehensive review of available systems, functionality and issues in personal bibliographic data management. It was produced as part of background research for the SWAD-E Semantic Blogging Demonstrator Research and was funded by HP Labs as part of the SWARA project based in ILRT.

(This is an edited version of report originally published as Appendex D. to SWAD-Europe deliverable 12.1.2: Semantic Blogging and Bibliographies - Requirements Specification by Steve Cayzer and Paul Shabajee - http://www.w3.org/2001/sw/Europe/reports/requirements-demo-1/hp-requirements-specification.html)

Introduction

The personal and small group management and publication of bibliographic information is a ubiquitous problem for academics, researchers, students, writers and more broadly authors of many kinds. Many systems have been developed by individuals, groups and organizations to meet their own specific requirements, some these utilise software, that supports the capture, management, sharing and publishing bibliographic information.

This appendix aims to provide a brief overview of key issues in the context of the creation of bibliographies, these are:

  • the kind of motivations behind the personal and small group use of bibliographic management systems - be they paper or electronic or a hybrid
  • the functionality required by users and offered by the various computer based systems
  • the software systems available
  • relevant standards

As part of this work we reviewed 20 (see below) bibliographic software products and services, reviewed on-line literature and conducted a small interview study of users of such systems (see Appendix C). The interview study consisted of 5 interviews, with individuals from academic and commercial backgrounds, they were asked about their use of bibliographic data, systems including software that they have used to capture, manage and publish the data, along with any positive or negative feedback about the systems. Finally they were asked about 'wish list' items, that they would include in any ideal system.

Reasons for Creating and Using Personal and Small Group Bibliographies

bibliography: "a list of the books and articles that have been used by someone when writing a particular book or article" Cambridge Dictionaries Online (http://dictionary.cambridge.org/)

The definition above indicates that the term bibliography relates to a particular list of books or articles. However in broader definition might cover any type of 'work' or 'resource' and may simply be a list of works held for any purpose.

There are a large number of specific reasons for capturing, managing, sharing and publishing bibliographic data - for example: creation of bibliographies; as part of a personal knowledge management toolset; organizing a collection of books or papers; group annotation or indexing of resources; locating resources previously used. Although no specific research in this area has been identified as part of this study, it seems likely (based on the functionality of software systems and small scale interview survey), that the dominant uses are to produce bibliographies related to specific topics. Specific examples of the need to create a bibliography include: student essays, grant applications, curricula vitae, book chapters, project reports, sharing references with colleagues, and reading lists. In general these lists must be formatted in specific manner depending on the standards of organization and/or publication.

Needs, Functionality and Affordances

In order for a user or a group of users to do any of the things listed in the previous section, it is necessary for any system to facilitate a set of core activities, these can be broken down in a number of ways, the following is a fairly generic set:

  1. assisted capture of necessary bibliographic (and other e.g. notes) data
  2. management and manipulation of the data once captured, for example: editing of records; locating of previously entered data; searching for subsets of data for the creation of a bibliography; ability to categorise and annotate the records.
  3. publishing and sharing of the data in an appropriate bibliographic format.

There will be specific needs related to each of these three categories in any particular context. However if we focus on the examples related to the creation of bibliographies, these include:

Assisted Capture 

Users are likely to obtain data from a range of sources - extracting the necessary bibliographic data from these can be problematic. For example: it is time consuming; it is difficult to ensure accuracy and consistency of terms (e.g. spelling and format of author names); it is often difficult to identify all necessary data from or about a resource. Thus functionality such as the ability to automate or semi-automate the identification and capture of data are helpful.

Management and Manipulation 

Once the data is captured it generally needs to be managed and sometimes augmented. The requirements with respect to the creation of bibliographies are less extensive than other potential applications (e.g. knowledge management). However there are a great many functional requirements even in the relatively simple context. Some specific examples of tasks identified during our interview (see Appendix C) survey include:

  • users may wish to add extra keywords - requiring tools to assist in consistent key-wording e.g. retrieve all records indexed with a particular keyword and index/sub-divide them using finer grain terms
  • editing records to ensure consistency (e.g. UK or US spellings)
  • adding bibliographic data as it becomes available
  • augmenting notes and annotations about a work following further reading.
  • the desire to understand relationships between works, e.g. by drawing timelines or mindmaps from the data

Fundamental to these and others is the ability to retrieve records efficiently and effectively. Coupled with this is the ability to annotate, organise and in some cases visualise the data.

Publishing and Sharing 

'Publishing' of the bibliographic data takes a number of forms. In practice the most basic case is the creation of a simple formatted text list (bibliography) of selected records drawn from the system. The format/style (see below) and media (e.g. paper, word processor file, HTML) of the list, will depend on the particular context. More complex examples include, the automatic embedding of references and citations within a word processor document (cite while you write, type system, see below).

The sharing of bibliographic data with individual colleagues or within teams, is a common practice, e.g. passing on references to colleagues for their use in their report or small teams of researchers keeping a common word processor or bibliographic database file.

Sharing data is used for a number of purposes, those identified as part of our study (see Appendix C) included:

  • prevent duplication of effort in finding references
  • as part of authoring process e.g. academic papers
  • flagging up newly found interesting documents to members of a team
  • share thoughts about research papers or publications (e.g. via notes fields)
  • ensure common format for data captured
  • helping members of a team keep track of new developments/research in a research field

Standards

There are a variety of aspects of bibliographic data capture, management and publishing systems that require standards to be set, if systems and data are to be interoperable. These include, the pieces of data to be captured, the naming of the database fields, the organization of the data, the rules of when and how to use a particular indexing [or keyword] terms, the syntax of the data within fields, and the storage formats. Appendix E reviews some of the more extensive library related standards.

While the library standards are largely 'overkill' with respect to personal and small group management of data to produce a bibliography, many of the basic needs behind the requirements remain the same. From Appendix E:

  1. what information should be captured about the ‘publication’ i.e. cataloguing data
  2. the structure of the record
  3. detailed rules or guidelines for how to deal with specific cataloguing situations/issues e.g. what to do when there are two different formats of an authors name used within one publication. Broadly these also include the use of authority lists/files as definitive authorities over for example the spelling of a place or personal name.
  4. how the subject or content of the ‘publication’ should be described i.e. how the publication should be indexed.
  5. the specific syntax (e.g. use of punctuation) for encoding of the record
  6. The specific transfer protocols for transferring the data between locations.

In these areas, the formal library standards are well defined. However at a personal and small group level, they are largely poorly defined or there are many competing 'standards'. Examples include citation styles and interchange formats; these will be discussed in turn.

Citation Styles

There are a very large number of citation styles i.e. how citations are written to acknowledge works used or referred to in a document, and how the references should be formatted, e.g. one of the most commonly used is the Harvard style in which the: author and year of publication is written (i.e. cited) in the text (e.g. J. Smith (2002)) of a publication. In the case of a book, the reference contains the name(s) of the author(s), editor(s) or the institution responsible for writing the book, date of publication (in brackets), title and subtitle (if any) should be underlined or highlighted or in italics, but must be consistent throughout the bibliography, series and individual volume number (if any), edition (if not the first), place of publication (if known) and publisher. The references are place in alphabetical order of the family name of the main author. Other types of publication will have different data e.g. in the case of a journal the journal volume and number.

The other basic citation method is the 'numeric system' (Vancouver) style, in which references are cited by a number in the text which is then used to label and order the references in the bibliography.

There are generic, high level, international standards, for example:

However with regard to specifics of the syntax and format of citation and references traditionally different disciplines have tended to use different styles e.g.

  • Humanities: MLA (Modern Language Association) - MLA Handbook for Writers of Research Papers, 5th ed. MLA, New York, 1999) & The Chicago Manual of Style,
  • Scientific: APA (American Psychological Association) - Publication Manual of the American Psychological Association, 5th ed. APA, Washington: 2001 & CBE (Council of Biology Editors) - Scientific Style and Format, 6th ed. Council of Biology Editors, 1994)
  • History: Turabian - A Manual for Writers of Term Papers, Theses, and Dissertations 6th edition, Kate Turabian's, University of Chicago Press, Chicago: 1996. and Chicago (as above)

In addition, many journals, publishers, governments, corporate bodies and other organizations define their own 'house style'.

In many ways these style rules are analogous to the International Standard Bibliographic Description, ISBD (http://www.ifla.org/VII/s13/pubs/isbd.htm) and AACR2 ( Anglo American Cataloguing Rules http://www.nlc-bnc.ca/jsc/) standards for larger systems, discussed in Appendix E, in that they provide similar guidelines (e.g. data to be captured and order, syntax etc.), but for a fewer pieces of data (fields/elements).

Information Interchange Formats

There are no ubiquitously used standards to encode personal and small group bibliographic data i.e. there is no equivalent of MARC (see Appendix E) in the library world. In general the commercial software packages specifically designed for individuals and small groups use proprietary formats for internal storage and in many cases export options are very restrictive e.g. EndNote (probably the dominant product for personal level data in UK Higher Education) only has txt, rtf and HTML formats as options under the 'export' menu.

In some cases it is possible to use the tools to output different styles to create output 'styles' that match any other text based format. e.g. in the case of EndNote (see table below) there is a fairly comprehensive style template language. Base level formats such as comma delimited text, can be imported as standard by nearly all of the packages reviewed.

In contrast, the majority of such systems can import a great many formats (including competitor's formats) and as in some cases advanced users can create their own 'import filters' (i.e. parsing rules to extract the necessary data from the source format). In general for example MARC records can be imported and the data parsed to extract only the required fields to match the internal data format, having done the necessary mapping between data standards.

The Dublin Core [DC] Metadata Initiative has a citation working group (http://dublincore.org/groups/citation/) which is working on refinements and encoding of bibliographic data. Dublin Core unqualified or qualified can provide a base level metadata standard. There are also various 'cross walks' (mappings) between formats e.g. MARC to Dublin Core (http://www.loc.gov/marc/marc2dc.html). However as yet Dublin Core has not become widely used as the basis of a data exchange system for commercial systems reviewed.

Other metadata and encoding related initiatives include the TEI (Text Encoding Initiative - http://www.tei-c.org/), EAD (Encoded Archival Description - http://www.loc.gov/ead/) and DocBook (http://www.docbook.org/). These all have elements that deal with bibliographic data, however again as these are designed for different and more complex application areas they are not used in the systems reviewed here.

BibTeX [BIBTEX] is one of the more widely used (and older) formats that has been used and well supported with publishing tools. BibTeX files are text files with appropriate encoding - when printing a document using a LATEX processor, the document contains markup that instructs the processor to import and format the records appropriately. This facility means that the BibTeX file can act as a data exchange format. And for example CiteSeer [CITESEER], a widely used digital document library, provides a BibTeX entry (as plain text) which can be cut & pasted into a BibTeX file. BibTeX formatted data can be imported into some of the commercial systems.

Software

The UK, academic market is dominated by ISI ResearchSoft (http://www.isiresearchsoft.com/) who produce the EndNote, ProCite and Reference Manager products, all of which are widely used in Higher Education institutions. However there are a large number of pieces of software that are designed specifically to facilitate the personal and small group capture, management, sharing and publishing of bibliographic data and other generic applications that are used to do the same (e.g. spreadsheets, word processors, and database programs). The list below details the majority of the more developed products and services, identified while researching this report.

The phrases in quotes marks are taken from the cited Web site and where there are no comments from the authors, this means that the product had the basic functionality (see end of this section), only 5 of the products have been tried/tested in any depth, other data is taken from Web based research.

Archiva (http://www.legend2000.com/archiva/arc_index.asp?arcMenu=Archiva) - "an advanced reference management system with integrated thesis processor."

AskSam

(http://www.asksam.com/)- a generic data management system that is very flexible and facilitates the capture, management and publishing of structured, semi-structured and unstructured data. Thus just as with generic structured database systems, it can be adapted to work with bibliographic data.

Bibliographix

(http://www.bibliographix.com/) marketed as a publication planner as well as a bibliographic management tool, it provides an 'ideas manager' in which ideas can be indexed using a "short thesaurus" hence keeping and linking related ideas. It also enables network access to group bibliographic databases.
Biblioscape (http://www.biblioscape.com/) There are different levels of product. The standard edition has all the basic functionality and in addition allows relationships to be defined between references, e.g. "Supportive", "Contradict" (it calls this cross linking). There is a freeware version (http://www.biblioscape.com/biblioexpress.htm)
Bookends (http://www.sonnysoftware.com/) Is a Mac based product and is "a full-featured and cost-effective bibliography/reference and information management system for students and professionals"
Citation (http://www.citationonline.net/) "Citation is a powerful and easy to use bibliographic database system and notes organizer for research writing."
Database Software (Personal) Generic database systems such as Microsoft Access that individuals or groups use to create small scale applications to manage their bibliographic data. Other examples include: Cardbox: http://www.cardbox.co.uk/ and Idealist (http://www.bekon.com/)
EndNote (http://www.endnote.com/) EndNote is from is from ISI ResearchSoft and seems to be the dominant product for personal bibliographic data management in UK Higher Education. It has all the basic features, including very extensive and customisable import and export filters, and a 'cite while you write' facility which integrates with MS Word and Word Perfect. It provides means of importing images and other files. It can use Reference Web Poster (http://www.endnote.com/rwprod.htm) which enables the Web publishing (on their server) of EndNote bibliographies.
Spreadsheet Software Spreadsheet software is can be used to store bibliographic data as a flat field database - the columns being the data fields. These generally providing sorting of the table, fields can be added as required and more sophisticated versions (e.g. MS Excel - http://www.microsoft.com/office/excel/) provide filtering by columns. However there is no easy way to produce actual bibliographies.
Library Master (http://www.balboa-software.com/) "Library Master automatically formats the bibliography, footnotes and citations for your paper, thesis or book in numerous bibliographic styles. It makes it easy to organize research notes and project records."
Ibidem/Nota Bene (http://www.notabene.com/brochure/ibidem.html) "Store bibliographic information in the simple database format, and Ibidem will generate your bibliographic references for you."
Papyrus (http://www.researchsoftwaredesign.com/) Has DOS (version 7) that runs under Windows, and a Mac version (version 8). In addition to basic functionality this provides linking between records e.g. relationships such as "Reviews" or "Refutes" and keywords e.g. "Synonyms", "Supercategory/Subcategory,". Allows the embedding of images. It claims to allow import of references form 'anywhere' including from existing bibliographies in word processor format, using "artificial intelligence techniques in reading your source file, alerting you to potential problems"
PowerRef for Windows (http://www.cheminnovation.com/powerref.html) provides the majority of basic features along with high levels of flexibility in document type, adding user defined fields also allows attaching of graphics to records.
ProCite (http://www.procite.com/) ProCite is also from ISI ResearchSoft and provides comprehensive basic functionality plus a network version with access to a single file, multiple read but only one person can write at one time. It also captures the URL a Web pages title information and stores in the reference collection, and then text from the page can be pasted directly into the ProCite record - although one interviewee noted problems using this facility.
Pybliographer (http://canvas.gnome.org:65348/pybliographer/) is a Linux based product, it has a basic level of functionality. Licensing is based on a GNU GENERAL PUBLIC LICENSE i.e. "Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed." (http://www.gnu.org/copyleft/gpl.html)
Reference Manager (http://www.refman.com/) Also from ISI ResearchSoft, again with comprehensive basic functionality along with full multi-user networking i.e. multiple uses can both read and write to the shared database.
RefWorks (http://www.refworks.com/ This was the more extensive of the two Internet-based services that we found. It is designed specifically for group as well as individual use.
Scholar's Aid (http://www.scholarsaid.com/) "Scholar's Aid 2000 is a program package that includes a bibliographical data manager called Library and a notes/information manager called Notes", it seems to have a good range of basic functionality. It also claims to export to XML format.
Scribe SA (http://www.scribesa.com/) Is a online service designed to capture data via a forms interface - it has different version for different output formats i.e. APA, MLA or ISO 690.
WebCrimson (http://www.webcrimson.com/) WebCrimson is essentially an online web publishing service. It uses a server based content management system, which has a large number of pre-defined data fields, to generate pages from templates, the user can choose from a variety of predefined template and edit them or create their own. They then add their data to the database via a forms interface and the system generates the pages. One of the pre-defined (customisable) templates is a bibliography template, which (with some tweaking) produces Web-based bibliographies.

There are a number of other products (e.g. InMagic: http://www.inmagic.com/) and systems (e.g. ADLIB: http://www.uk.adlibsoft.com/) that are designed for a larger scale of usage.

As noted above there are very many products - commercial, open-source and developed by individuals - of which this list is a relatively small sample. However, we believe that the major products have been covered. We are not aware of any publicly available data regarding market share of products and/or actual usage of the systems. Anecdotal evidence in the UK points towards Endnote being a dominant product in Higher Education.

In general the systems have similar basic functionality e.g. template based data entry, simple search facility, import from Z39.50 servers, default and customisable output bibliography templates (see below), in many cases integration with word processors, etc. Some also provide support capture of electronic document and/or ideas management/capture along with other functionality (see below) to provide an integrated workspace for researchers, author, students and other potential users.

Key Features

From the above review and our interview study we have complied a list of 'features' that the various systems provide (with respect to bibliographic data management, we have not included below other features such as ideas management). These have been grouped into three sections; data collection/entry, data management/manipulation & sharing, publication/export. However in many cases features cross these areas or choices in one impact on another; we have tried to make these clear in the text.

Before going into detail about particular features it is worth discussing issues related to platform/operating systems. In the majority of cases the systems above are pieces of software installed on individual machines (as opposed to Web-based products, such as RefWorks) The majority of the packages above are Windows products, those which are Mac based or have Mac versions include Bookends, EndNote, PAPYRUS Version 8.0 and ProCite. The only Linux product that we found was Pybliographer.

The feature list below aims to provide a list of the many of the key features (i.e. it is not comprehensive) that are available in the systems reviewed. Some are very common; where features may only be available in a small number of products, this has been noted.

Data Collection/Entry

Under this heading we include all aspects of the capture of bibliographic data from any source.

  • Pre-formatted publication types: different types of publication require that different data is captured e.g. a journal is likely to require 'journal title' and 'volume number', which are not relevant to other types of publication. Most systems provided a number of standard types along with the ability to add others and in some cases edit the standard version e.g. to add a URL to a book reference.
  • Specifically designed forms based input: to facilitate the data entry e.g. most systems order the fields with author(s), title, year and other key elements at the beginning of the form - the majority do not allow customisation of this ordering e.g. one participant in our survey felt that it was annoying to have to go to the end of a form to enter the URL.
  • Predictive and auto-completion: Some systems monitor text as it is entered into a field, and if there is a matching value in the database already it fills in the field, the user can accept the value or keep typing. This can increase the speed of entry considerably, however where many values are similar e.g. journal names, this is less valuable, some may find it annoying.
  • Indexing/Key-wording records: Many systems provide means of applying keywords to documents to allow the categorising /grouping of records. In general these were un-structured, one product enabled users to link keywords via relationships e.g. e.g. "Synonyms,", "Supercategory", "Subcategory,"
  • Authority Files/Drop Down options: One participant in our interview study had created their own database using Microsoft Access, they had created separate tables for authors, journal names and publishers (but not keyword) and added dropdown option lists on their main data entry form to make data entry faster and more consistent. While this is done to some degree by the predictive completion of text (see above) the authority file is a more robust system. None of the systems we actually tested had this 'drop down' type facility.
  • Spell checking: related to authority files is the facility to check spellings of all the text in a record, this was provided in a many of the products.
  • Import filters: these allow the user to import bibliographic data held in file formats different to that of native system, e.g. BibTeX, or data from competing products or CD Rom based databases. Some products provide a simple parsing language to allow user defined filters.
  • Ability to search and retrieve records from remote Internet accessible bibliographic database services: This is via the Z39.50 protocol. Many systems provide a set of standard 'connection files' for commonly used services, these are pre-configured to access these. In general these can be edited and new ones created as necessary - however this requires some work and generally some technical expertise.
  • Automatic field completion for Web pages: One product had the facility to gather data directly from a Web browser and so auto-complete some of the relevant fields and support capture of text from the page into relevant fields.
  • Document capture: Electronic documents themselves (as opposed to simply the bibliographic details) can be captured e.g. PDF and HTML files. These can then be used to do more advanced searching e.g. full text searching.
  • Attaching image objects to a record: Some products allowed images to be attached to the record and stored in the internal database.

Data Management/Manipulation & Sharing

This includes any processing, management, manipulation or sharing taking place between capture and publishing, including sharing of data.

  • Searching/Filtering of records: Probably the most basic requirement, most systems provide tools to search the data using any field and in some cases multiple fields. Results can generally be exported to an external bibliography.
  • Customized/sorted views: The data can be sorted in to different orders e.g. by author or title and in some cases a table (spreadsheet) view. In some cases it is possible to define which fields to view.
  • Defining relationships between records: A small number of the products offered the facility to define relationships between records of the form "Supportive", "Contradict". These can then be traversed to explore the records and select them for exporting into a bibliography. Other systems allow references to be grouped, which is similar in functional terms to keywording and filtering on specific keywords.
  • Hyperlinking to URLs: Many products provided URL fields which provide a means of direct linking to the source document.
  • Network and Multiple User Access: A number of the products are designed to be used by groups. These generally use a server based database or a single database on a network share. These take care of locking the records to prevent simultaneous editing. Two of those reviewed were Web based services in which the data was held on an external server with logon from any Web browser. Of course many other products can be used for non-simultaneous access to a database file on a network share.
  • Duplicate detection: One product provides duplicate detection, hence assisting in maintaining the database, others provide tools that could facilitate this.
  • Global Search and Replace: Some products provide a means of changing a piece of text across all records e.g. a name spelt consistently incorrectly.
  • Saving and Backing up: Many systems had database back ends and so individual edits are automatically saved, however in many cases actual backing up of the whole database, requires technical knowledge e.g. where the files and which require backing up.
  • Merging of database: In many cases a product allowed the use of multiple databases, some provide tools to assist in merging these, however there may be some problems with dealing with duplicate and near duplicate records.

Publication/Export

  • Creation of Bibliographies in commonly used styles: This is probably the most basic publishing feature. The majority of products allowed customisation of the templates or filters using custom languages or integrated tools.
  • Export filters: More generic than bibliography publishing, is the ability to export the database or a selected sub-set of the database to an external format. Most products offered very limited export facilities, compared to their import features. This may be to try and 'tie' customers into their product. The most basic format seems to be standard comma delimited, with a custom ordering of fields.
  • Integration with standard word processors: Many products provide integration with word processors (e.g. MS Word and WordPerfect). In most cases this takes the form of the ability to place a mark (automatically or not) denoting a citation in the document. The program then either automatically adds a reference to the references section at the end of the document, or the document is processed later to do so.
  • Web posting/publishing of bibliography and/or database: A very common facility was to output bibliographies or the data in HTML or to an online and searchable database.
  • Word processor templates for manuscripts of common journals: Using Microsoft Word Macros one product leads the user through the creation of the document with relevant formatting, text styles and citation styles.

Usability

We are unaware of any usability studies with respect to the bibliographic software reviewed. However from our limited review of systems and comments gained during our interview study, it seems likely that there are a number of usability barriers to the easy and continued use of these systems in general and specific problems with individual products.

Central to the problems we have identified seem to be ease of data entry, all participants in our (albeit small) interview study indicated that the time consuming and often 'fiddly' nature of data entry was problematic. Other key issues relate to customization (e.g. difficulty in creating customized styles), integration with other packages (e.g. technical bugs) and the time consuming nature of simple day to day maintenance. The bottom line for all bar one of our interviewees was that they tended to use the software for personal use only when they need to write papers or other documents with bibliographies, rather than as a general repository for bibliographic data.

It would be interesting and useful to conduct, or find existing data about, usability studies in this area. 

Wish Lists

This 'wish list' of features are compiled from the interviewees in our small scale study, and so can only be indicative of such items that might come from a larger survey. It should be noted that in some cases the features are available in some products, however they were not in the system(s) used by the interviewees. These are ordered under rough headings:

Capture:

  • Auto extraction of data from text citations and references in papers, for example the Windows clip board, including BibTeX format.
  • Auto capture of bibliographic data from a paper e.g. where the papers were from a common journal and thus had a common format - import filter for whole papers.
  • Help in 'getting into' a subject area e.g. the common problem of knowing what to ask and what vocabulary to use in a new field. Possibly have a system to cross search other peoples' data and say that 'people who have indexed this paper have used the following terms' or 'expert y has used the following terms to index this paper'.
  • Links to external databases to confirm spellings of names, journal titles etc.
  • Import data from 'table of contents' services data inc. abstracts.
  • Copy some/all details from one record to another e.g. when adding lots of papers by the same author of same journal
  • Capture references at the end of an electronic publication
  • Auto or semi-auto detection of document type (e.g. book, journal, etc)
  • Notification of additions to selected internal and external databases

Management & Augmentation:

  • More effective classification tools inc. more structured vocabularies
  • Access and update [the] data online, from any machine via a browser
  • Hierarchal keyword structures, "so that they are easier to navigate and conceptualise"
  • Richer annotation possibilities other than 'notes' fields and better searching of them.
  • Semi-automated indexing or keywording from the text of an on-line document
  • Visualization of aspects of data e.g. timelines for papers under a given keyword, citation relationships, or identification of 'camps' of researchers with particular points of view...
  • More flexible merging of databases.
  • Semi-automatic up dating of documents that have used a record [e.g. in a bibliography] when the record changes - i.e. the system should remember when and which files have used the individual records and notify the user when changes are made
  • More effective backup

Sharing:

  • More extensive export facilities
  • See or be notified, when a record was added or edited and by who
  • Ability to select records and e-mail selection from within system
  • Ability to search other people's bibliographic databases e.g. known expert/specialist or researcher in field, in particular found via publications e.g. person x published paper on y, therefore they might have a bibliographic database.

Publication:

  • Output to any bibliographic software
  • Ability to integrate (export to and import from) and output to other applications e.g. graphics, mind mapping, database or visualization program, e.g. export data to a mind mapping program so that main branches were authors, next level publication year and then title
  • Easy selection of records for inclusion in a bibliography e.g. tick boxes next to record on screen, multiple select (using ctrl, left mouse button), shopping cart type approach
  • XML output and input

While these are necessarily very extensive or generalisable to the wider community of users and potential users of such systems, they do give some insight into the perceived needs of users and the significant potential for improvements in design, and functionality of the existing software and services.

Sources of Reference

These sources were the main Web sites from which generic background information was gained as part of our on-line literature review. In the majority of cases information about products and services came from the publishing organisations' Web sites. Other citations are referenced directly in the text.

Evans, Peter. (2002) A review of 3 major Personal Bibliographic Management tools. Available: http://www.biblio-tech.com/html/pbms.html.

Information Systems and Technology University of Waterloo. (2000) Which Personal Bibliographic Management Package Should I Choose? Available: http://ist.uwaterloo.ca/ew/biblio/which.html.

Kent, T. (2002) Bibliographic Software. The UK Online User Group. Available: http://www.ukolug.org.uk/links/biblio.htm.

Maggie. Shapland. (1999) Evaluation of Reference Management Software on NT. 2001. University of Bristol. Available: http://www.cse.bris.ac.uk/~ccmjs/rmeval99.htm.

Memorial University of Newfoundland Libraries, Bibliographic Control Services (2002) Technical standards for electronic bibliographic data/metadata. Available: http://www.mun.ca/library/cat/standards.htm.

Morton, D. (2001) Personal Bibliography Software. Available: http://library.uwaterloo.ca/~dhmorton/dnh4.html.

Online Computer Library Center. (2003) OCLC Online Computer Library Center Homepage. Available: http://www.oclc.org/home/.