BugwoodIDS:Subject code

From Bugwoodwiki

This is a guide to the data entry protocols used in the Bugwood Image Database. Each protocol has a list of instructions (in bold) with rationale or notes after each step. Experienced users can skip the notes and follow the steps of the protocol. Questions about the Bugwoood Image Database should be directed to Joe LaForest or any of the Bugwood Image Database Administrators. The Administrator Guide and the User Guide are available and may answer many questions.

What is the Subject code?

The subject code is a number assigned to a topic. The numbers are assigned sequentially and bear no extra meaning. They are purely a unique identifier for an organism or topic. The two main types of subjects in our system are subjects with a taxonomic classification and Non-taxonomic subjects.

The scientific community has sought to provide a scientific name for every living thing. As expected, taxonomic subjects in BugwoodIDS are listed under the scientific taxonomy associated with it. We frequently update our system to keep it current with the latest taxonomic consensus.

Why use a number as the Subject code rather than the actual name?

Using a number to specify the subject provides a layer of protection against user error. If we were to use the actual name of something as the identifier for the subject of an image, there is a possibility that the name could be misspelled.

Here is an example:

  • 165 is the subject code for Gypsy moth (Lymantria dispar).
  • Common misspellings and typos for that species may include:
    • Lymantia dispar
    • Limantria dispar
    • Lymantrea dispr

It is still possible for user error to occur. For instance:

  • 166 is the code for cottonwood leaf miners (Bucculatrix spp.)
  • 65 is the code for silver fir beetle (Pseudohylesinus sericeus)

In this instances, it is more likely that the error will be caught.

Using numbers as a subject code also allows for global changes to taxonomy in BugwoodIDS. For instance, an earlier name for Lymantria dispar was Porthria dispar. To change the scientific name of all images for that species, only the taxonomic information for one subject needs to be changed rather than changing the information on all 465 images that we have.

How is the subject code used?

The subject code is used to identify the main object presented in an image. Only one subject can be assigned to an image. If there are multiple objects in an image, such as an image comparing multiple species within a genus, the most significant species should be listed as the subject and the other species should be noted in the description.

The subject code is also used to identify the host that may be present in an image. Since the subject code is essentially a unique identifier for an organism or non-taxonomic topic, there is no reason to have a separate code.

The intent of the logic in BugwoodIDS is to capture what was the photographer trying to show when the image was taken. This can be separated into two parts:

  1. What is the main thing in the photo?
  2. What is the main thing in the photo affecting/directly related to?

The basic logic behind what is a subject and what is a host is demonstrated in the following simple sentences:

  • This is an image of SUBJECT on HOST.
  • This is an image of SUBJECT in a HOST field.
  • This is an image of SUBJECT (technique) being performed on HOST.
  • This is an image of SUBJECT (equipment) being installed on HOST.

In all cases, the Subject is the most important thing that the photographer was trying to capture.

Where can I find the Subject Code

There are several locations that these codes can be found. For people doing data entry, you can find a link to "Subject Lookup" on the main administration page. If you are just curious and would like to see what codes we have in our system, we have a Public Subject Lookup. Instructions are available for how to use the subject lookup.