Notes on installing gmod 0.002 alpha on Mac OS 10.3


Contents


Installation

Fan Kang, our database administrator at the SGD Princeton colony, took these excellent notes on installing gmod 0.002 alpha (and the required perl modules) on her G4 desktop running 10.3:
  1. Perl module installation
  2. chado installation
  3. Gbrowse installation: note that Fan successfully installed Gbrowse, but did not go on to configure it to read from the chado schema because we have now moved on to doing the installation on a server.

Suggestions for installation improvements:

Data loading

Used the loading scripts that came with gmod 0.002 alpha:

Gene ontology

Initial attempt when running 'make ontologies': DBI connect error (see
install notes for details).
Problem: several parameters were changed in $PGDATA/pg_hba.conf and $PGDATA/postgresql.conf. it seems the TCP/IP setting didn't take effect with 'pg_ctl reload'.

Solution: shut down and re-start.

Sequence ontology

Problem: so.ontology file was out of sync with the relationship types/predicates hard coded in the global vars section of the loading script (specifically the %oborelmap hash). Basically, a new type, derived_from, was in the so.ontology but gmod_load_ontology.pl was not expecting it (wasn't an element in %oborelmap). The error was something like "cannot insert null into accession".

Solution: just added 'derived_from' to the hash. NOTE: this problem has since been fixed in the cvs on the gmod site!

GFF3 file

Used the GFF3 file in development by Stan for SGD, based on the most recent GFF3 documentation. Changes I made to the GFF3 file to eliminate errors:
  1. replaced "LTR" type (3rd column) with long_terminal_repeat to match the SO term: this is an SGD data issue that we will soon fix.
  2. removed the lines with "landmark" type in the 3rd column: Gbrowse uses this type. For use with GBrowse, I will manually load this type in the next time around (though maybe the load_gff3.pl script can be modified to handle it?)
  3. changed format of ## sequence-region lines to "regular" gff lines with chromosomes, for example, changed:
    ##sequence-region       chrI    1       230210
    
    to:
    chrI	SGD	chromosome	1	230210	.	.	.	ID=chrI
    
Before doing 1. and 2. the error was:
Either the Sequence Ontology was incorrectly loaded,
or this file doesn't contain GFF3 at gmod_load_gff3.pl line 252,  line 34.
Before doing 3., the error was:
Unable to find a source feature id for the reference sequence in this line:
chrII   SGD     gene    646     1128    .       +       0       ID "YBL113W-A"  ; Note "Identified by gene-trapping, microarray-based expression analysis, and genome-wide homology searching"  ; dbxref "SGD:S0028599"  ; orf_classification Dubious

That is, chrII should either have a entry in the 
feature table or earlier in this GFF file and it doesn't.
The modified gff3 file can be found here.

Possible bug?

Suggestions for 0.003 data loading:

last update: