gcskew('NC_000913'); gcskew('somewhere/hoge.gbk'); gcskew('embl:U00096');
codon_counter amino_counter
codon_amino_counter _codon_amino_printer _codon_usage_printer
print readFile('http://togows.dbcls.jp/entry/pubmed/12538262')
$gb->find() |grep replication |head -n 5
G > load ecoli G > gcskew [ecoli graph comes up] G > load bsub G > gcskew [bsub graph comes up]
$gb = load "ecoli"
foreach $cds ($gb->cds()){
next if ($gb->around_startcodon($cds, 50, 50) =~ /pattern/);
}
foreach $cds ($gb->cds()){
my $genelength = length($gb->get_geneseq($cds))
next if ($genelength > 10000 || $genelength < 20);
}
$db = db_load("gene",
-driver=>"mysql",
-database=>"mus_musculus_core_46_36g",
-host=>"ensembldb.ensembl.org",
-port=>3306,
-primarykey=>"gene_id"
);
say $db->{239967}->{status};
exported methods are:
db_dbi
db_exists
db_path
db_set_path
db_save
db_load
most work just like sdb in G::DB::SDB.
$gb->{FEATURE4}->{translation}
$gb->{CDS2}->{translation}
$gb->{thrA}->{translation}
$gb->{b0002}->{translation}
$gb->next_feature("thrA");
$gb->get_geneseq("thrA");
$gb->startcodon("thrA");
$gb->before_startcodon("thrA");
G::SystemsBiology::Serizawa
G::SystemsBiology::EcellReader
G::SystemsBiology::Pathway
G::Tools::RCluster (this is moved to Rcmd::Clustering
mean
sum
variance
standard_deviation
min
mindex
max
maxdex
median
least_squares_fit
cor
ttest
where cor include options for Spearman's, Pearson's, and Kendall's methods, and ttest supports both independent and paired Student's t-test.
query_arm dist_in_cc set_strand genes_from_ori
We are no longer supporting Windows
The former 'Prelude' core has been subdivided into micro module cores each responsible for specific functions. This enables more Object Oriented style of architecture, and better flexibility in plugging. Another advantage is to use Skyline core functions without doing 'use G;'. This can be alternatively called internally as 'use G::Skyline'.
G::Skyline inherits G::IO::GenBankO, which inherits G::IO::GenBankI, which inherits G::Prelude.
The following is the explanation of [new|altered] modules:
G.pm Inherits G::Skyline. Now only responsible for 'new G()' options and output options G::Prelude Core of core. Base class with manipulation methods only. G::IO::GenBankI Inherits G::Prelude. Embedded with GenBank parsers. G::IO::GenBankO Inherits G::GenBankI. Responsible for GenBank flatfile output. G::IO::Bioperl Responsible for the conversion of Bio::Seq::RichSeq to G::Skyline G::IO::Annotation Embedded with annotation functions G::Skyline Dummy class that multiply inherit the above classes
complement() and translate() are moved to G::Seq::Primitive so that they can be 'use'-ed internally from odyssey modules.
As stated above, with stronger object oriented form, it is easier to expand the functions without risking new bugs, and G::IO:: can be plugged with new parsers, such as that for FASTA, EMBL, and so on for more speed.
Conversion of Bio::Seq::RichSeq object to G::Skyline object is further enhanced, now almost perfectly mirroring including the header and the 'join' field.
G::IO::Bioperl::convert(struct Bio::Seq::RichSeq, struct G::Skyline);
easily converts the bioperl object.
SubOpt protocol is also enhanced so that now it can take bioperl object as well as G object. Therefore, all odyssey functions can be directly accessed from bioperl as follows:
$in = new Bio::SeqIO('-file'=>"hoge.embl");
$bioperl = $in->next_seq();
gcskew($bioperl, -window=>1000, -output=>"show");
Multiple options can now be used. For example,
$gb = new G("hoge.embl", "EMBL", "no msg", "without annotation", "longest ORF annotation", "multiple locus");
is now possible. Order can be random.
Database formats supported is also expanded, now supporting all of the following: GenBank, Fasta, SCF, PIR, EMBL, raw, GCG, ace, BSML, swiss, phd, game, qual. Moreover, file format option is now case insensitive. 'GENBANK', 'GenBank', 'genbank' all point to the same old GenBank. Furthermore, Skyline core can now automatically guess the format of the database even without the format options. Thus 'new G("hoge.gbk")' load genbank file, and 'new G("hoge.bsml")' automatically loads in BSML format. From now on, you probably do no have to care about the database formats at the input to G-language GAE.
Network retrieval is also automatically interpreted, and enhanced. Accession numbers starting with NC_ is taken from RefSeq in NCBI, and other accession numbers are taken from ordinary GenBank ftps. Again, here you do not have to enter 'net GenBank' option, although if you prefer that, you can.
Because BSML output is enabled, annotated G object can directly be put into the LabBook Genome XML Viewer (TM) which is a free genome viewer and editer of BSML files. This enables graphical genome view, circular plasmid view, with zooming and refering functions. Also, it is worth noting that it is the first step of G-language GAE to be able to input/output XML formats of databases. GAME XML format is also supported.
Bug in Prelude ($gb→del_key(), parsing of FT w/o value, -w switch compliancy) is fixed, and several new modules are incorporated into the Odyssey functions layer. The documentations for the new functions will be available at our website by the official release of version 2.