Ann: CoreBio 0.4

GavinCrooks · Sep 11, 2006

Announcing CoreBio 0.4
------------------------
CoreBio home page:
http://code.google.com/p/corebio/

Download:
http://corebio.googlecode.com/svn/dist/CoreBio-0.4.1.tar.gz

CoreBio is an open source Python library for bioinformatics and
computational biology, designed to be fast, compact, reliable and easy
to use. Currently, CoreBio includes code to store and manipulate
protein and DNA sequences, read and write many common biological
sequence formats, read BLAST reports and access other computational and
database resources.

The CoreBio project welcomes additional suggestions, code and
participants.

This release includes the following modules:

- data: Standard information used in computational biology.
- matrix: Arrays indexed by alphabetic strings.
- moremath: Various bits of useful math not in the standard
  Python library.
- resource: Access to programs, complex file formats and
  databases.
    - astral: ASTRAL dataset IO.
    - scop: SCOP: Structural Classification of Proteins IO.
    - stride: STRIDE: Protein secondary structure assignment
      from atomic coordinates.
- seq: Alphabetic sequences and associated tools and data.
- seq_io: Sequence file reading and writing.
    - array_io: Read and write arrays of sequence data.
    - clustal_io: Read the CLUSTAL sequence file format.
    - fasta_io: Read and write FASTA format.
    - genbank_io: Read GenBank flat files.
    - intelligenetics_io: Read IntelliGenetics format.
    - msf_io: Read sequence information in MSF format.
    - nbrf_io: Sequence IO for NBRF/PIR format.
    - nexus_io: Read the sequence data from a NEXUS file.
    - null_io: Null sequence IO.
    - phylip_io: Read sequences in interleaved PHYLIP format.
    - plain_io: Read and write raw, unformatted sequence data.
    - stockholm_io: Read STOCKHOLM format.
    - table_io: Read tab-delimited format.
- ssearch_io: Parse sequence search analysis reports.
    - blastxml: Read BLAST XML output.
    - fasta: Read the output of a FASTA similarity search.
- transform: Transformations of Seqs (alphabetic sequences),
  including translation with a full suite of GeneticCodes.
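
For readers unfamiliar with what "translation with a genetic code" involves, here is a minimal, self-contained sketch. It does not use CoreBio and is not its implementation: the names `STANDARD_CODE` and `translate` are made up for illustration, and only the standard genetic code (NCBI translation table 1) is included.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in the conventional T, C, A, G order. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Hypothetical helper table: maps each of the 64 codons to one letter.
STANDARD_CODE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, table=STANDARD_CODE):
    """Translate a DNA string codon by codon, starting at position 0.

    Trailing bases that do not form a full codon are ignored, and
    codons missing from the table (e.g. containing N) become "X".
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table.get(dna[i:i + 3], "X") for i in range(0, usable, 3))
```

With this sketch, `translate("ATGGCCTAA")` returns `"MA*"`. Supporting other genetic codes would amount to passing a different table.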

Gavin Crooks and John Gilman

km · Sep 11, 2006


GavinCrooks · Sep 11, 2006

km said:
Hi,
Why are you reinventing the wheel when Biopython [1] already exists? Is
there any specific reason you wanted to develop CoreBio? Why don't you just
extend the existing Biopython package itself?
Regards,
KM
[1] http://biopython.org

Biopython is a fine project which I have used and contributed to in the
past. Unfortunately, Biopython suffers from a lack of focus. It is a
huge heap of code, some of which is well written, but a lot of which
is not, and a significant portion doesn't work as advertised. There is
no consistency: the code base is idiosyncratic, the documentation is
spotty, and it is very hard to actually discover and use the
functionality that you need. For some common tasks there are three ways
of doing things, only one of which is supported.

CoreBio is intended to be a high-quality, easy-to-use collection of the
core functionality needed for bioinformatics and computational biology.
Compared to Biopython, we are taking a narrow, quality-first rather
than breadth-first approach, with simple APIs that hide as much
complexity as reasonably possible.

As an example of simple APIs, consider the common task of reading a
file of protein sequences. CoreBio will figure out the file format, so
'seq_io.read()' will parse sequence data from fasta, clustal, genbank,
intelligenetics, msf, nbrf/pir, nexus or phylip formatted files.
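
To make the idea of format auto-detection concrete, here is a toy sketch of one plausible approach: try each parser in turn and return the first result that parses cleanly. This is an assumption for illustration, not CoreBio's code. It knows only two formats, and `parse_fasta`, `parse_plain`, `PARSERS` and this `read` are hypothetical names, not the CoreBio API.

```python
import io


def parse_fasta(text):
    """Parse FASTA text into (name, sequence) pairs, else raise ValueError."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("not FASTA: first line must start with '>'")
    records = []
    for ln in lines:
        if ln.startswith(">"):
            records.append([ln[1:].strip(), []])  # new record header
        else:
            records[-1][1].append(ln)             # sequence continuation
    return [(name, "".join(parts)) for name, parts in records]


def parse_plain(text):
    """Treat the input as one raw, unnamed sequence of letters."""
    seq = "".join(text.split())
    if not seq.isalpha():
        raise ValueError("not plain sequence data")
    return [("", seq)]


# Parsers are tried in order, strictest first.
PARSERS = [("fasta", parse_fasta), ("plain", parse_plain)]


def read(fin):
    """Return the records from the first parser that accepts the input."""
    text = fin.read()
    for _name, parser in PARSERS:
        try:
            return parser(text)
        except ValueError:
            continue
    raise ValueError("could not determine the sequence file format")
```

The caller never names a format: `read(io.StringIO(">seq1\nMKV\nLAA\n"))` and `read(io.StringIO("ACGT\nACGT\n"))` both go through the same function. The cost of this design is that each parser must reject foreign input reliably, otherwise an early parser can misread a file meant for a later one.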

Gavin Crooks
