dbifasta
Once the EMBOSS indices for this file have been created, the database can be defined in the file emboss.defaults with an entry such as:
DB emrod [ type: N format: fasta method: emblcd directory: /data/embl/fasta ]
% dbifasta
Database indexing for fasta file databases
Database name: EMROD
simple : >ID
idacc : >ID ACC
gcgid : >db:ID
gcgidacc : >db:ID ACC
dbid : >db ID
ncbi : | formats
ID line format [idacc]: idacc
Database directory [.]: data
Wildcard database filename [*.dat]: emrod
Release number [0.0]:
Index date [00/00/00]:
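The ID line formats listed in the prompt above differ only in where the database name, ID and accession number appear on the FASTA header line. The following is a minimal sketch of those layouts, not dbifasta's own parser: the function name `parse_fasta_id_line` and the sample headers are hypothetical, and the `ncbi` layout is left out because the prompt does not spell it out.

```python
def parse_fasta_id_line(line, idformat="idacc"):
    """Split a FASTA header line into db, id and acc fields.

    Illustrative only: follows the layouts shown in the ID line format
    prompt. The 'ncbi' layout is not handled.
    """
    if not line.startswith(">"):
        raise ValueError("FASTA header lines start with '>'")
    tokens = line[1:].split()
    if not tokens:
        raise ValueError("empty header line")
    result = {"db": None, "id": None, "acc": None}

    if idformat in ("simple", "idacc"):
        # >ID  or  >ID ACC
        result["id"] = tokens[0]
        rest = tokens[1:]
    elif idformat in ("gcgid", "gcgidacc"):
        # >db:ID  or  >db:ID ACC
        db, sep, seq_id = tokens[0].partition(":")
        if not sep:
            raise ValueError("expected a '>db:ID' header")
        result["db"], result["id"] = db, seq_id
        rest = tokens[1:]
    elif idformat == "dbid":
        # >db ID
        if len(tokens) < 2:
            raise ValueError("expected a '>db ID' header")
        result["db"], result["id"] = tokens[0], tokens[1]
        rest = tokens[2:]
    else:
        raise ValueError(f"unsupported idformat: {idformat}")

    # Only the *acc layouts carry an accession number after the ID.
    if idformat in ("idacc", "gcgidacc") and rest:
        result["acc"] = rest[0]
    return result
```

For example, `parse_fasta_id_line(">SEQ1 ACC001 sample entry", "idacc")` yields the ID `SEQ1` and the accession `ACC001`, while the same line parsed as `simple` yields only the ID.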
Standard (Mandatory) qualifiers:
[-dbname] string Database name
-idformat menu ID line format
-directory directory Database directory
-filenames string Wildcard database filename
-release string Release number
-date string Index date
Additional (Optional) qualifiers: (none)
Advanced (Unprompted) qualifiers:
-fields menu Index fields
-exclude string Wildcard filename(s) to exclude
-maxindex integer Maximum index length
-sortoptions string Sort options, typically '-T .' to use
current directory for work files and '-k
1,1' to force GNU sort to use the first
field
-[no]systemsort boolean Use system sort utility
-[no]cleanup boolean Clean up temporary files
-indexoutdir outdir Index directory
Associated qualifiers: (none)
General qualifiers:
-auto boolean Turn off prompts
-stdout boolean Write standard output
-filter boolean Read standard input, write standard output
-options boolean Prompt for standard and additional values
-debug boolean Write debug output to program.dbg
-verbose boolean Report some/full command line options
-help boolean Report command line options. More
information on associated and general
qualifiers can be found with -help -verbose
-warning boolean Report warnings
-error boolean Report errors
-fatal boolean Report fatal errors
-die boolean Report deaths
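The qualifiers listed above allow the interactive session to be replaced by a single non-interactive command. The sketch below assembles such a command as an argument list using only qualifiers from this listing; the helper name `build_dbifasta_command` is hypothetical, and the values in the usage lines are the ones from the example session.

```python
import shlex


def build_dbifasta_command(dbname, directory=".", filenames="*.dat",
                           idformat="idacc", release="0.0", date="00/00/00"):
    """Return an argument list that supplies every prompted value.

    Defaults mirror the defaults shown in the prompts; -auto turns off
    prompting so the run does not wait for input.
    """
    return [
        "dbifasta",
        "-dbname", dbname,
        "-idformat", idformat,
        "-directory", directory,
        "-filenames", filenames,
        "-release", release,
        "-date", date,
        "-auto",
    ]


if __name__ == "__main__":
    cmd = build_dbifasta_command("EMROD", directory="data", filenames="emrod")
    # Print a shell-safe form of the command instead of running it.
    print(shlex.join(cmd))
```

On a system where EMBOSS is installed, the returned list could be passed to `subprocess.run(cmd, check=True)`; passing a list rather than a shell string means a wildcard such as `*.dat` reaches dbifasta unexpanded.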
| Standard (Mandatory) qualifiers | Description | Allowed values | Default |
|---|---|---|---|
| [-dbname] (Parameter 1) | Database name | A string from 2 to 19 characters, matching regular expression /[A-z][A-z0-9_]+/ | Required |
| -idformat | ID line format | simple, idacc, gcgid, gcgidacc, dbid, ncbi | idacc |
| -directory | Database directory | Directory | . |
| -filenames | Wildcard database filename | Any string is accepted | *.dat |
| -release | Release number | A string up to 9 characters | 0.0 |
| -date | Index date | Date string dd/mm/yy | 00/00/00 |

| Additional (Optional) qualifiers | Description | Allowed values | Default |
|---|---|---|---|
| (none) | | | |

| Advanced (Unprompted) qualifiers | Description | Allowed values | Default |
|---|---|---|---|
| -fields | Index fields | | acnum |
| -exclude | Wildcard filename(s) to exclude | Any string is accepted | An empty string is accepted |
| -maxindex | Maximum index length | Integer 0 or more | 0 |
| -sortoptions | Sort options, typically '-T .' to use current directory for work files and '-k 1,1' to force GNU sort to use the first field | Any string is accepted | -T . -k 1,1 |
| -[no]systemsort | Use system sort utility | Boolean value Yes/No | Yes |
| -[no]cleanup | Clean up temporary files | Boolean value Yes/No | Yes |
| -indexoutdir | Index directory | Output directory | . |
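The constraints on -dbname, -release and -date in the table above can be checked before dbifasta is run. The sketch below is one possible reading of those rows, not EMBOSS's own validation: the function names are hypothetical, the regular expression is assumed to apply to the whole name, and the date check covers only the dd/mm/yy layout, since the default 00/00/00 is not a calendar date.

```python
import re

# Pattern copied from the -dbname row. Note that the range A-z also spans
# the ASCII punctuation between 'Z' and 'a', such as '[' and '^'.
DBNAME_RE = re.compile(r"[A-z][A-z0-9_]+")

# dd/mm/yy layout only; no calendar validation is attempted.
DATE_RE = re.compile(r"\d{2}/\d{2}/\d{2}")


def is_valid_dbname(name):
    """True for a 2 to 19 character name that fully matches the pattern."""
    return 2 <= len(name) <= 19 and DBNAME_RE.fullmatch(name) is not None


def is_valid_release(release):
    """True for a release string of up to 9 characters."""
    return len(release) <= 9


def is_valid_date(date):
    """True for a string laid out as dd/mm/yy."""
    return DATE_RE.fullmatch(date) is not None
```

Under these assumptions `is_valid_dbname("EMROD")` is true, while a one-character name, a name starting with a digit, or a 20-character name is rejected.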
This file contains non-printing characters and so cannot be displayed here.
dbifasta creates four index files. All are binary but with a simple format.
| Program name | Description |
|---|---|
| dbiblast | Index a BLAST database |
| dbiflat | Index a flat file database |
| dbigcg | Index a GCG formatted database |
| dbxfasta | Database b+tree indexing for fasta file databases |
| dbxflat | Database b+tree indexing for flat file databases |
| dbxgcg | Database b+tree indexing for GCG formatted databases |