Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_0386 |
Symbol | |
ID | 4676970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008784 |
Strand | + |
Start bp | 398101 |
End bp | 399219 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639842913 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_989996 |
Protein GI | 121597831 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.195218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAACA TCGACAATCC GCAACGCGAT CGGGAAACCG GCTTCGCCGA CGCGACGCAG GACACGACGC GCATCGACGA CGTGCGCATC GGCGCGGTGC GCCCGCTCAT CTCGCCCGCG CTGCTGCAGG ACGAACTGCC GGTGCCGAGC GCCGTCCAGG CGCTCGTCGA AGCGAGCCGC GACGCGATCG GCGACGTGCT GCACGGCCGC GACGACCGCC TGCTCGCGAT CGTCGGCCCG TGCTCGATCC ACGATCACGA TCAGGCGCTC GACTACGCGC GCCGGCTGAA AAGCGCCGCC GACGCGCTGC GCGACGACCT GCTGATCGTG ATGCGCGTGT ATTTCGAGAA GCCGCGCACG ACGGTCGGCT GGAAGGGCTA CATCAACGAT CCGCGCCTCG ACGGCAGCTT CCGCATCAAC GAAGGGCTGC GCGCCGCGCG CCGGCTGCTG ATCGACATCA ACGCGCTCGG CCTGCCCGCC GGCACCGAAT TCCTCGATCT GCTGAGCCCG CAGTACATCG CGGATCTGAT CGCCTGGGGC GCGATCGGCG CGCGCACGAC CGAGAGCCAG AGCCACCGGC AGCTCGCGTC GGGGCTGAGC TGCCCGATCG GCTTCAAGAA CGGCACCGAC GGCGGCGTGC AGGTCGCGGC CGACGCGATC GTCGCGGCGC GCGCGAGCCA CGCGTTCATG GGCATGACGA AGATGGGGAT GGCCGCGATT TTCGAGACGC GCGGCAACGA CGCCGCGCAC GTGATCCTGC GCGGCGGCAA GCGGGGCCCG AACTACGATC GCGCGAGCGT CGACGAGGCG TGCGCGGTGC TGCGCGCGGC GGGCCAGCGC GAGCAGGTGA TGATCGACTG CTCGCACGCG AATTCGAACA AGTCGCACCT GCGGCAGGTC GACGTCGCCG AGGACCTCGC GCGCCAGTTG TCGGACGGCG AGCGACGCAT CACCGGCGTG ATGGTCGAGA GCAACCTGGA GGCCGGCCGG CAGGACCTGA AGCCCGGCGT GCCGCTGCAA TACGGCGTGT CGATCACCGA CGCGTGCCTG AGCTGGGCGC AGACCGAGCC CGTGCTCGAC ACGCTCGCGC AGGCGGTGCG GCGGCGGCGC GCCGCCTGA
|
Protein sequence | MQNIDNPQRD RETGFADATQ DTTRIDDVRI GAVRPLISPA LLQDELPVPS AVQALVEASR DAIGDVLHGR DDRLLAIVGP CSIHDHDQAL DYARRLKSAA DALRDDLLIV MRVYFEKPRT TVGWKGYIND PRLDGSFRIN EGLRAARRLL IDINALGLPA GTEFLDLLSP QYIADLIAWG AIGARTTESQ SHRQLASGLS CPIGFKNGTD GGVQVAADAI VAARASHAFM GMTKMGMAAI FETRGNDAAH VILRGGKRGP NYDRASVDEA CAVLRAAGQR EQVMIDCSHA NSNKSHLRQV DVAEDLARQL SDGERRITGV MVESNLEAGR QDLKPGVPLQ YGVSITDACL SWAQTEPVLD TLAQAVRRRR AA
|
| |