Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMAA1958 |
Symbol | |
ID | 3086512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006349 |
Strand | + |
Start bp | 2147074 |
End bp | 2148081 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637565825 |
Product | dihydroflavonol-4-reductase family protein |
Protein accession | YP_106481 |
Protein GI | 53716147 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR03466] hopanoid-associated sugar epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.169655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACA TACAACGCGA TCTCGTTCTG GTGACCGGCG CATCCGGTTT TGTCGGCTCC GCCGTCGCGC GCGCCGCGCG GCGGCGGGGC TATCGGGTGC GCGTGCTCGT GCGGCTGACG AGCCCGCGCA CGAACGTCGC GGATCTCGAC GCCGAGATCG CCACCGGCGA CATGCGCGAC GAGGCATCGA TGCGCGCCGC GCTGCGCGGC GTGCGCCATC TGCTGCACGT CGCCGCCGAC TACCGGCTGT GGGCGCCCGA TCCGCTCGAG ATCGAGCGCG CGAATCTCGA AGGCGCGGTC GCGACGATGC GCGCGGCGCT CGCCGAGGGC GTCGAGCGGA TCGTCTACAC GAGCAGCGTC GCCACGCTGA AGGTGACGCC GTCGGGCGCC TCGGCCGACG AATCGTCGCC GCTCGCCGCC GGGCAGGCGA TCGGCGTATA CAAGCGCAGC AAGGTGCTCG CCGAGCGCGC GGTCGAGCGG ATGATCGCCG AGGACGGGCT GCCCGCGGTG ATCGTCAATC CGTCGACGCC GATCGGCCCG CGCGACGTGA AGCCGACGCC GACCGGCCGG ATCATCGTCG AGGCGGCGCT CGGCAAGATC CCGGCGTTCG TCGACACGGG GCTGAACCTC GTGCACGTCG ACGACGTCGC GCAGGGCCAC CTGCTCGCGC TCGAGCGCGG GCGCGTCGGC GAGCGCTACA TCCTCGGCGG CGAGAACCTG CCGCTGCAGG CGATGCTCGC CGACATCGCG CAATCGACGG GACGCAAGCC GCCGACGATC GCGCTGCCGC GCTGGCCGCT GTATCCGATC GCGCTCGGCG CGGAGGCGGT GGCGAAGCTG ACGAAGCGCG AGCCGTTCGT GACGGTCGAC GGGCTCAGGA TGTCGAAGAA CAAGATGTAT TTCACGTCCG CGAAAGCCGA GCGCGAGCTC GGCTACCGCG CGCGGCCGTA CCGCGAAGGC ATTCGCGACG CGCTCGACTG GTTCAGGCAG GCGGGCTATC TGCGCTGA
|
Protein sequence | MTDIQRDLVL VTGASGFVGS AVARAARRRG YRVRVLVRLT SPRTNVADLD AEIATGDMRD EASMRAALRG VRHLLHVAAD YRLWAPDPLE IERANLEGAV ATMRAALAEG VERIVYTSSV ATLKVTPSGA SADESSPLAA GQAIGVYKRS KVLAERAVER MIAEDGLPAV IVNPSTPIGP RDVKPTPTGR IIVEAALGKI PAFVDTGLNL VHVDDVAQGH LLALERGRVG ERYILGGENL PLQAMLADIA QSTGRKPPTI ALPRWPLYPI ALGAEAVAKL TKREPFVTVD GLRMSKNKMY FTSAKAEREL GYRARPYREG IRDALDWFRQ AGYLR
|
| |