Gene BMAA0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA0035 
Symbol 
ID3087741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp34127 
End bp35707 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content71% 
IMG OID637563972 
Productaldehyde dehydrogenase (NADP) family protein 
Protein accessionYP_104892 
Protein GI53715982 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGATCA CCGGCGAGAT GTTGATTGGC GCGGCCGCGG TGCGCGGTAG CGAAGGCACG 
ATGCGCGCTT ACGCGCCGGC GCAGGGCGTC GAGCTCGAGC CGACGTTCGG CGCGGGCGGT
GCGGCCGACG TCGATCGCGC GTGCCGCCTC GCGAACGCCG CTTTCGATCC CTTTCGTCAG
GCGCCGCTCG AGACGCGCGC ACGCTTTCTC GAGGCGATCG CCGAGCGCAT CGTCGGGCTC
GGCGATCCAT TGATCGAACG CGCGCACGCG GAATCGGCGC TGCCCGTCGC GCGGCTCGAA
GGCGAGCGCG CGCGCACGGT CGGTCAGCTC AGGCTCTTCG CGGCGATCGT GCGCGACGGC
CGCTGGCTGA GCGCGACGCT CGATTCCGCG CAGCCCGAGC GCAAGCCGCT GCCGCGCGCC
GATCTGCGCT TGCAGAAGAT TCCCGTCGGC CCGGTCGCGG TGTTCGGCGC GAGCAATTTC
CCGCTCGCGT TCTCGGTCGC GGGCGGCGAC ACCGCTTCGG CGTTCGCGGC CGGCTGCCCC
GTCGTCGCGA AGGCGCACCC CGCGCATCTC GGCACGTCGG AGCTCGTCGG GCGCGCGATC
CGGCAGGCTG TCGCCGATTG CGGTTTGCAC GAGGGCGTGT TCTCGCTCGT CGTCGGCGTG
GGCAACGCGA TCGGCGAGGC GCTCGTCGCG CATCCCGCGA TCAGGGCGGT CGGCTTCACC
GGCTCGCGCG CGGGCGGCCT TGCGCTGATG GGCGTTGCCG CGCGGCGGCA CGAGCCGATT
CCGGTCTTCG CGGAAATGAG CAGCATCAAT CCGTTCTTCG TGTTGCCCGG CGCGTTGCGC
GCACGCGGTG CGCAAATCGC GCAAGGCTTC GTCGAATCGC TGACGCTCGG CGTCGGGCAG
TTCTGCACGA ACCCGGGGCT CGTCGTCGCA CTCGAAGGGC CCGACCTGAA GGCGTTCGTC
GACGCGGCCG CGCAGGCGCT CTCGCAAAAG GGCGCGCAGA CGATGCTGAC CTCGGGCATC
GCGTCGTCTT ACGAGAGCGC GGTCGCGGCG CGCCGCGCGG CCGCGGGCGT CAGCGAGGTC
GCGCGCGGCG CGCGCAGCGA CGCGCGGAAC GCCGCGTTGC CCGCGCTCTT CACGACGACG
CACACGCAGT TCGTCCAGAA CCCGCAGCTC GAAGCCGAGA TCTTCGGGCC GACGTCGCTC
GTCGTCGCGT GCCGCGACAT CGACGAGATG ATCGCGCTCG CCGAGCATGT CGAGGGGCAA
CTGAGCGCGA CGCTGCATCT CGAAGACGAC GATGTCGATC TGGCGCGCAA ACTGTTGCCG
ACGCTCGAGC GCCGCGCCGG CCGCATCGTC GCGAACGGCT ATCCGACGGG CGTCGAGGTC
GCGTACGCGA TGGTGCACGG CGGGCCGTTT CCGGCGACGT CGGACCCGCG CAGCACATCG
GTGGGTGCGC TTGCGATCGA GCGCTTCCTG CGGCCCGTCT GCTATCAGGA TTTGCCGGCG
GCGTTGTTGC CCGAGGCACT CGCCGACGCG AATCCGCTCG GCCTCTGGCG CCTGCGCGAC
GGCCAACTCG GCAAGGCATG A
 
Protein sequence
MQITGEMLIG AAAVRGSEGT MRAYAPAQGV ELEPTFGAGG AADVDRACRL ANAAFDPFRQ 
APLETRARFL EAIAERIVGL GDPLIERAHA ESALPVARLE GERARTVGQL RLFAAIVRDG
RWLSATLDSA QPERKPLPRA DLRLQKIPVG PVAVFGASNF PLAFSVAGGD TASAFAAGCP
VVAKAHPAHL GTSELVGRAI RQAVADCGLH EGVFSLVVGV GNAIGEALVA HPAIRAVGFT
GSRAGGLALM GVAARRHEPI PVFAEMSSIN PFFVLPGALR ARGAQIAQGF VESLTLGVGQ
FCTNPGLVVA LEGPDLKAFV DAAAQALSQK GAQTMLTSGI ASSYESAVAA RRAAAGVSEV
ARGARSDARN AALPALFTTT HTQFVQNPQL EAEIFGPTSL VVACRDIDEM IALAEHVEGQ
LSATLHLEDD DVDLARKLLP TLERRAGRIV ANGYPTGVEV AYAMVHGGPF PATSDPRSTS
VGALAIERFL RPVCYQDLPA ALLPEALADA NPLGLWRLRD GQLGKA