Gene BMA10247_A0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_A0042 
Symbol 
ID4889388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009079 
Strand
Start bp33738 
End bp35708 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content71% 
IMG OID640146323 
Productaldehyde dehydrogenase (NADP) family protein 
Protein accessionYP_001077249 
Protein GI126445650 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGCTGT TCGGCTTCAA GTCGATCGTT GCGCGAGCCC GCGCCGAAGC CGATATCCCG 
TCATGGGCCC TCAACCGGCG GCTCGCGTTC GCGACGATTG CAATCCCGGC CGGGAATGCC
GCGTGTTCGC ATTCGTTTGC CGGACGGCGC GCATGCGCGC AGGACATGGC GACCGGTGCG
GGCACGCGTG CGCCGGTCGG GATGCGCGGC GCACGCGTTG CCGTCTGCCA CGATAATGGA
ATGCGTGCGC GCCGGGGCGG TGCGATCCGT CGAACCGGCC GGTCGCCTCG ACTCGTTCAA
TTCGACCCAT TCGACCCATT CGTCCGCAGG CGGCGCACGC CGCATCGCGA AGCGCACCGC
CCGCGGCCAG GTTTCCAATT CGGAGGAAGC ATGCAGATCA CCGGCGAGAT GTTGATTGGC
GCGGCCGCGG TGCGCGGTAG CGAAGGCACG ATGCGCGCTT ACGCGCCGGC GCAGGGCGTC
GAGCTCGAGC CGACGTTCGG CGCGGGCGGT GCGGCCGACG TCGATCGCGC GTGCCGCCTC
GCGAACGCCG CTTTCGATCC CTTTCGTCAG GCGCCGCTCG AGACGCGCGC ACGCTTTCTC
GAGGCGATCG CCGAGCGCAT CGTCGGGCTC GGCGATCCAT TGATCGAACG CGCGCACGCG
GAATCGGCGC TGCCCGTCGC GCGGCTCGAA GGCGAGCGCG CGCGCACGGT CGGTCAGCTC
AGGCTCTTCG CGGCGATCGT GCGCGACGGC CGCTGGCTGA GCGCGACGCT CGATTCCGCG
CAGCCCGAGC GCAAGCCGCT GCCGCGCGCC GATCTGCGCT TGCAGAAGAT TCCCGTCGGC
CCGGTCGCGG TGTTCGGCGC GAGCAATTTC CCGCTCGCGT TCTCGGTCGC GGGCGGCGAC
ACCGCTTCGG CGTTCGCGGC CGGCTGCCCC GTCGTCGCGA AGGCGCACCC CGCGCATCTC
GGCACGTCGG AGCTCGTCGG GCGCGCGATC CGGCAGGCTG TCGCCGATTG CGGTTTGCAC
GAGGGCGTGT TCTCGCTCGT CGTCGGCGTG GGCAACGCGA TCGGCGAGGC GCTCGTCGCG
CATCCCGCGA TCAGGGCGGT CGGCTTCACC GGCTCGCGCG CGGGCGGCCT TGCGCTGATG
GGCGTTGCCG CGCGGCGGCA CGAGCCGATT CCGGTCTTCG CGGAAATGAG CAGCATCAAT
CCGTTCTTCG TGTTGCCCGG CGCGTTGCGC GCACGCGGTG CGCAAATCGC GCAAGGCTTC
GTCGAATCGC TGACGCTCGG CGTCGGGCAG TTCTGCACGA ACCCGGGGCT CGTCGTCGCA
CTCGAAGGGC CCGACCTGAA GGCGTTCGTC GACGCGGCCG CGCAGGCGCT CTCGCAAAAG
GGCGCGCAGA CGATGCTGAC CTCGGGCATC GCGTCGTCTT ACGAGAGCGC GGTCGCGGCG
CGCCGCGCGG CCGCGGGCGT CAGCGAGGTC GCGCGCGGCG CGCGCAGCGA CGCGCGGAAC
GCCGCGTTGC CCGCGCTCTT CACGACGACG CACACGCAGT TCGTCCAGAA CCCGCAGCTC
GAAGCCGAGA TCTTCGGGCC GACGTCGCTC GTCGTCGCGT GCCGCGACAT CGACGAGATG
ATCGCGCTCG CCGAGCATGT CGAGGGGCAA CTGAGCGCGA CGCTGCATCT CGAAGACGAC
GATGTCGATC TGGCGCGCAA ACTGTTGCCG ACGCTCGAGC GCCGCGCCGG CCGCATCGTC
GCGAACGGCT ATCCGACGGG CGTCGAGGTC GCGTACGCGA TGGTGCACGG CGGGCCGTTT
CCGGCGACGT CGGACCCGCG CAGCACATCG GTGGGTGCGC TTGCGATCGA GCGCTTCCTG
CGGCCCGTCT GCTATCAGGA TTTGCCGGCG GCGTTGTTGC CCGAGGCACT CGCCGACGCG
AATCCGCTCG GCCTCTGGCG CCTGCGCGAC GGCCAACTCG GCAAGGCATG A
 
Protein sequence
MWLFGFKSIV ARARAEADIP SWALNRRLAF ATIAIPAGNA ACSHSFAGRR ACAQDMATGA 
GTRAPVGMRG ARVAVCHDNG MRARRGGAIR RTGRSPRLVQ FDPFDPFVRR RRTPHREAHR
PRPGFQFGGS MQITGEMLIG AAAVRGSEGT MRAYAPAQGV ELEPTFGAGG AADVDRACRL
ANAAFDPFRQ APLETRARFL EAIAERIVGL GDPLIERAHA ESALPVARLE GERARTVGQL
RLFAAIVRDG RWLSATLDSA QPERKPLPRA DLRLQKIPVG PVAVFGASNF PLAFSVAGGD
TASAFAAGCP VVAKAHPAHL GTSELVGRAI RQAVADCGLH EGVFSLVVGV GNAIGEALVA
HPAIRAVGFT GSRAGGLALM GVAARRHEPI PVFAEMSSIN PFFVLPGALR ARGAQIAQGF
VESLTLGVGQ FCTNPGLVVA LEGPDLKAFV DAAAQALSQK GAQTMLTSGI ASSYESAVAA
RRAAAGVSEV ARGARSDARN AALPALFTTT HTQFVQNPQL EAEIFGPTSL VVACRDIDEM
IALAEHVEGQ LSATLHLEDD DVDLARKLLP TLERRAGRIV ANGYPTGVEV AYAMVHGGPF
PATSDPRSTS VGALAIERFL RPVCYQDLPA ALLPEALADA NPLGLWRLRD GQLGKA