Gene BMA10247_A1901 details


Gene Information

Locus tag: BMA10247_A1901
Symbol:
ID: 4890470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10247
Kingdom: Bacteria
Replicon accession: NC_009079
Strand:
Start bp: 1832273
End bp: 1833979
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 67%
IMG OID: 640148166
Product: aldehyde dehydrogenase family protein
Protein accession: YP_001079078
Protein GI: 126447745
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02288] phenylacetic acid degradation protein paaN
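
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (neither convention is stated in the record). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1832273
end_bp = 1833979

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1707 bp) and Protein Length (568 aa).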


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0596942
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCATC CTCTGTTCAC GAAGCATGAA GACACGTTGA AGCACGCGCT CTCCACGATC 
GAAACGCGCG GCTACTGGAG CCCGTTCGCC GAGATGCCGA GCCCCAAAGT GTACGGGGAA
AGCGCCAATA CAGACGGCGA AGCAGCATTC AAAGCCCAGT TGGACAAGCC CTTTGAACTC
GACCAACCCG CCTCGGGCGG AACGGTCGGC GCCGAGCGTT CGCCATACGG GTTTGCGCTC
GGCGTCCGCT ACCCGAAGTC GACGCCCGAC GAGCTCATCG CCGCCGCCGC GCAGGCGGAA
TGCGCGTGGC GCAAGGCCGG GCCGACCGCG TGGGCTGGCG TGTGTCTCGA AATTCTCGCC
CGGCTGAATC GCGCGAGCTT CGAGATCGCA TACAGCGTGA TGCACACCAC GGGACAGGCG
TTCATGATGG CGTTCCAGGC GGGCGGCCCG CACGCGCAGG ATCGCGCGCT CGAAGCCGTC
GCCTATGCAT GGCAAGAACT GCAGCGCATT CCCGCCGAAG CGCACTGGGA GAAGCCGCAG
GGCAAGAACC CGCCGCTCGC GATGCGCAAG CGCTACACGA TCGTGCCGCG CGGGACGGGG
CTCGTGCTCG GGTGCTGCAC GTTCCCGACC TGGAACGGCT ATCCCGGTCT GTTCGCCGAT
CTGGCGACCG GCAACACAGT CATCGTCAAG CCGCATCCCG GCGCGATCCT GCCGCTCGCG
ATCACCGTGC GCATCGCGCG CGACGTGCTG CGCGAAGCGG GCTTCGATCC GAACATCGTC
ACGCTGCTCG CGACCGAAGG AAACGACGGC GCACTCGTCC AGAACCTGGC GCGCCGGCCG
GAAATCAAGC TGATCGACTT CACCGGCAGC TCGCAAAACG GCACCTGGCT CGAGCGCAAT
GCGTACCAGG CGCAGGTCTA TACGGAGAAG GCGGGCGTCA ACCAGATCGT GATCGACTCC
GTCGACGACC TGAAAGCCGC CGTCAAGAAC ATCGCGTTCT CGCTTGCGCT CTACTCCGGC
CAGATGTGCA CAGCGCCGCA AAACATCTAT GTGCCGCGTG ACGGCATCCG CACCGCCGAA
GGGCACGTCA GCTTCGACGA CGTCGCGCGG GCGATCGCCG ACGCCGTGCA AAAGCTGACG
GGCGACCCGG CACGCTCGGT CGAACTCATC GGGGCGCTGC AGAACGCAGG CGTCGCGGCA
CGTATCGACG AAGCGCGCAA GCTCGGCCGC ATTCTCGCCG ACAGCCAGGC GCTCGAGCAC
CCGGCATTCA AGGACGCGCG CGTGCGCACG CCGCTCGTGC TGCAACTCGA CGTCGCGGAC
CGTGCGAAGT ACACGCAGGA ATGGTTCGGT CCGATCTCGT TCGTCATCGC GACCGATTCG
ACTGCGCAAT CACTCGATCT CGCCGGCTCG ATCGCGGCCG AGCATGGCGC GCTCACGCTG
TCCGTCTATA GCACGGACGA CGCCGTCGTC GAAGCGGCGC ACGAAGCGGC GGTGCGCGGC
GGCGTCGCGC TGTCGATCAA TCTGACGGGC GGCGTGTTCG TCAATCAGTC GGCGGCGTTC
TCCGACTTTC ACGGCACGGG CGCCAATCCG GCCGCGAATG CGTCGCTCGC CGACGCCGCG
TTCGTCGCGA ACCGCTTCCG CGTCGTTCAG AGCCGCCACC ATGTTGCGCC GAAGGCGGCT
CCCGCGGAAG CCGGCCAAAC GGCATAA
 
Protein sequence
MTHPLFTKHE DTLKHALSTI ETRGYWSPFA EMPSPKVYGE SANTDGEAAF KAQLDKPFEL 
DQPASGGTVG AERSPYGFAL GVRYPKSTPD ELIAAAAQAE CAWRKAGPTA WAGVCLEILA
RLNRASFEIA YSVMHTTGQA FMMAFQAGGP HAQDRALEAV AYAWQELQRI PAEAHWEKPQ
GKNPPLAMRK RYTIVPRGTG LVLGCCTFPT WNGYPGLFAD LATGNTVIVK PHPGAILPLA
ITVRIARDVL REAGFDPNIV TLLATEGNDG ALVQNLARRP EIKLIDFTGS SQNGTWLERN
AYQAQVYTEK AGVNQIVIDS VDDLKAAVKN IAFSLALYSG QMCTAPQNIY VPRDGIRTAE
GHVSFDDVAR AIADAVQKLT GDPARSVELI GALQNAGVAA RIDEARKLGR ILADSQALEH
PAFKDARVRT PLVLQLDVAD RAKYTQEWFG PISFVIATDS TAQSLDLAGS IAAEHGALTL
SVYSTDDAVV EAAHEAAVRG GVALSINLTG GVFVNQSAAF SDFHGTGANP AANASLADAA
FVANRFRVVQ SRHHVAPKAA PAEAGQTA
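
The relationship between the gene and protein sequences can be spot-checked by translating the two ends of the gene. The following is a minimal Python sketch, not the tool that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 27 nucleotides, copied from the gene sequence.
first_30 = "ATGACCCATC CTCTGTTCAC GAAGCATGAA"
last_27 = "CCCGCGGAAG CCGGCCAAAC GGCATAA"

print(translate(first_30))  # MTHPLFTKHE
print(translate(last_27))   # PAEAGQTA*
```

The first result matches the opening ten residues of the protein sequence, and the second matches its final eight residues followed by the stop codon.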