Gene BURPS668_A0920 details


Gene Information

Locus tag: BURPS668_A0920
Symbol: mmsA
ID: 4885865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 901130
End bp: 902731
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 68%
IMG OID: 640130860
Product: methylmalonate-semialdehyde dehydrogenase
Protein accession: YP_001061919
Protein GI: 126442999
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
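
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(901130, 902731)
print(length)                  # 1602
print(protein_length(length))  # 533
```

Both values agree with the Gene Length and Protein Length fields of this record.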


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGCCGG CGTGCCGCTT CGCATCAACC GGCCGGCGGC GCGCCGCCGG CCATAGCCAG 
GAGAATCTCA CGATGAACGC AACTCCGTCG TCCCGGAAGG GACATCATGT GCCGACCGTG
AAACTGTTGA TCGCCGGCGA ATTCGTCGAA TCCCATGCGA CCGAGTGGCG CGACATCGTC
AACCCGGCGA CTCAGGAACT GCTCGCGCGC GTGCCGTTCT CGACCGTGGC CGAAGTCGGC
GCGGCCGTCG AGGCCGCGCA TGCCGCGTTC GCGAAATGGA AGAGCACGCC GATCTCCGCG
CGCATGCGCA TCATGCTGAA GTTCCAGGAT CTCGTGCGCG CGAACCTGCC GCAGATCGCG
AAGACGCTGA CGGCCGAGCA GGGCAAGACG CTGCCCGACG CCGAAGGCGA CGTGTTCCGC
GGCCTCGAGG TGGTCGAGCA CGCGTGCTCG GTCGGCACGC TGCAACTGGG CGAGTTCGCG
GAGAACGTCG CGGGCGGCGT CGATACGTAC ACGCTGCGCC AGCCGCTCGG CGTGTGCGTC
GGCATCACGC CGTTCAACTT CCCCGCGATG ATCCCGCTAT GGATGTTCCC GATGGCGATC
GTCTGCGGCA ACACGTTCGT GCTGAAGCCG TCCGAGCAGG ATCCGCTGTC GACGATGCAG
CTCGTCGAGC TCGCGATCGA GGCGGGCGTG CCGAAGGGCG TGCTCAACGT CGTGCACGGC
GGCAAGGAAG TCGTCGACGC GCTGTGCTCG CATCCGCTCG TGAAGGCGAT TTCGTTCGTC
GGCTCGACGG CCGTCGGCAC GCACGTGTAC CGGCTCGGCA GCGAGCACGG CAAGCGCGTG
CAATCGATGA TGGGCGCGAA GAACCATGCG GTGATCCTGC CCGATGCGAA CCGCGAGCAG
ACGGTGAACG CGCTCGTCGG CGCGGCGTTC GGCGCGGCGG GCCAGCGCTG CATGGCGACT
TCGGTCGCGG TGCTCGTCGG CGCGGCGCGC GACTGGCTGC CCGACATCGT CGCGAAAGCG
AAGACGCTGA AGGTCAACGC GGGCGCGGAA GCGGGCACCG ACGTCGGCCC CCTGGTGTCG
CGCGCGGCGA AGCAGCGGGT GCTCGGCCTC ATCGAGACCG GCGAACAGGA AGGCGCGAGG
CTCGTGCTCG ACGGCCGCGG CGTGAGCGTG CCCGGCTATG AGCACGGCAA TTTCGTCGGC
CCGACGATCT TCGCGGACGT GAGGCCGGAG ATGTCGGTCT ACACGCATGA AATCTTCGGC
CCGGTGCTGT GCGTGATGTC GGTCGACACG CTCGACGAGG CGATCGCGCT CGTCAACGCG
AATCCGTTCG GCAACGGCGT CGGCCTGTTC ACGCAGAGCG GCGCGGCCGC GCGCAAGTTC
CAGAGCGAGA TCGACATCGG CCAGGTCGGC ATCAACATTC CGATTCCGGT GCCGGTGCCG
TTCTTCAGCT TCACGGGCTC GCGCGGCTCG AAGCTCGGCG ATCTCGGCCC GTACGGCAAG
CAGGTCGTGC AGTTCTACAC GCAGACGAAG ACCGTCACCG CGCGCTGGTT CGACGACGAT
GCGACGGCGG GCGCCGTCAA CACGACGATT CGCCTGCACT GA
 
Protein sequence
MTPACRFAST GRRRAAGHSQ ENLTMNATPS SRKGHHVPTV KLLIAGEFVE SHATEWRDIV 
NPATQELLAR VPFSTVAEVG AAVEAAHAAF AKWKSTPISA RMRIMLKFQD LVRANLPQIA
KTLTAEQGKT LPDAEGDVFR GLEVVEHACS VGTLQLGEFA ENVAGGVDTY TLRQPLGVCV
GITPFNFPAM IPLWMFPMAI VCGNTFVLKP SEQDPLSTMQ LVELAIEAGV PKGVLNVVHG
GKEVVDALCS HPLVKAISFV GSTAVGTHVY RLGSEHGKRV QSMMGAKNHA VILPDANREQ
TVNALVGAAF GAAGQRCMAT SVAVLVGAAR DWLPDIVAKA KTLKVNAGAE AGTDVGPLVS
RAAKQRVLGL IETGEQEGAR LVLDGRGVSV PGYEHGNFVG PTIFADVRPE MSVYTHEIFG
PVLCVMSVDT LDEAIALVNA NPFGNGVGLF TQSGAAARKF QSEIDIGQVG INIPIPVPVP
FFSFTGSRGS KLGDLGPYGK QVVQFYTQTK TVTARWFDDD ATAGAVNTTI RLH
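
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below is a minimal, self-contained Python illustration of that rule; the `translate` helper and its constants are illustrative names, not part of any database tool, and the codon table is built from the standard NCBI amino acid string for table 11.

```python
# Codon table for NCBI translation table 11, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is always methionine
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "GTGACGCCGG CGTGCCGCTT CGCATCAACC GGCCGGCGGC GCGCCGCCGG CCATAGCCAG"
print(translate(first_row))  # MTPACRFASTGRRRAAGHSQ
```

The printed residues match the first 20 residues of the protein sequence listed above.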