Gene BURPS1106A_4070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_4070 
SymbolmmsA 
ID4902279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3975717 
End bp3977243 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content69% 
IMG OID640137296 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001068289 
Protein GI126452315 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGCA GCACCCATTC GAACGATTCG CGCGTGCGCG CACTGGCCCA TTTCATCGGC 
GGGCGCGCGC TCGACGGCGC GAGCGACCGT TACGGCGACG TGTTCGATCC GGCCCTCGGC
ACGGTGACGG CGCGCGTGCC GCTCGCGAGC GGCGCGGAAG TCGATGCGGC CGTCGCCGCC
GCGGCCGCCG CGTTCCCCGC GTGGAGCGAG ACCTCGCCGC TCAAGCGCGC GCGCGTGATG
TTCAAGTTCA AGGAGCTGCT CGACCGCCAT CACGACGAGC TCGCCGAGCT GATCACCCGC
GAGCACGGCA AGGTGTTCTC GGATGCGAAG GGCGAGGTGA TGCGCGGAAT CGAGGTGGTC
GAATTCGCGT GCGGCATTCC GAATCTGCTG AAGACCGACT TCACCGACCA GATCGGCGGC
GGCATCGACA ACTGGAACCT GCGGCAGCCG CTTGGCGTCG TCGCCGGCAT CACGCCGTTC
AATTTTCCGA TGATGGTGCC GTGCTGGATG TTTCCGGTGG CGATCGCGTG CGGCAACACG
TTCGTGCTCA AGCCTTCCGA GCGCGATCCG TCGGCGTCGA TCCGGCTCGC CGAGCTGCTG
AAGGAAGCGG GGCTGCCCGA CGGCGTGTTC AACGTCGTGC ACGGCGACAA GACGGCCGTC
GACGCGCTGA TCGCGCATCC GGACGTGGCC GCGCTGTCGT TCGTCGGCTC GACGCCGATC
GCCGAGTACA TTCACACGCA AGCCGCGCGC CGCGGCAAGC GCGTGCAGGC GCTCGGCGGC
GCGAAGAACC ATCTCGTCGT GATGCCGGAC GCGAACCTCG ATCAGGCAGT GGACGCGCTC
GTCGGCGCCG CGTACGGCTC GGCGGGCGAG CGCTGCATGG CGATTTCCGT CGCGGTCGCG
GTGGGCGGCG TCGCCGACGC GCTCGTCGAG CGGCTCGCCG AGCGTGCGAA GGCGCTGAAG
ATCGGCAACG GGATGAACGC CGACGTCGAA ATGGGGCCGC TCGTGACGGC CGCGCATCGC
GCGAAGGTGT CCGCGTACAT CGACGCCGGC GTCGCGGCGG GCGCGAAGCT CGTCGTCGAC
GGGCGCCGGC ACGTCGTCGC CGGCGGCGAG AACGGCTTCT TCCTCGGCGG CACGCTGTTC
GACGACGTGA CGACCGACAT GTCGATCTAT CGCGAGGAAA TCTTCGGGCC GGTGCTGGCC
GTCGTGCGGG TGCCGGATTT CGCGAGCGCG GTCGAGCTCA TCAACGCGCA CGAGTTCGCC
AACGGCGTGT CGTGCTTCAC GTCCGACGGC GGCATCGCGC GCGCGTTCGC GCGGAAGATC
CAGGTCGGGA TGGTGGGCAT CAACGTGCCG ATCCCGGTGC CGATGGCGTG GCATTCGTTC
GGCGGCTGGA AGCGCTCGCT GTTCGGCGAT CACCACGCAT ACGGCGAGGA GGGCGTGCGT
TTCTACACGC GCTACAAGAG CGTGATGCAG CGCTGGCCGG ACAGCATCGC GAAGGGCGCG
GAGTTCACGA TGCCTGTCGC GAAGTGA
 
Protein sequence
MTGSTHSNDS RVRALAHFIG GRALDGASDR YGDVFDPALG TVTARVPLAS GAEVDAAVAA 
AAAAFPAWSE TSPLKRARVM FKFKELLDRH HDELAELITR EHGKVFSDAK GEVMRGIEVV
EFACGIPNLL KTDFTDQIGG GIDNWNLRQP LGVVAGITPF NFPMMVPCWM FPVAIACGNT
FVLKPSERDP SASIRLAELL KEAGLPDGVF NVVHGDKTAV DALIAHPDVA ALSFVGSTPI
AEYIHTQAAR RGKRVQALGG AKNHLVVMPD ANLDQAVDAL VGAAYGSAGE RCMAISVAVA
VGGVADALVE RLAERAKALK IGNGMNADVE MGPLVTAAHR AKVSAYIDAG VAAGAKLVVD
GRRHVVAGGE NGFFLGGTLF DDVTTDMSIY REEIFGPVLA VVRVPDFASA VELINAHEFA
NGVSCFTSDG GIARAFARKI QVGMVGINVP IPVPMAWHSF GGWKRSLFGD HHAYGEEGVR
FYTRYKSVMQ RWPDSIAKGA EFTMPVAK