Gene BTH_I3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I3333 
SymbolmmsA-1 
ID3847024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp3796774 
End bp3798300 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content69% 
IMG OID637842999 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_443825 
Protein GI83718489 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGCA GTACCCATTC GAACGATTCG CGCGTGCGCG CACTGACCCA CTTCATCGGC 
GGCCGCGCGC TCGACGGCGC GAGCGACCGT TACGGCGACG TGTTCGATCC GGCGCTCGGC
AAGGTGACGG CCCGCGTGCC GCTCGCGAGC GGCGCGGAAG TCGAGGCGGC CGTCGCGGCC
GCGGCCGCCG CGTTCCCCGC GTGGAGCGAG ACGTCGCCGC TCAAGCGCGC GCGCGTGATG
TTCAAGTTCA AGGAACTGCT CGACCGCCAT CACGACGAGC TCGCCGAGCT GATCACCCGC
GAGCACGGCA AGGTGTTCTC GGACGCGAAG GGCGAAGTGA TGCGCGGGAT CGAAGTCGTC
GAGTTCGCAT GCGGGATTCC GAATCTGCTG AAGACCGACT TCACCGACCA GATCGGCGGC
GGCATCGACA ACTGGAACCT GCGGCAGCCG CTTGGCGTCG TCGCCGGGAT CACGCCGTTC
AACTTTCCGA TGATGGTGCC GTGCTGGATG TTCCCGGTCG CGATCGCGTG CGGCAACACG
TTCGTGCTGA AGCCGTCCGA GCGAGATCCG TCGGCGTCGA TCCGGCTCGC CGAGCTGCTG
AAGGAAGCGG GGCTGCCCGA CGGCGTGTTC AACGTCGTGC ACGGCGACAA GACGGCTGTC
GACGCGCTGA TCGCGCACCC GGACGTCGCC GCGCTGTCGT TCGTCGGCTC GACGCCGATC
GCCGAATATA TTCATACCGA AGCCGCGCGG CGCGGCAAGC GCGTGCAGGC GCTCGGCGGC
GCGAAGAACC ATCTCGTCGT GATGCCGGAC GCGAACCTCG ATCAGGCGGT CGATGCGCTC
GTCGGCGCGG CGTACGGCTC GGCGGGCGAG CGCTGCATGG CGATCTCCGT CGCCGTGGCG
GTGGGCGGCG TCGCCGACGC GCTCGTCGAG CGGCTCGCCG AGCGCGCGAG GACGCTGAAG
ATCGGCAACG GGATGCAATC CGACGTCGAC ATGGGGCCGC TCGTGACGGC CGCGCATCGC
GCGAAGGTGT CCGCGTACAT CGACGCGGGC GTCGCGGCGG GCGCGAGGCT CGTCGTCGAC
GGGCGCAAGC ATGTCGTCGA CGGCTGCGAG AACGGCTTCT TCCTCGGCGG CACGCTGTTC
GACGACGTGA CGACCGACAT GTCGATCTAC CGCGAGGAGA TTTTCGGGCC GGTGCTGGCG
GTCGTGCGGG TGCCGGATTT CGCGAGCGCG GTCGAGCTCA TCAACGCGCA CGAGTTCGCG
AACGGCGTGT CGTGCTTCAC GTCCGACGGC GGCATCGCGC GCGCGTTCGC ACGGAAGATT
CAGGTCGGGA TGGTCGGCAT CAACGTGCCG ATCCCGGTGC CGATGGCGTG GCATTCGTTC
GGCGGCTGGA AGCGCTCGCT GTTCGGCGAT CACCACGCAT ACGGCGAGGA GGGCGTGCGC
TTCTACACGC GCTACAAGAG CGTGATGCAG CGCTGGCCGG ACAGCATCGC GAAGGGCGCG
GAGTTCGCGA TGCCCGTCGC GAAGTGA
 
Protein sequence
MTGSTHSNDS RVRALTHFIG GRALDGASDR YGDVFDPALG KVTARVPLAS GAEVEAAVAA 
AAAAFPAWSE TSPLKRARVM FKFKELLDRH HDELAELITR EHGKVFSDAK GEVMRGIEVV
EFACGIPNLL KTDFTDQIGG GIDNWNLRQP LGVVAGITPF NFPMMVPCWM FPVAIACGNT
FVLKPSERDP SASIRLAELL KEAGLPDGVF NVVHGDKTAV DALIAHPDVA ALSFVGSTPI
AEYIHTEAAR RGKRVQALGG AKNHLVVMPD ANLDQAVDAL VGAAYGSAGE RCMAISVAVA
VGGVADALVE RLAERARTLK IGNGMQSDVD MGPLVTAAHR AKVSAYIDAG VAAGARLVVD
GRKHVVDGCE NGFFLGGTLF DDVTTDMSIY REEIFGPVLA VVRVPDFASA VELINAHEFA
NGVSCFTSDG GIARAFARKI QVGMVGINVP IPVPMAWHSF GGWKRSLFGD HHAYGEEGVR
FYTRYKSVMQ RWPDSIAKGA EFAMPVAK