Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3996 |
Symbol | mmsA |
ID | 4881688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3900206 |
End bp | 3901732 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640129924 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_001060989 |
Protein GI | 126442028 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGCA GCACCCATTC GAACGATTCG CGCGTGCGCG CACTGGCCCA TTTCATCGGC GGGCGCGCGC TCGACGGCGC GAGCGACCGT TACGGCGACG TGTTCGACCC GGCCCTCGGC ACGGTGACGG CGCGCGTGCC GCTCGCGAGC GGCGCGGAAG TCGATGCGGC CGTCGCCGCC GCGGCCGCCG CGTTCCCCGC GTGGAGCGAG ACCTCGCCGC TCAAGCGCGC GCGCGTGATG TTCAAGTTCA AGGAGCTGCT CGACCGCCAT CACGACGAGC TCGCCGAGCT GATCACCCGC GAGCACGGCA AGGTGTTCCC GGATGCGAAG GGCGAGGTGA TGCGCGGAAT CGAGGTGGTC GAATTCGCGT GCGGCATTCC GAATCTGCTG AAGACCGACT TCACCGACCA GATCGGCGGC GGCATCGACA ACTGGAACCT GCGGCAGCCG CTTGGCGTCG TCGCCGGCAT CACGCCGTTC AATTTTCCGA TGATGGTGCC GTGCTGGATG TTTCCGGTGG CGATCGCGTG CGGCAACACG TTCGTGCTCA AGCCTTCCGA GCGCGATCCG TCGGCGTCGA TCCGGCTCGC CGAGCTGCTG AAGGAAGCGG GGCTGCCCGA CGGCGTGTTC AACGTCGTGC ACGGCGACAA GACGGCCGTC GACGCGCTGA TCGCGCATCC GGACGTGGCC GCGCTGTCGT TCGTCGGCTC GACGCCGATC GCCGAGTACA TTCACACGCA AGCCGCGCGC CGCGGCAAGC GCGTGCAGGC GCTCGGCGGC GCGAAGAACC ATCTCGTCGT GATGCCGGAC GCGAACCTCG ATCAGGCAGT GGACGCGCTC GTCGGCGCCG CGTACGGCTC GGCGGGCGAG CGCTGCATGG CGATTTCCGT CGCGGTCGCG GTGGGCGGCG TCGCCGACGC GCTCGTCGAG CGGCTCGCCG AGCGTGCGAA GGCGCTGAAG ATCGGCAACG GGATGAACGC CGACGTCGAA ATGGGGCCGC TCGTAACGGC CGCGCATCGC GCGAAGGTGT CCGCGTACAT CGACGCCGGC GTCGCGGCGG GCGCGAAGCT CGTCGTCGAC GGGCGCCGGC ACGTCGTCGC CGGCGGCGAG AACGGCTTCT TCCTCGGCGG CACGCTGTTC GATGACGTGA CGACCGACAT GTCGATCTAT CGCGAGGAAA TCTTCGGGCC GGTGCTGGCC GTCGTGCGGG TGCCGGATTT CGCGAGCGCG GTCGAGCTCA TCAACGCGCA CGAGTTCGCC AACGGCGTGT CGTGCTTCAC GTCCGACGGC GGCATCGCGC GCGCGTTTGC GCGGAAGATC CAGGTCGGGA TGGTGGGCAT CAACGTGCCG ATCCCGGTGC CGATGGCGTG GCATTCGTTC GGCGGCTGGA AGCGCTCGCT GTTCGGCGAT CACCACGCAT ACGGCGAGGA GGGCGTGCGT TTCTACACGC GCTACAAGAG CGTGATGCAG CGCTGGCCGG ACAGCATCGC GAAGGGCGCG GAGTTCACGA TGCCTGTCGC GAAGTGA
|
Protein sequence | MTGSTHSNDS RVRALAHFIG GRALDGASDR YGDVFDPALG TVTARVPLAS GAEVDAAVAA AAAAFPAWSE TSPLKRARVM FKFKELLDRH HDELAELITR EHGKVFPDAK GEVMRGIEVV EFACGIPNLL KTDFTDQIGG GIDNWNLRQP LGVVAGITPF NFPMMVPCWM FPVAIACGNT FVLKPSERDP SASIRLAELL KEAGLPDGVF NVVHGDKTAV DALIAHPDVA ALSFVGSTPI AEYIHTQAAR RGKRVQALGG AKNHLVVMPD ANLDQAVDAL VGAAYGSAGE RCMAISVAVA VGGVADALVE RLAERAKALK IGNGMNADVE MGPLVTAAHR AKVSAYIDAG VAAGAKLVVD GRRHVVAGGE NGFFLGGTLF DDVTTDMSIY REEIFGPVLA VVRVPDFASA VELINAHEFA NGVSCFTSDG GIARAFARKI QVGMVGINVP IPVPMAWHSF GGWKRSLFGD HHAYGEEGVR FYTRYKSVMQ RWPDSIAKGA EFTMPVAK
|
| |