Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1010 |
Symbol | mmsA |
ID | 4906151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 978188 |
End bp | 979870 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640144116 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_001075046 |
Protein GI | 126456253 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.291559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCCGGA GCCGCGCGGC AGGCACGCGT TCGGCGCCGG CGCGGCGGAC CCGGCCCGAT CGGCGTCGGT CCGCTTCGAT CCGCGCGGTG CGCGTTCCTC GCGCGCGCGC CGGCGCCGGC GGCTCGGCGC ATCCCACGCA ACATACGAAT TCAGGAAACA CCGCGATGAA ACACGACAGC AACGTCACTT CCCCCCTCGG CCATCTGATC GACGGCAAGC GCGTCGACGG CGGCGAGCGC GTCCAGCCCG TGTTCGATCC GGCGACGGGC GCATCGACGA AGCGCGTCCG CATGGCCGAC CGCCCGAGCG TCGAGGCGGC GATCGCCGCC GCGCAGGCCG CGTATCCGGC CTGGCGCAAC ACGCCGCCGC TCAAGCGCGC GCGGGTGATG AGCCGCTTCA AGACGCTGCT CGAGGAGCAT GCGAACGAGC TGTGCGCGCT GATCACGGCC GAGCACGGCA AGGTGCTCGC CGATGCGATG GGCGAGTTGC AGCGCGGGAT CGAGAACGTC GAGTACGCGA GCTACGCGCC CGAGCTGCTC AAGGGCGAGC ACAGCAAGAA CGTCGGCCCG GCGATCGACT CGTGGAGCGA GTTCCAGGCG CTCGGCGTGG TGGCGGGGAT CACGCCGTTC AATTTCCCGA TCATGGTGCC GCTGTGGATG TGGCCGATGG CCGTCGCGTG CGGCAACACG TTCGTGCTGA AGCCTTCCGA GCGCACGCCG TCGTCGACGC TGCGCATGGC CGAGCTCGCG CTCGAAGCGT GCCTGCCGCC GGGCGTGCTG AACGTCGTGA ACGGCGACAA GGAAGCCGTC GACACGATCC TGACCGATTC GCGCGTGAAG GCGGTGAGCT TCGTCGGCTC GACGCCGATC GCCGAGTCCA TCTACACGAC GGGCTGCGCG CACGGCAAGC GCGTGCAGGC GCTGGGCGGC GCGAAGAACT TCGCGATCGT GATGCCGGAC GCCGACATCG GCAACGCGGT CAACGCGCTG ATGGGCGCGG CGTACGGTTC GTGCGGCGAG CGGTGCATGG CGATTCCGCT CGTCGTCGCG ATCGGCGACG ACACGGCGGA GCAGGTCGTC GACGGCCTGA AGGCCGAGAT CGCGAAGATG AAGGTCGGTC CGGGCACGGG CGAGCAGGTC GACATGGGGC CGCTCGTCAC GCGGCAGCAC TTCGAGAAGG TGACGGGCTT CGTCGAGGCG GGCATCGCCG CGGGCGCGAC GCTCGTCGTC GACGGGCGCG GCGTGAAGGT GGACGGCCAC GAAGGCGGCT ATTACCTCGG CCCGTGCCTG TTCGACCACG TGAAGCCCGG CATGCCGATC TATCAGCACG AGATCTTCGG GCCGGTGCTG GGCGTCGTGC GCGTCGCGTC GCTCGCCGAG GCGATGGCGC TCGTCGACGC GCACGAGTAC GGCAACGGCA CGTGCCTCTT CACGCGCGAC GGCGAGGCCG CGCGCTTTTT CAGCGACAAC ATCCAGGTCG GGATGGTCGG CATCAACGTG CCGCTGCCCG TGCCCGTTGC TTATCACTCG TTCGGCGGCT GGAAGCGCTC GCTGTTCGGC GATCTGCACG CATACGGACC GGACGCGGTG CGCTTCTATA CGAAGCGCAA GACGATCACG CAGCGCTGGC CGTCGGCCGG CGTGCGCGAG GGGACGGTGT TCAGCTTCCC GTCGAGCCGC TGA
|
Protein sequence | MRRSRAAGTR SAPARRTRPD RRRSASIRAV RVPRARAGAG GSAHPTQHTN SGNTAMKHDS NVTSPLGHLI DGKRVDGGER VQPVFDPATG ASTKRVRMAD RPSVEAAIAA AQAAYPAWRN TPPLKRARVM SRFKTLLEEH ANELCALITA EHGKVLADAM GELQRGIENV EYASYAPELL KGEHSKNVGP AIDSWSEFQA LGVVAGITPF NFPIMVPLWM WPMAVACGNT FVLKPSERTP SSTLRMAELA LEACLPPGVL NVVNGDKEAV DTILTDSRVK AVSFVGSTPI AESIYTTGCA HGKRVQALGG AKNFAIVMPD ADIGNAVNAL MGAAYGSCGE RCMAIPLVVA IGDDTAEQVV DGLKAEIAKM KVGPGTGEQV DMGPLVTRQH FEKVTGFVEA GIAAGATLVV DGRGVKVDGH EGGYYLGPCL FDHVKPGMPI YQHEIFGPVL GVVRVASLAE AMALVDAHEY GNGTCLFTRD GEAARFFSDN IQVGMVGINV PLPVPVAYHS FGGWKRSLFG DLHAYGPDAV RFYTKRKTIT QRWPSAGVRE GTVFSFPSSR
|
| |