Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A2180 |
Symbol | mmsA |
ID | 3693115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 2662847 |
End bp | 2666206 |
Gene Length | 3360 bp |
Protein Length | 1119 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637732433 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_337330 |
Protein GI | 76817249 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCATGGGA TCGGAGTAAG CGCTGAAGCG CTAACTCCGA TCGACAGGAG ACGACACGAT GACGGCACAG GCGTTCCTCG ACGCACGCGA TTTTCTGCTG CGCCACCGCA CCGACTACGA CACCGCCTAT CGCGATTTCA GATGGCCGGC GCTCGACGAA TTCAACTGGG CGCTCGACTA TTTCGACGCG ATCGCGCGCG GCAACGACAA GCCCGCACTA TGGATCGTCG ACGCGGCGTC GGGCGACGGC GCGCGGTACT CGTTCGCGCA GATGTCGGAG CGCTCCGCAC GGATCGCGAA CTGGCTGCGC GAAATCGGTG TCGCACGCGG CGAGCGGATT CTGCTGATGC TGCCGAATCG GGTCGAGCTG TGGGACACGA TGCTCGCGGC GATGAAGCTG GGCGCGGTCG TGCTGCCCGC GACGACGCAA CTGTCCGCCG ACGACGTGCG CGAGCGCGTG CAGATCGGCG GCGCGCGCTA TGCGATCGTC GACGAGCACG AAGCCGAGAA GTTCGAGCAG CCGGGGCTCG ACGTGACGAA GATCGTCGCG GGCGCGCCGC GCGCCGGATG GCTCGCGCTC GCCGACGGCT ATCGCGCGCC GGCCGAGTTC GCGCCCGACG CGCGCACGCG CGCGAGCGAC CCGATGCTGC TGTACTTCAC GTCGGGCACC ACATCGAAGC CGAAGCTCGT CGAGCACACG CACCGCACGT ACCCGGTCGG CAGCCTGTCG ACGATGTACT GGGTCGGCCT GCAGCCCGGC GACATCCACT GGAACATCAG CTCGCCCGGC TGGGCGAAGC ACGCGTGGAG CTGCTTCTAC GCGCCGTGGA ACGCGCAGGC GTGCGTGTTC GCGTTCAACT ACGCCCGCTT CGAGCCGAAG GTGGTGCTCG ATGCGCTCGT CAAGTACCAG GTGACGACGA TGTGCGCGCC GCCGACCGTC TGGCGAATGC TCGTGCAGCA GCCGCTTTCG ACGTTCGCCG TGAAGCTGCG CGAGATCGTC GGCGCGGGCG AGCCGCTCAA CCCCGAGATC ATCGAGCGCG TGAGAAAGGC ATGGGGCGTG ACGATTCGCG ACGGCTACGG GCAGACCGAG ACGACCTGCC TGATCGGCAA TTCGCCGGGC CAGCCGGTCG TGCCGGGCTC GATGGGCCGG CCGCTGCCGG GCTATGCGAT CGCGCTGCTC GATCCCGACG GCGCGCACGC GAGCGAAGGC GAGATCGCGC TGCCGGTCGG CCCGGATGTC GAGCGCCCGG TCGGCCTGAT GAAGGGCTAC GCGAGCAATC CGGAGGCGAC CGCGCACGCG ATGCGCGACG GCCATTACCG GACGTCCGAC ATCGCGCTGC GCCGCGACGA CGGCTATTTC GTCTACGTCG GCCGCGCGGA CGACGTGTTC AAGTCGTCCG ACTACCGGCT GAGCCCGTTC GAGCTCGAGA GCGTGCTGAT CGAGCACGAG GCGATCGCCG AGGCGGCCGT CGTGCCGAGC CCGGACCCGG TGCGGCTGTC GGTGCCGAAG ACCTTCGTCA TGCTGCGCGC GGGCTACGAG CCGAGCGAGA CGCTCGCGCG CGAGATCTTC CGCTTCTCGC GCGAGAAGCT CGCGCCGTAC AAGCGGATTC GGCGGTTGCA GTTCGCCGAG CTGCCGAAGA CGATCTCCGG CAAGATCCGC CGCGTCGAGC TGCGCCGCCG CGAGCTGGAG CGCGGCGACG ACGCGTCGAG CCGGATGCCC GGCGAATACT GGGAAGAGGA TTTCGCGGCC GACGGCAAGT GACGCCGGCG TGCCGCTTCG CATCAACCGG CCGGCGGCGC GCCGCCGGCC ATAGCCAGGA GAATCTCACG ATGAACGCAA CTCCGTCGTC CCGGAAGGGA CATCACGTGC CGACCGTGAA ACTGTTGATC GCCGGCGAAT TCGTCGAATC CCATGCGACC GAGTGGCGCG ACATCGTCAA CCCGGCGACT CAGGAACTGC TCGCACGCGT GCCGTTCTCG ACCGTGGCCG AAGTCGGCGC GGCCGTCGAG GCCGCGCATG CCGCGTTCGC GAAATGGAAG AGCACGCCGA TCTCCGCGCG CATGCGCATC ATGCTGAAGT TCCAGGATCT CGTGCGCGCG AACCTGCCGC AGATCGCGAA GACGCTGACG GCCGAGCAGG GCAAGACGCT GCCCGACGCC GAAGGCGACG TGTTCCGCGG CCTCGAGGTG GTCGAGCACG CGTGCTCGGT CGGCACGCTG CAACTGGGCG AGTTCGCGGA GAACGTCGCG GGCGGCGTCG ATACGTACAC GCTGCGCCAG CCGCTCGGCG TGTGCGTCGG CATCACGCCG TTCAACTTCC CCGCGATGAT CCCGCTATGG ATGTTCCCGA TGGCGATCGT CTGCGGCAAC ACGTTCGTGC TGAAGCCGTC CGAGCAGGAT CCGCTGTCGA CGATGCAGCT CGTCGAGCTC GCGATCGAGG CGGGCGTGCC GAAGGGCGTG CTCAACGTCG TGCACGGCGG CAAGGAAGTC GTCGACGCGC TGTGCTCGCA TCCGCTCGTG AAGGCGATTT CGTTCGTCGG CTCGACGGCC GTCGGCACGC ACGTGTACCG GCTCGGCAGC GAGCACGGCA AGCGCGTGCA ATCGATGATG GGCGCGAAGA ACCATGCGGT GATCCTGCCC GATGCGAACC GCGAGCAGAC GGTGAACGCG CTCGTCGGCG CGGCGTTCGG CGCGGCGGGC CAGCGCTGCA TGGCGACTTC GGTCGCGGTG CTCGTCGGCG CGGCGCGCGA CTGGCTGCCC GACATCGTCG CGAAAGCGAA GACGCTGAAG GTCAACGCGG GCGCGGAAGC GGGCACCGAC GTCGGCCCCC TGGTGTCGCG CGCGGCGAAG CAGCGGGTGC TCGGCCTCAT CGAGACCGGC GAACAGGAAG GCGCGAGGCT CGTGCTCGAC GGCCGCGGCG TGAGCGTGCC CGGCTATGAG CACGGCAATT TCGTCGGCCC GACGATCTTC GCGGACGTGA GGCCGGAGAT GTCGGTCTAC ACGCATGAAA TCTTCGGCCC GGTGCTGTGC GTGATGTCGG TCGACACGCT CGACGAGGCG ATCGCGCTCG TCAACGCGAA TCCGTTCGGC AACGGCGTCG GCCTGTTCAC GCAGAGCGGC GCGGCCGCGC GCAAGTTCCA GAGCGAGATC GACATCGGCC AGGTCGGCAT CAACATTCCG ATTCCGGTGC CGGTGCCGTT CTTCAGCTTC ACGGGCTCGC GCGGCTCGAA GCTCGGCGAT CTCGGCCCGT ACGGCAAGCA GGTCGTGCAG TTCTACACGC AGACGAAGAC CGTCACCGCG CGCTGGTTCG ACGACGATGC GACGGCGGGC GCCGTCAACA CGACGATTCG CCTGCACTGA
|
Protein sequence | MHGIGVSAEA LTPIDRRRHD DGTGVPRRTR FSAAPPHRLR HRLSRFQMAG ARRIQLGARL FRRDRARQRQ ARTMDRRRGV GRRRAVLVRA DVGALRTDRE LAARNRCRTR RADSADAAES GRAVGHDARG DEAGRGRAAR DDATVRRRRA RARADRRRAL CDRRRARSRE VRAAGARRDE DRRGRAARRM ARARRRLSRA GRVRARRAHA RERPDAAVLH VGHHIEAEAR RAHAPHVPGR QPVDDVLGRP AARRHPLEHQ LARLGEARVE LLLRAVERAG VRVRVQLRPL RAEGGARCAR QVPGDDDVRA ADRLANARAA AAFDVRREAA RDRRRGRAAQ PRDHRAREKG MGRDDSRRLR ADRDDLPDRQ FAGPAGRAGL DGPAAAGLCD RAARSRRRAR ERRRDRAAGR PGCRAPGRPD EGLREQSGGD RARDARRPLP DVRHRAAPRR RLFRLRRPRG RRVQVVRLPA EPVRARERAD RARGDRRGGR RAEPGPGAAV GAEDLRHAAR GLRAERDARA RDLPLLAREA RAVQADSAVA VRRAAEDDLR QDPPRRAAPP RAGARRRRVE PDARRILGRG FRGRRQVTPA CRFASTGRRR AAGHSQENLT MNATPSSRKG HHVPTVKLLI AGEFVESHAT EWRDIVNPAT QELLARVPFS TVAEVGAAVE AAHAAFAKWK STPISARMRI MLKFQDLVRA NLPQIAKTLT AEQGKTLPDA EGDVFRGLEV VEHACSVGTL QLGEFAENVA GGVDTYTLRQ PLGVCVGITP FNFPAMIPLW MFPMAIVCGN TFVLKPSEQD PLSTMQLVEL AIEAGVPKGV LNVVHGGKEV VDALCSHPLV KAISFVGSTA VGTHVYRLGS EHGKRVQSMM GAKNHAVILP DANREQTVNA LVGAAFGAAG QRCMATSVAV LVGAARDWLP DIVAKAKTLK VNAGAEAGTD VGPLVSRAAK QRVLGLIETG EQEGARLVLD GRGVSVPGYE HGNFVGPTIF ADVRPEMSVY THEIFGPVLC VMSVDTLDEA IALVNANPFG NGVGLFTQSG AAARKFQSEI DIGQVGINIP IPVPVPFFSF TGSRGSKLGD LGPYGKQVVQ FYTQTKTVTA RWFDDDATAG AVNTTIRLH
|
| |