Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0828 |
Symbol | mmsA |
ID | 4906169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 826080 |
End bp | 827609 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640143934 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_001074864 |
Protein GI | 126455515 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.520502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAA CTCCGTCGTC CCGGAAGGGA CATCACGTGC CGACCGTGAA ACTGTTGATC GCCGGCGAAT TCGTCGAATC CCATGCGACC GAGTGGCGCG ACATCGTCAA CCCGGCGACT CAGGAACTGC TCGCGCGCGT GCCGTTCTCG ACCGTGGCCG AAGTCGGCGC GGCCGTCGAG GCCGCGCATG CCGCGTTCGC GAAATGGAAG AGCACGCCGA TCTCCGCGCG CATGCGCATC ATGCTGAAGT TCCAGGATCT CGTGCGCGCG AACCTGCCGC AGATCGCGAA GACGCTGACG GCCGAGCAGG GCAAGACGCT GCCCGACGCC GAAGGCGACG TGTTCCGCGG CCTCGAGGTG GTCGAGCACG CGTGCTCGGT CGGCACGCTG CAACTGGGCG AGTTCGCGGA GAACGTCGCG GGCGGCGTCG ATACGTACAC GCTGCGCCAG CCGCTCGGCG TGTGCGTCGG CATCACGCCG TTCAACTTCC CCGCGATGAT CCCGCTATGG ATGTTCCCGA TGGCGATCGT CTGCGGCAAC ACGTTCGTGC TGAAGCCGTC CGAGCAGGAT CCGCTGTCGA CGATGCAGCT CGTCGAGCTC GCGATCGAGG CGGGCGTGCC GAAGGGCGTG CTCAACGTCG TGCACGGCGG CAAGGAAGTC GTCGACGCGC TGTGCTCGCA TCCGCTCGTG AAGGCGATTT CGTTCGTCGG CTCGACGGCC GTCGGCACGC ACGTGTACCG GCTCGGCAGC GAGCACGGCA AGCGCGTGCA ATCGATGATG GGCGCGAAGA ACCATGCGGT GATCCTGCCC GATGCGAACC GCGAGCAGAC GGTGAACGCG CTCGTCGGCG CGGCGTTCGG CGCGGCGGGC CAGCGCTGCA TGGCGACTTC GGTCGCGGTG CTCGTCGGCG CGGCGCGCGA CTGGCTGCCC GACATCGTCG CGAAAGCGAA GACGCTGAAG GTCAACGCGG GCGCGGAAGC GGGCACCGAC GTCGGCCCCC TGGTGTCGCG CGCGGCGAAG CAGCGGGTGC TCGGCCTCAT CGAGACCGGC GAACAGGAAG GCGCGAGGCT CGTGCTCGAC GGCCGCGGCG TGAGCGTGCC CGGCTATGAG CACGGCAATT TCGTCGGCCC GACGATCTTC GCGGACGTGA GGCCGGAGAT GTCGGTCTAC ACGCATGAAA TCTTCGGCCC GGTGCTGTGC GTGATGTCGG TCGACACGCT CGACGAGGCG ATCGCGCTCG TCAACGCGAA TCCGTTCGGC AACGGCGTCG GCCTGTTCAC GCAGAGCGGC GCGGCCGCGC GCAAGTTCCA GAGCGAGATC GACATCGGCC AGGTCGGCAT CAACATTCCG ATTCCGGTGC CGGTGCCGTT CTTCAGCTTC ACGGGCTCGC GCGGCTCGAA GCTCGGCGAT CTCGGCCCGT ACGGCAAGCA GGTCGTGCAG TTCTACACGC AGACGAAGAC CGTCACCGCG CGCTGGTTCG ACGACGATGC GACGGCGGGC GCCGTCAACA CGACGATTCG CCTGCACTGA
|
Protein sequence | MNATPSSRKG HHVPTVKLLI AGEFVESHAT EWRDIVNPAT QELLARVPFS TVAEVGAAVE AAHAAFAKWK STPISARMRI MLKFQDLVRA NLPQIAKTLT AEQGKTLPDA EGDVFRGLEV VEHACSVGTL QLGEFAENVA GGVDTYTLRQ PLGVCVGITP FNFPAMIPLW MFPMAIVCGN TFVLKPSEQD PLSTMQLVEL AIEAGVPKGV LNVVHGGKEV VDALCSHPLV KAISFVGSTA VGTHVYRLGS EHGKRVQSMM GAKNHAVILP DANREQTVNA LVGAAFGAAG QRCMATSVAV LVGAARDWLP DIVAKAKTLK VNAGAEAGTD VGPLVSRAAK QRVLGLIETG EQEGARLVLD GRGVSVPGYE HGNFVGPTIF ADVRPEMSVY THEIFGPVLC VMSVDTLDEA IALVNANPFG NGVGLFTQSG AAARKFQSEI DIGQVGINIP IPVPVPFFSF TGSRGSKLGD LGPYGKQVVQ FYTQTKTVTA RWFDDDATAG AVNTTIRLH
|
| |