Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0920 |
Symbol | mmsA |
ID | 4885865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 901130 |
End bp | 902731 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640130860 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_001061919 |
Protein GI | 126442999 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGCCGG CGTGCCGCTT CGCATCAACC GGCCGGCGGC GCGCCGCCGG CCATAGCCAG GAGAATCTCA CGATGAACGC AACTCCGTCG TCCCGGAAGG GACATCATGT GCCGACCGTG AAACTGTTGA TCGCCGGCGA ATTCGTCGAA TCCCATGCGA CCGAGTGGCG CGACATCGTC AACCCGGCGA CTCAGGAACT GCTCGCGCGC GTGCCGTTCT CGACCGTGGC CGAAGTCGGC GCGGCCGTCG AGGCCGCGCA TGCCGCGTTC GCGAAATGGA AGAGCACGCC GATCTCCGCG CGCATGCGCA TCATGCTGAA GTTCCAGGAT CTCGTGCGCG CGAACCTGCC GCAGATCGCG AAGACGCTGA CGGCCGAGCA GGGCAAGACG CTGCCCGACG CCGAAGGCGA CGTGTTCCGC GGCCTCGAGG TGGTCGAGCA CGCGTGCTCG GTCGGCACGC TGCAACTGGG CGAGTTCGCG GAGAACGTCG CGGGCGGCGT CGATACGTAC ACGCTGCGCC AGCCGCTCGG CGTGTGCGTC GGCATCACGC CGTTCAACTT CCCCGCGATG ATCCCGCTAT GGATGTTCCC GATGGCGATC GTCTGCGGCA ACACGTTCGT GCTGAAGCCG TCCGAGCAGG ATCCGCTGTC GACGATGCAG CTCGTCGAGC TCGCGATCGA GGCGGGCGTG CCGAAGGGCG TGCTCAACGT CGTGCACGGC GGCAAGGAAG TCGTCGACGC GCTGTGCTCG CATCCGCTCG TGAAGGCGAT TTCGTTCGTC GGCTCGACGG CCGTCGGCAC GCACGTGTAC CGGCTCGGCA GCGAGCACGG CAAGCGCGTG CAATCGATGA TGGGCGCGAA GAACCATGCG GTGATCCTGC CCGATGCGAA CCGCGAGCAG ACGGTGAACG CGCTCGTCGG CGCGGCGTTC GGCGCGGCGG GCCAGCGCTG CATGGCGACT TCGGTCGCGG TGCTCGTCGG CGCGGCGCGC GACTGGCTGC CCGACATCGT CGCGAAAGCG AAGACGCTGA AGGTCAACGC GGGCGCGGAA GCGGGCACCG ACGTCGGCCC CCTGGTGTCG CGCGCGGCGA AGCAGCGGGT GCTCGGCCTC ATCGAGACCG GCGAACAGGA AGGCGCGAGG CTCGTGCTCG ACGGCCGCGG CGTGAGCGTG CCCGGCTATG AGCACGGCAA TTTCGTCGGC CCGACGATCT TCGCGGACGT GAGGCCGGAG ATGTCGGTCT ACACGCATGA AATCTTCGGC CCGGTGCTGT GCGTGATGTC GGTCGACACG CTCGACGAGG CGATCGCGCT CGTCAACGCG AATCCGTTCG GCAACGGCGT CGGCCTGTTC ACGCAGAGCG GCGCGGCCGC GCGCAAGTTC CAGAGCGAGA TCGACATCGG CCAGGTCGGC ATCAACATTC CGATTCCGGT GCCGGTGCCG TTCTTCAGCT TCACGGGCTC GCGCGGCTCG AAGCTCGGCG ATCTCGGCCC GTACGGCAAG CAGGTCGTGC AGTTCTACAC GCAGACGAAG ACCGTCACCG CGCGCTGGTT CGACGACGAT GCGACGGCGG GCGCCGTCAA CACGACGATT CGCCTGCACT GA
|
Protein sequence | MTPACRFAST GRRRAAGHSQ ENLTMNATPS SRKGHHVPTV KLLIAGEFVE SHATEWRDIV NPATQELLAR VPFSTVAEVG AAVEAAHAAF AKWKSTPISA RMRIMLKFQD LVRANLPQIA KTLTAEQGKT LPDAEGDVFR GLEVVEHACS VGTLQLGEFA ENVAGGVDTY TLRQPLGVCV GITPFNFPAM IPLWMFPMAI VCGNTFVLKP SEQDPLSTMQ LVELAIEAGV PKGVLNVVHG GKEVVDALCS HPLVKAISFV GSTAVGTHVY RLGSEHGKRV QSMMGAKNHA VILPDANREQ TVNALVGAAF GAAGQRCMAT SVAVLVGAAR DWLPDIVAKA KTLKVNAGAE AGTDVGPLVS RAAKQRVLGL IETGEQEGAR LVLDGRGVSV PGYEHGNFVG PTIFADVRPE MSVYTHEIFG PVLCVMSVDT LDEAIALVNA NPFGNGVGLF TQSGAAARKF QSEIDIGQVG INIPIPVPVP FFSFTGSRGS KLGDLGPYGK QVVQFYTQTK TVTARWFDDD ATAGAVNTTI RLH
|
| |