Gene Information |
Locus tag | SeAg_B4703 |
Symbol | mmsA |
ID | 6796989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 4596271 |
End bp | 4597776 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642778776 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_002149338 |
Protein GI | 197249186 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
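
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(4596271, 4597776)
print(length, protein_length_aa(length))  # prints: 1506 501
```

The strand field does not affect this calculation: the length is the same whether the gene lies on the forward or the reverse strand.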
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000699732 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACAG TCGGCAATTT TATTCATGGT AAAACGACAC TGAGCAGCAG CGGCGAGACG CTACCGGTGA CTAATCCGGC CACCGGTAAA GTAATTCGTC AGGTCACCCA GAGCACCCGA GAGGAGATGC TGGCGGCAAT TCAAAGCGCA CATGAGGCTT TTCCGGCATG GAGTAAAATG ACTCCGCTAC GCCGGGCACG TATCCTTTTT GAATTCAAAG TATTACTGGA AAAGCATCGG GACGAACTGG CGGCGCTGAT CGTCAGCGAG CATGGCAAGG TCTGGTCAGA TGCGCTTGGC GAACTGACGC GCGGCATAGA GGTTGTCGAA TTTGCCTGCG GGATCCCGCA CCTCAGCAAA GGAGAGTACT CCTTTAACGT CGGCTCAGGC GTTGACAGCT TTTCATTAAT GCAGCCGCTT GGGGTTGTGG CAGGCATCAC ACCGTTTAAC TTCCCGGCAA TGGTGCCTAT GTGGATGTTC CCGGTTGCTC TGGCTTGCGG CAATACCTTC GTCCTGAAGC CGCCTGCACT GGTGCCGTCC GCATCGTTAC GTATGGCGCA GCTTCTGCAG GAGGCTGGCC TGCCGGACGG CGTGTTTAAC GTTGTTCACT GTGGCAATGA GGCGGCTAGC CTGTTGACCA GCGACCCTCG CGTACAGGCC GTCAGCTTTG TCGGTTCCTC CGCCGTCGCG GAACATATCT ATACCACCGC CAGCGCCCAT GGCAAACGCG TACAGGCCTT CGGGGCCGCC AAAAATCACG CAATCGTCAT GCCGGATGCA GATCTGGACG CCACCGTCAA CGCCATTATG GGCGGAGCAT TTGGTTCCGC GGGCGAACGC TGTATGGCGC TGCCGGTAGT GGTCGCGGTG GGCGATGAAA CGGCGGACCG TCTGATCGAA CGCCTGAAAC CGCTGATTGC GGCGCTACGT ATCGGCCCGG GAGAGCTGCG CGGCAAAGAT GAAAATGAAA TGGGCCCGGT TGTTTCACGT GCTCATCAAC AGAAAGTGTT GGGTTACATC GACAAAGGAG TTAGCGAGGG AGCAACTCTG GTCATGGATG GCCGCAACTA TAGCGTGGCG GGATATCCTG AAGGCTTTTA TGTCGGCGGC ACTCTGTTTG ATAACGTCAC GCCTGAGATG ACCATCTGGC GTGAAGAAAT TTTTGGCCCG GTTCTCGGAA TTGTGCGCGT CCCGGATTAT GCCACCGCCA TTAGCACGGT AAACTCCCAT GAATTCGGCA ACGGCAGCGT GATTTTCACC ACCAACGGTC ACTATGCGCG CGAATTCGCC CAGTCGGTTG AAGCTGGGAT GGTCGGTATT AATATTCCAG TTCCGGTTCC GATGGCATTT CACTCTTTTG GCGGCTGGAA GCGCTCGGTG TTTGGCGCCC TGAACGTCCA TGGGCCGGAT GGCGTGCGCT TCTATACCCG GATGAAAACC GTTACCTCGC GTTGGCCGAA CGGACAGCAG ATCGTATCGG AGTTCAGTAT GCCGACTCTC GGTTAA
|
Protein sequence | METVGNFIHG KTTLSSSGET LPVTNPATGK VIRQVTQSTR EEMLAAIQSA HEAFPAWSKM TPLRRARILF EFKVLLEKHR DELAALIVSE HGKVWSDALG ELTRGIEVVE FACGIPHLSK GEYSFNVGSG VDSFSLMQPL GVVAGITPFN FPAMVPMWMF PVALACGNTF VLKPPALVPS ASLRMAQLLQ EAGLPDGVFN VVHCGNEAAS LLTSDPRVQA VSFVGSSAVA EHIYTTASAH GKRVQAFGAA KNHAIVMPDA DLDATVNAIM GGAFGSAGER CMALPVVVAV GDETADRLIE RLKPLIAALR IGPGELRGKD ENEMGPVVSR AHQQKVLGYI DKGVSEGATL VMDGRNYSVA GYPEGFYVGG TLFDNVTPEM TIWREEIFGP VLGIVRVPDY ATAISTVNSH EFGNGSVIFT TNGHYAREFA QSVEAGMVGI NIPVPVPMAF HSFGGWKRSV FGALNVHGPD GVRFYTRMKT VTSRWPNGQQ IVSEFSMPTL G
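
The relationship between the two sequences above can be sketched in a few lines of Python. The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene is on the minus strand; the sketch assumes this and does no reverse-complementing. Translation table 11 assigns the same amino acid to each codon as the standard code and differs only in the set of permitted start codons; because this gene starts with ATG, a plain standard-code lookup is assumed to be sufficient here. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGGAAACAG TCGGCAATTT TATTCATGGT"))  # prints: METVGNFIHG
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to check the record; `gc_percent` on the full gene sequence can be compared with the GC content field in the same way.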