Gene Information |
Locus tag | Amuc_0009 |
Symbol | |
ID | 6275237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 11054 |
End bp | 12211 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642612049 |
Product | chorismate mutase |
Protein accession | YP_001876637 |
Protein GI | 187734525 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase; [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 |
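Note: the length fields in this record follow from its coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed here but are not stated by the record itself; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(11054, 12211)
print(length, protein_length_aa(length))  # 1158 385
```

The printed values match the Gene Length and Protein Length fields of this record.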
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAGACCA CGCACCACGG GGAGGACGCC TCTCCGGACA ATCCCCCCAG GCAGCAAAAA AGCGGAAACA CGGATGAACA GACCATCGCC CTGACAAAGG CGCGTCTGGC CATTGACGAG GTGGACGCCC GGATTGTAGA GCTGTTAAAA AAACGCGCCG AATGGGTGCA TGAAGTCGGC CGCATTAAAA AGGAAAAAAA TTCCCCCATC TTCGTTCCCG AACGGGAAAC GGCCCTGCTC AACAAATTGA ACCGCCTGAA TGCGGGCGTG CTGCCGGAAG CCTCCCTCCA GGCTATTTAC CGTGAAATCA TTTCCTGCTC CTTCTTTCTG GAAGGCGGCC TGACCATTGC CTACCTGGGC CCCAAAGGAA CCTGGAGCCA CCAGGCGGCC CTCAAGCAGT TCGGAAAAAG TTGCGAACTC ATTCCGTGCC AGAGCTTCAA GGACGTATTT GACATGGTGG ACCGGGGGAA GGCCCAGTAC GGCGTAGTTC CCGTGGAAAA CTCTTCGGAA GGTTCCGTCA CCGCCGTGAT GGATCTTTTC GTCACCTCTC CCCTCAAAAT CTGCGCCCAA ATCAATCTGA ACATCCGCAA CAGCCTGATG GCGGATATTC CGCGGGAACA CATCCGCATC CTGTATTCCC ACCCCCAGGT TCTCGGCCAG ACGCGGAACT GGATCCAGCG GCATTTCCCA AACGCGGAAC TCGTGGAAAC GTCCTCCACC ACGAAAGCCA GCATTCTTGC CAAGGAGAAC GCCGCCATGG GCGCGGCATC CCTCGGCTGT CCGCTGGCCG CGGAATTGTT CGGCCTGAAC ATCCTGGAAG AAGACGTGCA GGACCAGTCC TGCAACACCA CCCGTTTTGC CGTCATCGGA CGCCAGGAAA CGCAGCCCAG CGGCAGGGAC CGCACCTCCC TGCTCATCCG CATCCAGCAT AAACCCGGCA CCCTGGCGGA AGTAGTCAAC TGCTTCCAGC GGCACAACAA TAACCTGATA CGCATTGAAT CCCGCCCGTC CAAAGTCATC AACTGGGAAT ACGTCTTTTA CATAGATGCC GCCGGCCACA TTCAGGAATC CCCCTTACGG GAAACCCTTC CGGAGCTGGA GCAGCACTGC TCCATGCTGA AGATTCTGGG CAGCTACGCG GATACGGACG TCATTTAA
Protein sequence | METTHHGEDA SPDNPPRQQK SGNTDEQTIA LTKARLAIDE VDARIVELLK KRAEWVHEVG RIKKEKNSPI FVPERETALL NKLNRLNAGV LPEASLQAIY REIISCSFFL EGGLTIAYLG PKGTWSHQAA LKQFGKSCEL IPCQSFKDVF DMVDRGKAQY GVVPVENSSE GSVTAVMDLF VTSPLKICAQ INLNIRNSLM ADIPREHIRI LYSHPQVLGQ TRNWIQRHFP NAELVETSST TKASILAKEN AAMGAASLGC PLAAELFGLN ILEEDVQDQS CNTTRFAVIG RQETQPSGRD RTSLLIRIQH KPGTLAEVVN CFQRHNNNLI RIESRPSKVI NWEYVFYIDA AGHIQESPLR ETLPELEQHC SMLKILGSYA DTDVI
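Note: the gene and protein sequences above can be cross-checked with a short script. The following is a minimal sketch, not the database's own method: it strips the spaces that group the sequence into blocks of ten, computes GC content, and translates with the standard codon assignments. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGGAGACCA CGCACCACGG GGAGGACGCC"))  # METTHHGEDA
print(translate("GATACGGACG TCATTTAA"))              # DTDVI
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the Protein sequence and GC content fields of this record.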