Gene Amuc_0009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0009 
Symbol 
ID6275237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp11054 
End bp12211 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content57% 
IMG OID642612049 
Productchorismate mutase 
Protein accessionYP_001876637 
Protein GI187734525 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACCA CGCACCACGG GGAGGACGCC TCTCCGGACA ATCCCCCCAG GCAGCAAAAA 
AGCGGAAACA CGGATGAACA GACCATCGCC CTGACAAAGG CGCGTCTGGC CATTGACGAG
GTGGACGCCC GGATTGTAGA GCTGTTAAAA AAACGCGCCG AATGGGTGCA TGAAGTCGGC
CGCATTAAAA AGGAAAAAAA TTCCCCCATC TTCGTTCCCG AACGGGAAAC GGCCCTGCTC
AACAAATTGA ACCGCCTGAA TGCGGGCGTG CTGCCGGAAG CCTCCCTCCA GGCTATTTAC
CGTGAAATCA TTTCCTGCTC CTTCTTTCTG GAAGGCGGCC TGACCATTGC CTACCTGGGC
CCCAAAGGAA CCTGGAGCCA CCAGGCGGCC CTCAAGCAGT TCGGAAAAAG TTGCGAACTC
ATTCCGTGCC AGAGCTTCAA GGACGTATTT GACATGGTGG ACCGGGGGAA GGCCCAGTAC
GGCGTAGTTC CCGTGGAAAA CTCTTCGGAA GGTTCCGTCA CCGCCGTGAT GGATCTTTTC
GTCACCTCTC CCCTCAAAAT CTGCGCCCAA ATCAATCTGA ACATCCGCAA CAGCCTGATG
GCGGATATTC CGCGGGAACA CATCCGCATC CTGTATTCCC ACCCCCAGGT TCTCGGCCAG
ACGCGGAACT GGATCCAGCG GCATTTCCCA AACGCGGAAC TCGTGGAAAC GTCCTCCACC
ACGAAAGCCA GCATTCTTGC CAAGGAGAAC GCCGCCATGG GCGCGGCATC CCTCGGCTGT
CCGCTGGCCG CGGAATTGTT CGGCCTGAAC ATCCTGGAAG AAGACGTGCA GGACCAGTCC
TGCAACACCA CCCGTTTTGC CGTCATCGGA CGCCAGGAAA CGCAGCCCAG CGGCAGGGAC
CGCACCTCCC TGCTCATCCG CATCCAGCAT AAACCCGGCA CCCTGGCGGA AGTAGTCAAC
TGCTTCCAGC GGCACAACAA TAACCTGATA CGCATTGAAT CCCGCCCGTC CAAAGTCATC
AACTGGGAAT ACGTCTTTTA CATAGATGCC GCCGGCCACA TTCAGGAATC CCCCTTACGG
GAAACCCTTC CGGAGCTGGA GCAGCACTGC TCCATGCTGA AGATTCTGGG CAGCTACGCG
GATACGGACG TCATTTAA
 
Protein sequence
METTHHGEDA SPDNPPRQQK SGNTDEQTIA LTKARLAIDE VDARIVELLK KRAEWVHEVG 
RIKKEKNSPI FVPERETALL NKLNRLNAGV LPEASLQAIY REIISCSFFL EGGLTIAYLG
PKGTWSHQAA LKQFGKSCEL IPCQSFKDVF DMVDRGKAQY GVVPVENSSE GSVTAVMDLF
VTSPLKICAQ INLNIRNSLM ADIPREHIRI LYSHPQVLGQ TRNWIQRHFP NAELVETSST
TKASILAKEN AAMGAASLGC PLAAELFGLN ILEEDVQDQS CNTTRFAVIG RQETQPSGRD
RTSLLIRIQH KPGTLAEVVN CFQRHNNNLI RIESRPSKVI NWEYVFYIDA AGHIQESPLR
ETLPELEQHC SMLKILGSYA DTDVI