Gene Amuc_1894 details


Gene Information

Locus tag: Amuc_1894
Symbol:
ID: 6273816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2299826
End bp: 2301133
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 61%
IMG OID: 642613955
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_001878489
Protein GI: 187736377
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
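
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2299826
end_bp = 2301133

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.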


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.550183
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTTC ATTCCCATTC CATTTCCTCC CTGCAAGGGG CACTGACCGT GCCCGGAGAC 
AAAAGCATTT CCCACCGCGC CGCCATTCTG GGAGGCCTGG CGGAAGGAGT GACGGAAGTG
GATAATTTTC TATGCAGTGA AGATTGCCTG AATACCCTGC GCGCCATGGA GCAGCTAGGC
GCCAAAGTGG ATGTGCTGGA AGAACGGCAG GGATACGGCC CCGTCCGCTT CCGAATTACG
GGCGTCGCCA TGAGCCCCAA GGCTCCTGAA CGGCCCATTG ACTGCGGCAA TTCCGGCACG
GGCATGAGGC TGCTGGCCGG GATGCTGGCC GCCTGCCCCT TTGATTCGGA AATGTTTGGG
GACGCTTCCC TGAGCTCCCG CCCCATGGGC CGCATCATGC AGCCTCTGGA ACAGATGGGA
GCGCGGATTG AAGCCCGGGG AGCCAAACCG GGCTGTGCTC CGCTGAGCAT TCACGGCGGG
CGGGTGCACC CCATTTCCTA CACGCTTCCC ATGGCCAGCG CCCAGGTTAA AAGCGCCATT
CTATTGGCGG GCATGTTTGC GGACGGAACC ACCACCGTGC GCCAGCCGGC CGTCACCCGC
GACCATACAG AACGCCTGTT CCGCCATTTC GGCGTTCCCT GCACCGTGGA TGGACTTACC
GTCGGAACCT GCGGGCCAGC TCTCCCCGTC GCCCATGATT TGACAGTCCC GGCAGATATT
TCCTCCGCAG CCTTCTGGAT GGTGGCCGCC GCCAGCCGTC CGGGTTCCCG CCTGACGTTG
CGCCAGGTAG GGCTGAACAA GACGCGGAAC GCCGTCATCA GCGCTCTGCA GCGGATGGGC
GCGCGAATGG ACATCGTGCC CACTTCTCCG GAAGATGCCG GCGAACCATA CGGCGATATT
ACCGTGTACG GTTCGGATTC CCTGCACGGC ACCAGCCTCC TCCCGGAAGA AATACCCAAC
CTCATTGACG AGATACCTAT CCTGGCCGTA GCGGGAGCCC TGGGCCGGGG AGACTTCATC
GTCCGAAACG CCCGCGAATT ACGCGTCAAG GAAACGGACC GCATCGCCAC CACGGCGGCC
AACCTCCGGC TCATGGGCGT GGATGTGGAA GAATTTGACG ATGGCATGGT TGTTCACGGC
GGCACTCCCC TGAAGGGAAC GGAATTATCC AGTTATGGGG ACCACCGCAT CGCCATGAGC
TTCCTGGTGG CCGGACTCAG CGCACAGGGA GAAACCGTAG TGACGGATGC GGAATGCATC
AATACGTCCT ATCCCGGTTT TGAACGGGAT CTGGCTCAGT TCCTGTAA
 
Protein sequence
MNLHSHSISS LQGALTVPGD KSISHRAAIL GGLAEGVTEV DNFLCSEDCL NTLRAMEQLG 
AKVDVLEERQ GYGPVRFRIT GVAMSPKAPE RPIDCGNSGT GMRLLAGMLA ACPFDSEMFG
DASLSSRPMG RIMQPLEQMG ARIEARGAKP GCAPLSIHGG RVHPISYTLP MASAQVKSAI
LLAGMFADGT TTVRQPAVTR DHTERLFRHF GVPCTVDGLT VGTCGPALPV AHDLTVPADI
SSAAFWMVAA ASRPGSRLTL RQVGLNKTRN AVISALQRMG ARMDIVPTSP EDAGEPYGDI
TVYGSDSLHG TSLLPEEIPN LIDEIPILAV AGALGRGDFI VRNARELRVK ETDRIATTAA
NLRLMGVDVE EFDDGMVVHG GTPLKGTELS SYGDHRIAMS FLVAGLSAQG ETVVTDAECI
NTSYPGFERD LAQFL
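
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that block formatting and translate the DNA, so the two listings can be compared. It is an illustration, not the method used to produce this record. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ in which codons may act as start codons), and the helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGAACCTTC ATTCCCATTC CATTTCCTCC CTGCAAGGGG CACTGACCGT GCCCGGAGAC"
last_line = "AATACGTCCT ATCCCGGTTT TGAACGGGAT CTGGCTCAGT TCCTGTAA"

print(translate(clean(first_line)))
print(translate(clean(last_line)))
```

Because every full line of the gene sequence holds 60 bases (20 codons), each line stays in reading frame and can be translated on its own. The two translations can be compared with the start and end of the protein sequence listed above.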