Gene Amuc_1897 details


Gene Information

Locus tag: Amuc_1897
Symbol:
ID: 6273687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2303821
End bp: 2304906
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 63%
IMG OID: 642613958
Product: chorismate synthase
Protein accession: YP_001878492
Protein GI: 187736380
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
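
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2303821, 2304906)
residues = protein_length_aa(length)
print(length, residues)  # 1086 361
```

Under these assumptions the result agrees with the listed Gene Length (1086 bp) and Protein Length (361 aa).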


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGCA GTTTTGGTCA GGTGTTCAGA ATTTCTACCT GGGGTGAATC CCATGGGACT 
GGGGTAGGCG TGGTGATTGA TGGTTGCCCG TCCCTCGTCC CGGTGACGGA AGAAGACATT
CAGCGGGAGC TGGACCGGCG CAGGCCGGGG CAGAGCGACA TCGTAACCCC CCGCAGGGAG
GAAGACCGCG CGGAAATCCT TTCCGGAGTG CTGGACGGCA AAACCCTGGG AACGCCTATC
GCCATCAGTG TCCGGAACAA GGACCACCGC TCTTCCGCCT ATGACGAGAT GGCCAGAACG
TACCGGCCCT CCCACGCGGA CTATACATAC GACGCTAAAT ACGGCATTCG CGCCTGGGCG
GGCGGGGGCC GGGCCTCCGC ACGGGAAACC ATCGGCCGCG TCGCAGCCGG AGCGGTGGCC
AGGGCCGTGC TGAAGCAGGC TTTCCCCGAT ATGGAGGTCG TGGCCTGGGT GGATCAGGTT
CACCATGTGA AAGCTTCCGT GGACTGGGGA GCCGTGACGG CCTCTGCCAT TGAGAGCAAC
ATCGTCCGTA CGGCGGACCC CTCCGCTGCG GAAGCCATGA TCGCTGCCAT CAAGGAAGCT
CGTGACTCCG GAAACTCCTT GGGCGGCGTG GTCAAATGCG TGGTGCGCGG CTGCCCTCCC
GGACTGGGTG ATCCGGTTTT TGACAAGCTG GACGCTACGC TTGCCCACGC CATGATGAGC
ATTCCCGCCA CCAAGGCTTT CGCCGTGGGT TCCGGTTTTG AAGCGGCGGA CATGACCGGC
TTGGAACATA ATGACCCTTT TTACATGCAG GGCTGCCGGG TGCGTACTAC CACCAACCAC
TCCGGCGGTA TTCAGGGCGG CATCTCCAAC GGAGAGGACA TTCTGATGCG CATCGGCTTC
AAGCCTACGG CCACCTTGAT GATTGACCAG CAGACGGTCA ACAGGGACGG GGAGGATGCC
CGGCTCAAGG GCAGGGGACG GCATGATGCC TGCGTACTGC CGCGCGCCGT GCCCATTGTG
GAGGCCATGG CCTGGCTCTG CCTGTGCGAC CACTACCTGC GCCAACGCTG CCAGAGGGCT
CTGTAA
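
The GC content reported in the gene information can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch: the helper name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G, and T apart from whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First line of the gene sequence only; pass the full sequence text to
# compare against the GC content listed in the record.
first_line = "ATGTCCAGCA GTTTTGGTCA GGTGTTCAGA ATTTCTACCT GGGGTGAATC CCATGGGACT"
print(round(gc_content(first_line), 1))
```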
 
Protein sequence
MSSSFGQVFR ISTWGESHGT GVGVVIDGCP SLVPVTEEDI QRELDRRRPG QSDIVTPRRE 
EDRAEILSGV LDGKTLGTPI AISVRNKDHR SSAYDEMART YRPSHADYTY DAKYGIRAWA
GGGRASARET IGRVAAGAVA RAVLKQAFPD MEVVAWVDQV HHVKASVDWG AVTASAIESN
IVRTADPSAA EAMIAAIKEA RDSGNSLGGV VKCVVRGCPP GLGDPVFDKL DATLAHAMMS
IPATKAFAVG SGFEAADMTG LEHNDPFYMQ GCRVRTTTNH SGGIQGGISN GEDILMRIGF
KPTATLMIDQ QTVNRDGEDA RLKGRGRHDA CVLPRAVPIV EAMAWLCLCD HYLRQRCQRA
L
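
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a plain-Python translator. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCAGCA GTTTTGGTCA GGTGTTCAGA"))  # MSSSFGQVFR
```

The printed residues match the first ten residues of the protein sequence, and the final six bases of the gene (CTGTAA) translate to the terminal L followed by a stop codon.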