Gene Amuc_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1844 
Symbol 
ID6274599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2240665 
End bp2241624 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content57% 
IMG OID642613907 
ProductPDZ/DHR/GLGF domain protein 
Protein accessionYP_001878442 
Protein GI187736330 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.880462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATGA AAACCTGTAT CTTTCTGACA TTGGGATTGC TGGTAGGTTC GGGCTCCGGA 
TATTCCATTG ACCGGCCCGC AGGAAGTACG GACAACCTTC AGCCGCCTCC TCTGACGCCG
TACGCGCATC AGGTGAAGCC GCTCCCCTCT TCCAAACCGG CCAGACTGGG AATTGTTCCG
GGCACGGTGC CCCAAGCGCT CGTTGCGCAG CTGGAGCTTA GTGGATTCCC CGGAGTGCTG
GTGACCAAAG TGATGCCGGA CAGCCCCGCC GCCAAGGCCG GGCTCCAGGA AAATGACGTC
ATGGTCAAGC TGGGGGATGT CTCCCTGTCC GGTCCGCAGT CTGTGACGGA AGCCCTGTCT
GAAAAGGTGC CTGGAGACAG GATTACGGCT GTATTTTACC GGAAAGGAAA GAGGGAGACT
GTTGAAATTA CCCTGGATGG GGGAACGCTT TCCGCTGAAG AAATACTGGC GGCCCAGGGG
GATCCCCGCA CGCAGCCCCG CGCAGTTCCT TCCGTCCGGC GTCAGACGGC GCCTTTTTCC
GGAATGGCTG CACGGCCTAA TCTCCCCCAG CGTATTCTGG ATATGCAGCA GATGATGGAT
GAGTTTTTGA AGGATTCCGC CATGGATGAT TACCGGATGG ACGACATCAT CGGCCGGATG
AACCTGACTC CCGGCGCGGC GCAAATGCTC CGGAGCTTGC AGGGACTTCA TCAAATGCCC
ATGCCTCCCA TGGGCAAGGT TTCCGGAGGG GGCCAGAGCA TGTCTTCCGT CCGGATGTCG
GATGCCAACG GGACTATCGT GGTTTCTTCT AATTCCCGGA CGGGAACCAC AGTTCATGTG
ACGGACTCTG CGGGAAAGGT TCTGTATTCC GGCCCCTACA ATACGCAGGA GGAAAAAGCC
GCCGTGCCGG AAGCCGTCAG GGAACGCTTG AAAAACATAG AAACCAATTT CTGCTTTTAA
 
Protein sequence
MDMKTCIFLT LGLLVGSGSG YSIDRPAGST DNLQPPPLTP YAHQVKPLPS SKPARLGIVP 
GTVPQALVAQ LELSGFPGVL VTKVMPDSPA AKAGLQENDV MVKLGDVSLS GPQSVTEALS
EKVPGDRITA VFYRKGKRET VEITLDGGTL SAEEILAAQG DPRTQPRAVP SVRRQTAPFS
GMAARPNLPQ RILDMQQMMD EFLKDSAMDD YRMDDIIGRM NLTPGAAQML RSLQGLHQMP
MPPMGKVSGG GQSMSSVRMS DANGTIVVSS NSRTGTTVHV TDSAGKVLYS GPYNTQEEKA
AVPEAVRERL KNIETNFCF