Gene Amuc_1676 details

Gene Information

Locus tag: Amuc_1676
Symbol:
ID: 6274450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2031102
End bp: 2033099
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 52%
IMG OID: 642613735
Product: Eco57I restriction endonuclease
Protein accession: YP_001878275
Protein GI: 187736163
COG category:
COG ID:
TIGRFAM ID:
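
The Start bp, End bp, Gene Length, and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS length includes one terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of gene_bp bases.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(gene_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2031102, 2033099)
print(length)                     # 1998
print(protein_length_aa(length))  # 665
```

Under these assumptions the computed values agree with the listed Gene Length (1998 bp) and Protein Length (665 aa).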


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATC GAGGGAATGA GGCTTCGGGA GAGACGCAAC TCGATGCCTC CCTTGTGGCT 
ATTCTCCTCG CCGACCGTTC GAGCGGCGCT TTCATCCGCT GGGCCTGCAA CGCCTACACT
ACCCACGGCG AGTCCTATGC GGCGGATCAA GAGATCTACC CCCATCAGGT GCATCTGATT
CAGGAGCGCA CCCGCAAGAC GCAGGAAGAA CAGCGCGACC GCACCAAAAA ATCCGCCGAA
GTCTTTACCC CTGCTTGGCT GTGCAACGCC ATGATCAACG CCCGCGATGC CGTTTACTTC
GGGCGGGAGG AGGTCTTTAA CCGCATGGAG GCTCCATCGT GGACGCCGAC GCGCAAGACG
ATTGACTTTC CGACGACAGC ATCTGGCCGC CGTCTCGCGT GGGAGCGTTA CATCGATGCC
CGCTGTCTGG AAATCACCTG CGGCGAAGCC CCCTTTCTCG TCTCGCGCTA CGATGCCGTC
GATGGTCGCC CCATCTCCTT GGCAGAGCGC ATCGGCATCC TCGACCGCAA GCTGCGTATC
ATCGGCGAGC ATACCTGCAC CGCAGAGGAC TGGTTTCACT GGGCAAAACG CGCCCTCGAA
AGCGCCTACG CTTACGAATA CCAGGGCGAC AGCCTCTTTC TCGCCCGTCT CAATCTTTTT
CTGAGCATTA GCGAGTACCA CCGTCACCTG TGGAAACGCC CCCTCAACCG ACACCAACAA
GAGGAAGTTG CCCGCATCCT CTCGTGGAAT CTCTGGCAGA TGGACGGCCT GACGGCGACG
ACTCCCTTTG CCACGGAACA GGGGAAGCCC GAGGATTCCT TATTTGATTT TTACGCCATC
ACAGCCGAAA GGCGCCCCCT CCGTAGCCTC ATCCGCGACT GGCGCGGCAA AAAAACGATT
CGATTCTCTG AACTCAACCT ATCCACCACC ATGAAATTTG ATTTTGTCAT CGGCAATCCG
CCGTATCAGC TGGAAACCGC CAATAAATCC CTTTCCAACG GACAATTGCC AAGTAAAAGC
ATTTTTCATC ATTTTCAGCT GAGCGCGGAT CAGATTTCCT CCGGTCTTAC CGTCTTGATT
TATCCTGGAG GGCGATGGAT TCAGCGTTCT GGAAAAGGAA TGGCGGATTT CGGCTTACAA
CAAATCAATG ACAGCCGTTT ACAAACCTTG TATTATTACC CCGACAGTAC CGATCTCTTC
CCTGCACAAG TTGCCGAAAT TGCGGACGGC ATCTCCATTG TGGTCAAAAA TGCCCACAAG
ACGACCCCAA GCTTCCGCTA CTTCTATATG CGGCGCGGAG AAAAAACAGG TGTCGAGCTG
GAGCCTCCCG GCGAAAATAT CCTTCCTCTA GACCCACGCG ACGGAGCCGT TGTCAGGAAA
ATCGAGGATT TTGTTCAAAG AAACAAGTTG CCTTATTTGA ACGATAATGT GCACTCAAGA
AATCTATTTG GAATTGAGAG TAATTTTGTC GAAAAAAATC GAGATCAGGT TCGCCTCTAT
CAAGAAGGGG ATGCGGTAGA TTGCGAGACG GAGATCAAGC TATATGCTAA TGACCGAGCC
GGGAAAGCTG GCAGAACGAC ATGGTTTGTT GCACCAAGAA GCATTATTCA GACGAATGAG
GCTTACATCT CCAAATGGAA GGTTGTTGTT TCCAGTGCAA ACGCCGGGGG ACAAAAGAGA
GATTGGCAGT TGGAAATCAT CGACAATCAA TCGGCATTCG GTCGTTCGAG GGTTGCTCTT
TCTTCTTTTG AAACGAAGCA GGAAGCGGAG AATTTTTACC ACTATGTGAA GAGCTATATT
ATTCGCTATG CCTTCCTGAT GACGGATGAA GCTCTGACAA CTTTGGCGCT GAAAGTGCCG
GATATGTCAG ATTATACTTC TGACAACAAG TTAATCGACT GGTCGCAGGA TATTGATAGT
CAGTTGCAGA AGCTCATGTC CCTCAGTGAT GCTGAGTTTG AATACATCAA AAACACGGTG
CAGAGTGTAC GAGCCTAA
 
Protein sequence
MKHRGNEASG ETQLDASLVA ILLADRSSGA FIRWACNAYT THGESYAADQ EIYPHQVHLI 
QERTRKTQEE QRDRTKKSAE VFTPAWLCNA MINARDAVYF GREEVFNRME APSWTPTRKT
IDFPTTASGR RLAWERYIDA RCLEITCGEA PFLVSRYDAV DGRPISLAER IGILDRKLRI
IGEHTCTAED WFHWAKRALE SAYAYEYQGD SLFLARLNLF LSISEYHRHL WKRPLNRHQQ
EEVARILSWN LWQMDGLTAT TPFATEQGKP EDSLFDFYAI TAERRPLRSL IRDWRGKKTI
RFSELNLSTT MKFDFVIGNP PYQLETANKS LSNGQLPSKS IFHHFQLSAD QISSGLTVLI
YPGGRWIQRS GKGMADFGLQ QINDSRLQTL YYYPDSTDLF PAQVAEIADG ISIVVKNAHK
TTPSFRYFYM RRGEKTGVEL EPPGENILPL DPRDGAVVRK IEDFVQRNKL PYLNDNVHSR
NLFGIESNFV EKNRDQVRLY QEGDAVDCET EIKLYANDRA GKAGRTTWFV APRSIIQTNE
AYISKWKVVV SSANAGGQKR DWQLEIIDNQ SAFGRSRVAL SSFETKQEAE NFYHYVKSYI
IRYAFLMTDE ALTTLALKVP DMSDYTSDNK LIDWSQDIDS QLQKLMSLSD AEFEYIKNTV
QSVRA
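
The sequences above are laid out in space-separated blocks of 10 characters, 60 per row, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the method used to produce this record: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). Only the first row of the gene sequence is used as sample input.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11
# uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAAACATC GAGGGAATGA GGCTTCGGGA GAGACGCAAC TCGATGCCTC CCTTGTGGCT"
dna = clean(first_row)
print(len(dna))        # 60
print(translate(dna))  # MKHRGNEASGETQLDASLVA
```

The translated residues match the first two blocks of the protein sequence (MKHRGNEASG ETQLDASLVA). Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare with the listed GC content.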