Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1676 |
Symbol | |
ID | 6274450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2031102 |
End bp | 2033099 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642613735 |
Product | Eco57I restriction endonuclease |
Protein accession | YP_001878275 |
Protein GI | 187736163 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATC GAGGGAATGA GGCTTCGGGA GAGACGCAAC TCGATGCCTC CCTTGTGGCT ATTCTCCTCG CCGACCGTTC GAGCGGCGCT TTCATCCGCT GGGCCTGCAA CGCCTACACT ACCCACGGCG AGTCCTATGC GGCGGATCAA GAGATCTACC CCCATCAGGT GCATCTGATT CAGGAGCGCA CCCGCAAGAC GCAGGAAGAA CAGCGCGACC GCACCAAAAA ATCCGCCGAA GTCTTTACCC CTGCTTGGCT GTGCAACGCC ATGATCAACG CCCGCGATGC CGTTTACTTC GGGCGGGAGG AGGTCTTTAA CCGCATGGAG GCTCCATCGT GGACGCCGAC GCGCAAGACG ATTGACTTTC CGACGACAGC ATCTGGCCGC CGTCTCGCGT GGGAGCGTTA CATCGATGCC CGCTGTCTGG AAATCACCTG CGGCGAAGCC CCCTTTCTCG TCTCGCGCTA CGATGCCGTC GATGGTCGCC CCATCTCCTT GGCAGAGCGC ATCGGCATCC TCGACCGCAA GCTGCGTATC ATCGGCGAGC ATACCTGCAC CGCAGAGGAC TGGTTTCACT GGGCAAAACG CGCCCTCGAA AGCGCCTACG CTTACGAATA CCAGGGCGAC AGCCTCTTTC TCGCCCGTCT CAATCTTTTT CTGAGCATTA GCGAGTACCA CCGTCACCTG TGGAAACGCC CCCTCAACCG ACACCAACAA GAGGAAGTTG CCCGCATCCT CTCGTGGAAT CTCTGGCAGA TGGACGGCCT GACGGCGACG ACTCCCTTTG CCACGGAACA GGGGAAGCCC GAGGATTCCT TATTTGATTT TTACGCCATC ACAGCCGAAA GGCGCCCCCT CCGTAGCCTC ATCCGCGACT GGCGCGGCAA AAAAACGATT CGATTCTCTG AACTCAACCT ATCCACCACC ATGAAATTTG ATTTTGTCAT CGGCAATCCG CCGTATCAGC TGGAAACCGC CAATAAATCC CTTTCCAACG GACAATTGCC AAGTAAAAGC ATTTTTCATC ATTTTCAGCT GAGCGCGGAT CAGATTTCCT CCGGTCTTAC CGTCTTGATT TATCCTGGAG GGCGATGGAT TCAGCGTTCT GGAAAAGGAA TGGCGGATTT CGGCTTACAA CAAATCAATG ACAGCCGTTT ACAAACCTTG TATTATTACC CCGACAGTAC CGATCTCTTC CCTGCACAAG TTGCCGAAAT TGCGGACGGC ATCTCCATTG TGGTCAAAAA TGCCCACAAG ACGACCCCAA GCTTCCGCTA CTTCTATATG CGGCGCGGAG AAAAAACAGG TGTCGAGCTG GAGCCTCCCG GCGAAAATAT CCTTCCTCTA GACCCACGCG ACGGAGCCGT TGTCAGGAAA ATCGAGGATT TTGTTCAAAG AAACAAGTTG CCTTATTTGA ACGATAATGT GCACTCAAGA AATCTATTTG GAATTGAGAG TAATTTTGTC GAAAAAAATC GAGATCAGGT TCGCCTCTAT CAAGAAGGGG ATGCGGTAGA TTGCGAGACG GAGATCAAGC TATATGCTAA TGACCGAGCC GGGAAAGCTG GCAGAACGAC ATGGTTTGTT GCACCAAGAA GCATTATTCA GACGAATGAG GCTTACATCT CCAAATGGAA GGTTGTTGTT TCCAGTGCAA ACGCCGGGGG ACAAAAGAGA GATTGGCAGT TGGAAATCAT CGACAATCAA TCGGCATTCG GTCGTTCGAG GGTTGCTCTT TCTTCTTTTG AAACGAAGCA GGAAGCGGAG AATTTTTACC ACTATGTGAA GAGCTATATT ATTCGCTATG CCTTCCTGAT GACGGATGAA GCTCTGACAA CTTTGGCGCT GAAAGTGCCG GATATGTCAG ATTATACTTC TGACAACAAG TTAATCGACT GGTCGCAGGA TATTGATAGT CAGTTGCAGA AGCTCATGTC CCTCAGTGAT GCTGAGTTTG AATACATCAA AAACACGGTG CAGAGTGTAC GAGCCTAA
|
Protein sequence | MKHRGNEASG ETQLDASLVA ILLADRSSGA FIRWACNAYT THGESYAADQ EIYPHQVHLI QERTRKTQEE QRDRTKKSAE VFTPAWLCNA MINARDAVYF GREEVFNRME APSWTPTRKT IDFPTTASGR RLAWERYIDA RCLEITCGEA PFLVSRYDAV DGRPISLAER IGILDRKLRI IGEHTCTAED WFHWAKRALE SAYAYEYQGD SLFLARLNLF LSISEYHRHL WKRPLNRHQQ EEVARILSWN LWQMDGLTAT TPFATEQGKP EDSLFDFYAI TAERRPLRSL IRDWRGKKTI RFSELNLSTT MKFDFVIGNP PYQLETANKS LSNGQLPSKS IFHHFQLSAD QISSGLTVLI YPGGRWIQRS GKGMADFGLQ QINDSRLQTL YYYPDSTDLF PAQVAEIADG ISIVVKNAHK TTPSFRYFYM RRGEKTGVEL EPPGENILPL DPRDGAVVRK IEDFVQRNKL PYLNDNVHSR NLFGIESNFV EKNRDQVRLY QEGDAVDCET EIKLYANDRA GKAGRTTWFV APRSIIQTNE AYISKWKVVV SSANAGGQKR DWQLEIIDNQ SAFGRSRVAL SSFETKQEAE NFYHYVKSYI IRYAFLMTDE ALTTLALKVP DMSDYTSDNK LIDWSQDIDS QLQKLMSLSD AEFEYIKNTV QSVRA
|
| |