Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1240 |
Symbol | |
ID | 6275824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1487749 |
End bp | 1490136 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613297 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_001877846 |
Protein GI | 187735734 |
COG category | [S] Function unknown |
COG ID | [COG1729] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.292436 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCCG CCGCCCAGGA AAATATTCCG GATGCCGCTC CCATCGCGGA TGGTCCGCTG GTCGCTAATC CGGAGCAGGA TACGCTGGAT ATGGCGGACA TGCTGTACAA GCAGGCCCAG GCTCCGGCTA CGAAGGAGAA CCGGCAGGAA TATGGACGCC TTCTTGATTT AAGCTTGCGT AAATATCTGG AATTTACCCA ACGTTTTCCC CAGTCCGCCC AGGCTCCTCT GGCGGAGTAT CGTGCTGCCA TGTGTTTGGA GGAGCTGGGC AGGAAAGATG AGGCACATGG CCTGTTCCTC AGATTGATTC AGACGGGCAG CCCCGCCCTG GTTGCCGCCT CCGCCTACCG TCTGGCGACG GATGCTTCTT CTGCCGGGGA GATAGATAAG GCCATCCAGT ATTACCAGCT CGTGATTCGC AATGCGGAAC AGAATGATTT GAAGGTGGAC GCCCAGTACC GTCTGGGACG CCTGTTTCTT TCCAGCGGTA ATCCGGAGGC AGCCGCCACC ATGTTTTGTG CGGTGATGGG CAATCCGGAA GCGGATGCCA AGTTTGTTCT GGTGTCCCGG ATGGGATATG CCGCCCTGTG TGCGGATACG GGACGCTTGG GAGAAGCTTA TTCCGAATAT CGCAAGGTGC TGGAAACGCC CGGTGTGGAT AACAGGAACA GGGGAATCGC CACTCTTCAG GCCGCCATGC TGGCCACCAA ACTGAAAAAG ACTGCGGAAG CCCAGGGCCT TTATGAACGG CTTTTGAAGG ACGAGTCCCT GAAGGAGATG GCGCCGGAGG CCCGAATGGG GCTTCTTCTG GGGCTGTACA ATATGGGCAA GTACAGGGAG ATACTTTCCC AGTATGAGCA GCAGAAGGGA ATAAAGATGC CGACAAAGGA CGGCCAGGTG CGTCTGTTGA TGCTGCTGGG GCAGTCCGCC TATAAATTGA AGGAGTACCG GAAAGCGGCA GATTTCTTTC TGGAAGCGGA AAAGTCCGTT CCTTATACGC AGGAGGCCAT GCAAGCCTCC TTTTACCGGT TATTGTGTTA TAATGAACTT AAGCAGAAAG ATCTCCCCCA GCGGGCCCAG AGCTTCCTGA ACCATTATGC CAAGGCTTTT CCTACCAGTG AGCTGCATGA TATGGTGCGC CTGATGGCTG CAGAGAACCT GTTCAGCTCC AATCCGGCGG ATGCCGCCCG GTTTTATGCC AGCATTGATT TTGACAAAGT GCCCCCGAAG ATGCGTGCGG ATATTTTATA TAAGAGCGCA TGGGCGATTG CCCAGGCCGG GAACAGAGGT GTGGCCGCCA AGCTGCTGAC TGATTTTATC AATGATTTTC CGAAAGATCC CCGCATTTGC GAGGCATTGA CGTTGCGTGG GGACATGTAT GCCAAGACCA AGAAGGAAGC TGAGGCGCTG ATGGATTTTG ACCGTGTGAT TGCCCGCTGG CCGAAAGCTG AATCAGCTGC CGCCGCATGG CAGAGAGCCG CTCAGATTTA TGCGGGACGC CAGGATATGG CGAACATGGC GAAGTATTAT GAAGGCCTGA TTCAGAATTT TCCGAAGGCG TCTCCTGCCG CTTTGGCTGA AGCCCATTTC CTGCTGGGGC GTGCGGCGTT TGACCAGGGA GATTTCAAGT CTTCTATCAG CCATATGGCT GAGGCCAAGA CTCTGGATCC CCAGAAATAC GGAGAACAGG TTAATGTGCT TTCCGTTCTG TCCTATCACA AGCTCCAGGA CGTGAATAAA CTGAAGGAGG CCCTGGAAAC CCTGCAAAAG GAAAATCCTT CCGCCGTGGC TCGTGTGCCG GATGTCATCC CCGCATGGCT GGGCCTTCAG GCCTATGGGA TGAAGGATCT GGAAACGGCG GACAAGTATA TGACCTGGGC TACGCAGAAC GACCAGCTTC AGAATGTGAA GAAGGTGATT TGGCGTAATT TGGCGAAGGT GCGCCTGGCG CTCAGAAAGT ATGACCGCGC CTTGGTGGCT TCCAACAATT TTCTGAAGGA TGAGGACCAG CCTTACCGCC GTGCGGACGG CATGCTGGAC AAGGCTTCCA TTTTGCTGGG GCTGGGCAAG TATGCGGATG CCAGGAAGAC GGCGGAAGAT GCGCTGGCTC TGGGCGTGGA AGGTCCTCTG ATGGCTTCCT TGAAAATTGT TCTGGGGGAT ATTTCCTATG CGGAGAAAAA GTTTGATGAA GCGGCCAAGC ATTACGGTGT TACGGCCGAG CTGTTTGTCA ATGACGCCGA ACTGAAACCC AAGGCTCTCT TCAAGGCGGC GGAAGCTTTG GACAAGGCCG GGCGCAAATC GGAGGCTTCC CAATACCGGG CCCGCCTGCA AAAGGAATTC CCGGATTGGA AACAGGATGG AGAGTCTTTG CCTCCGGACG CACGATAA
|
Protein sequence | MFSAAQENIP DAAPIADGPL VANPEQDTLD MADMLYKQAQ APATKENRQE YGRLLDLSLR KYLEFTQRFP QSAQAPLAEY RAAMCLEELG RKDEAHGLFL RLIQTGSPAL VAASAYRLAT DASSAGEIDK AIQYYQLVIR NAEQNDLKVD AQYRLGRLFL SSGNPEAAAT MFCAVMGNPE ADAKFVLVSR MGYAALCADT GRLGEAYSEY RKVLETPGVD NRNRGIATLQ AAMLATKLKK TAEAQGLYER LLKDESLKEM APEARMGLLL GLYNMGKYRE ILSQYEQQKG IKMPTKDGQV RLLMLLGQSA YKLKEYRKAA DFFLEAEKSV PYTQEAMQAS FYRLLCYNEL KQKDLPQRAQ SFLNHYAKAF PTSELHDMVR LMAAENLFSS NPADAARFYA SIDFDKVPPK MRADILYKSA WAIAQAGNRG VAAKLLTDFI NDFPKDPRIC EALTLRGDMY AKTKKEAEAL MDFDRVIARW PKAESAAAAW QRAAQIYAGR QDMANMAKYY EGLIQNFPKA SPAALAEAHF LLGRAAFDQG DFKSSISHMA EAKTLDPQKY GEQVNVLSVL SYHKLQDVNK LKEALETLQK ENPSAVARVP DVIPAWLGLQ AYGMKDLETA DKYMTWATQN DQLQNVKKVI WRNLAKVRLA LRKYDRALVA SNNFLKDEDQ PYRRADGMLD KASILLGLGK YADARKTAED ALALGVEGPL MASLKIVLGD ISYAEKKFDE AAKHYGVTAE LFVNDAELKP KALFKAAEAL DKAGRKSEAS QYRARLQKEF PDWKQDGESL PPDAR
|
| |