Gene Amuc_1240 details


Gene Information

Locus tag: Amuc_1240
Symbol:
ID: 6275824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1487749
End bp: 1490136
Gene Length: 2388 bp
Protein Length: 795 aa
Translation table: 11
GC content: 55%
IMG OID: 642613297
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_001877846
Protein GI: 187735734
COG category: [S] Function unknown
COG ID: [COG1729] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
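
The start, end, gene length, and protein length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1487749, 1490136)
print(length, "bp")                      # 2388 bp
print(protein_length_aa(length), "aa")   # 795 aa
```

Under these assumptions the computed values match the Gene Length and Protein Length fields listed in the record.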


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.292436
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCCG CCGCCCAGGA AAATATTCCG GATGCCGCTC CCATCGCGGA TGGTCCGCTG 
GTCGCTAATC CGGAGCAGGA TACGCTGGAT ATGGCGGACA TGCTGTACAA GCAGGCCCAG
GCTCCGGCTA CGAAGGAGAA CCGGCAGGAA TATGGACGCC TTCTTGATTT AAGCTTGCGT
AAATATCTGG AATTTACCCA ACGTTTTCCC CAGTCCGCCC AGGCTCCTCT GGCGGAGTAT
CGTGCTGCCA TGTGTTTGGA GGAGCTGGGC AGGAAAGATG AGGCACATGG CCTGTTCCTC
AGATTGATTC AGACGGGCAG CCCCGCCCTG GTTGCCGCCT CCGCCTACCG TCTGGCGACG
GATGCTTCTT CTGCCGGGGA GATAGATAAG GCCATCCAGT ATTACCAGCT CGTGATTCGC
AATGCGGAAC AGAATGATTT GAAGGTGGAC GCCCAGTACC GTCTGGGACG CCTGTTTCTT
TCCAGCGGTA ATCCGGAGGC AGCCGCCACC ATGTTTTGTG CGGTGATGGG CAATCCGGAA
GCGGATGCCA AGTTTGTTCT GGTGTCCCGG ATGGGATATG CCGCCCTGTG TGCGGATACG
GGACGCTTGG GAGAAGCTTA TTCCGAATAT CGCAAGGTGC TGGAAACGCC CGGTGTGGAT
AACAGGAACA GGGGAATCGC CACTCTTCAG GCCGCCATGC TGGCCACCAA ACTGAAAAAG
ACTGCGGAAG CCCAGGGCCT TTATGAACGG CTTTTGAAGG ACGAGTCCCT GAAGGAGATG
GCGCCGGAGG CCCGAATGGG GCTTCTTCTG GGGCTGTACA ATATGGGCAA GTACAGGGAG
ATACTTTCCC AGTATGAGCA GCAGAAGGGA ATAAAGATGC CGACAAAGGA CGGCCAGGTG
CGTCTGTTGA TGCTGCTGGG GCAGTCCGCC TATAAATTGA AGGAGTACCG GAAAGCGGCA
GATTTCTTTC TGGAAGCGGA AAAGTCCGTT CCTTATACGC AGGAGGCCAT GCAAGCCTCC
TTTTACCGGT TATTGTGTTA TAATGAACTT AAGCAGAAAG ATCTCCCCCA GCGGGCCCAG
AGCTTCCTGA ACCATTATGC CAAGGCTTTT CCTACCAGTG AGCTGCATGA TATGGTGCGC
CTGATGGCTG CAGAGAACCT GTTCAGCTCC AATCCGGCGG ATGCCGCCCG GTTTTATGCC
AGCATTGATT TTGACAAAGT GCCCCCGAAG ATGCGTGCGG ATATTTTATA TAAGAGCGCA
TGGGCGATTG CCCAGGCCGG GAACAGAGGT GTGGCCGCCA AGCTGCTGAC TGATTTTATC
AATGATTTTC CGAAAGATCC CCGCATTTGC GAGGCATTGA CGTTGCGTGG GGACATGTAT
GCCAAGACCA AGAAGGAAGC TGAGGCGCTG ATGGATTTTG ACCGTGTGAT TGCCCGCTGG
CCGAAAGCTG AATCAGCTGC CGCCGCATGG CAGAGAGCCG CTCAGATTTA TGCGGGACGC
CAGGATATGG CGAACATGGC GAAGTATTAT GAAGGCCTGA TTCAGAATTT TCCGAAGGCG
TCTCCTGCCG CTTTGGCTGA AGCCCATTTC CTGCTGGGGC GTGCGGCGTT TGACCAGGGA
GATTTCAAGT CTTCTATCAG CCATATGGCT GAGGCCAAGA CTCTGGATCC CCAGAAATAC
GGAGAACAGG TTAATGTGCT TTCCGTTCTG TCCTATCACA AGCTCCAGGA CGTGAATAAA
CTGAAGGAGG CCCTGGAAAC CCTGCAAAAG GAAAATCCTT CCGCCGTGGC TCGTGTGCCG
GATGTCATCC CCGCATGGCT GGGCCTTCAG GCCTATGGGA TGAAGGATCT GGAAACGGCG
GACAAGTATA TGACCTGGGC TACGCAGAAC GACCAGCTTC AGAATGTGAA GAAGGTGATT
TGGCGTAATT TGGCGAAGGT GCGCCTGGCG CTCAGAAAGT ATGACCGCGC CTTGGTGGCT
TCCAACAATT TTCTGAAGGA TGAGGACCAG CCTTACCGCC GTGCGGACGG CATGCTGGAC
AAGGCTTCCA TTTTGCTGGG GCTGGGCAAG TATGCGGATG CCAGGAAGAC GGCGGAAGAT
GCGCTGGCTC TGGGCGTGGA AGGTCCTCTG ATGGCTTCCT TGAAAATTGT TCTGGGGGAT
ATTTCCTATG CGGAGAAAAA GTTTGATGAA GCGGCCAAGC ATTACGGTGT TACGGCCGAG
CTGTTTGTCA ATGACGCCGA ACTGAAACCC AAGGCTCTCT TCAAGGCGGC GGAAGCTTTG
GACAAGGCCG GGCGCAAATC GGAGGCTTCC CAATACCGGG CCCGCCTGCA AAAGGAATTC
CCGGATTGGA AACAGGATGG AGAGTCTTTG CCTCCGGACG CACGATAA
 
Protein sequence
MFSAAQENIP DAAPIADGPL VANPEQDTLD MADMLYKQAQ APATKENRQE YGRLLDLSLR 
KYLEFTQRFP QSAQAPLAEY RAAMCLEELG RKDEAHGLFL RLIQTGSPAL VAASAYRLAT
DASSAGEIDK AIQYYQLVIR NAEQNDLKVD AQYRLGRLFL SSGNPEAAAT MFCAVMGNPE
ADAKFVLVSR MGYAALCADT GRLGEAYSEY RKVLETPGVD NRNRGIATLQ AAMLATKLKK
TAEAQGLYER LLKDESLKEM APEARMGLLL GLYNMGKYRE ILSQYEQQKG IKMPTKDGQV
RLLMLLGQSA YKLKEYRKAA DFFLEAEKSV PYTQEAMQAS FYRLLCYNEL KQKDLPQRAQ
SFLNHYAKAF PTSELHDMVR LMAAENLFSS NPADAARFYA SIDFDKVPPK MRADILYKSA
WAIAQAGNRG VAAKLLTDFI NDFPKDPRIC EALTLRGDMY AKTKKEAEAL MDFDRVIARW
PKAESAAAAW QRAAQIYAGR QDMANMAKYY EGLIQNFPKA SPAALAEAHF LLGRAAFDQG
DFKSSISHMA EAKTLDPQKY GEQVNVLSVL SYHKLQDVNK LKEALETLQK ENPSAVARVP
DVIPAWLGLQ AYGMKDLETA DKYMTWATQN DQLQNVKKVI WRNLAKVRLA LRKYDRALVA
SNNFLKDEDQ PYRRADGMLD KASILLGLGK YADARKTAED ALALGVEGPL MASLKIVLGD
ISYAEKKFDE AAKHYGVTAE LFVNDAELKP KALFKAAEAL DKAGRKSEAS QYRARLQKEF
PDWKQDGESL PPDAR
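
The relationship between the gene sequence and the protein sequence can be checked with a short script. The sketch below is illustrative only: it assumes the sequences are pasted in as plain strings with the 10-character blocks separated by whitespace, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI layout: bases ordered T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as blocked in the record.
print(translate("ATGTTTTCCG CCGCCCAGGA AAATATTCCG"))  # MFSAAQENIP
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would test the whole record. `gc_percent` on the full gene sequence can be compared with the GC content field.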