Gene Amuc_0544 details


Gene Information

Locus tag: Amuc_0544
Symbol:
ID: 6275155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 642890
End bp: 646123
Gene Length: 3234 bp
Protein Length: 1077 aa
Translation table: 11
GC content: 56%
IMG OID: 642612594
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_001877163
Protein GI: 187735051
COG category: [S] Function unknown
COG ID: [COG1729] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
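
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(642890, 646123)
print(length)                     # 3234
print(protein_length_aa(length))  # 1077
```

Both values match the Gene Length and Protein Length fields of the record.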


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000321228
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.11155
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAAT CAAACTATAC TACTGCACTG GCTGCAGGCC TGGTTTCCGT GCTGAGTATC 
GGGGCGGTTC TTCCATCCGG CGCCCAGCAT GGCCCGCGCG ATTACCAGCG CACGGCCCTG
ACGGCCATTA AGGAAGGGAA GTGGCAGGAA GCCCTGGATG CCGTGGATCG CTGCATCCGC
GTTTATGAAC CCCGTATCAA GATGCTGGGG CTGGATGACG GCTTCGGCTG GTTCTATTAC
CAGAAGGGCG TCTGTCTGGC CCAGCTGAAA AATTACAAGG AGGCTGTGGA AGCGTTCAAG
GCTTGTTACA CCAAGTTTCC GAGCGCTAAA AACCAGCTCG TGAAAATGGC CCTGTTCCGG
GAAGGGGAAA ACTACTGCCG TCTTGGGGAT TTTGCCAAGG GTGCGGAGCT GCTGGAAAAA
TTCCTGAAAG AATACCGCAG CGATCCTGTC GCCAGAAACG TCAATGCTGG CGAAGTGCAG
GGGCTTCTGG CCCAATGCTA TTTCAAGATG TCTCCGCCTG CTTTTGAAAA AGGGATGGAA
AACCTTACCT CTTGCGTCAC GTCACGCTAC AAGGGCCGCC GCATTACGGA TGCCGTTATT
ACCAATGGCT TTCTGGCGAT GGTGGATGCC GCCATTAAGA CCGGCAAATG CAGTGAGACG
GTGAAGTTTG TGGAAAACTA CCCTTCCGTG ATGAATATCA GTCCCACGCG TGTGGCTTTG
TACACCCCGC GGCTGGTGAG CTACGTTGCG GAAGTGCTGG AGAAATCCCG TTCCCTGCTT
CAGGATGGAA AGCAGAAAGA GTCTGAAGAT TATGCTTCCC TGGCGATGGT GCTGATGGGT
CTTCTTCCCG ATCAGTCCGG AGTAATGGCG GATGCCAATT ATTCCCTGGA TCGTCTGGGC
CGTGCCAACG GGGCCGTGCC TGGCGTGACG GATTTTTCCC ATACGCTGGA CAGAGCGAAG
GTGACGGCCC TGATCGACCA GTTCAACAAG ATGAAGGAGG AAGGAAAGGT CATGGACGCC
TTCACGTTCA GCTTTATGGG CAACCAGGCC CTGGTGCATG GTTCCCAAAG GGTTGCCCGC
GCCGCCTACC AGCTTATCAA TGAATCCTAC CCGGATGCTC CGGGCAGGGA GGATAACCTG
TATTATCTGG CGATGACCAC CTGGCAGCTG GGAGAAGCGG ACAAGGGAGG CGAGCTTGTG
GCGCAGCACC TGAAGGAATT CCCCAATTCC AAGTATGCCC CCATGCTTAA TACGCTGTCT
TTGGAAGGGC TTCTGAAGGA AAAAAAATTC GATCTCTGCG TTCAGCAGGC GGACAAGGTC
ATGGAGTTGC ATAAGGATGA CCCCACCCAT AAGTTCTATG AACTGGCCCT GTACTGCAAA
GGAGCCTCCC TGTTCAACCT GGGGGCTGCC GACGCCTCCC GTTATAAGGA AGCGGTGCCG
GTGCTGGAAC GCTTCGTGAA GGAATACCGT GACAGCACTT ATCTGAAAAC GGCCATGTAC
CTTCTTGGTG AAACCTACAC GAACCTGGGC AATACGGATG AAGCCATCCG GTCCTTTACC
AATTACATTG CCCGTTTCCC GGACAAGGGG GAGGCCAATA TGGCCGCCGT ATTGTATGAC
CGGGCCTTCA ACTACCTGAA CCGCAAGAAC CCCGGAGACG AAGAGCTTGC CGCGAAAGAT
GCGAAGGAAA TTGTGGACAA TTTCAAGGAC CACCGCCTGT TCCCGTATGC CAACAATTTG
CTGGCTAATC TGTGTGCCGG CAGCAAGGAG CATGAGCAGG AAGCGGAAGG CTATTTCCTG
GCCGCTCTGG AGTCCGCCAA GAAGCTGGGC GACAAGCGTC CCGCTGCGGA AGCCGTGTAC
AACCTGTTTA TTAACGCTAC CAAGAAGCCT CTTCCGGTAG AACCGAAGGA AGCCGTGGAA
ACGGCCAGGA CGGCGCGCCG GGACGAGGTC AAGAAATGGT ATGACGAGTA CTGGAAAGAC
AGCGACCAGC CCGGCAGCCG CTACAGCCTC CAGCTGGCTG CCGCCGCCAT GGACTTCTTT
AAGGATGACA AGGAGATGTT TGACCCGGCA TCCGTCAAGA TGCAGGAAAT TATTGTGAGG
GAAGGCAAGA AGGACGATCC CAAGATGACC GTTCTTCTGG AAGAGGCCGT CAATTCCTAT
ACCAAGACGT ACATGGCCGG CAATCAGGCC CTGGGCCGCA ATCTGGATGC CAATGCCATG
CGCAACCACT TCTACCGGTT CCCCGGCGTG GACAATGATA AAGACAAGAC GCTGAGCGCC
ATGCTTCGCA TGGCCGTTAT TGCCCAGACT CAGGAACGGT ATGAAAAGGC TCCTGTGGAG
ACGGACGAAC AGCGTGCCGA GAAAGCCGCC CTGGAAGGTC TGGTCAAGCA GCTCTTCGTG
GAGCTGAAGC GCGACTTCAA GCCTTCCGAT CTGCCCCCGT ACACGCTTGT GAAGCTTGGC
ATGCACCTGG CCGGCACTTC CCAGCCTGAA GAAAGCATCT CCTACTTTGA TGAAATCCTG
GACCCGTCGG AACCTGACCC GGTGCGTAAG AAGGCCCGCA TCAACGGCAT GTCCAAGTAC
CGCAAGAATG CGGTCTTCGG GAAAGCCGTA GCTCTGGGGC GCAGCAAGGA TAACGCCAAG
GTGGACACCG CCATCAAGAT GATGAGGGAT GAACTGAGCA AGGAAGAATC CAGCTCCAAC
CCGGACCGCA AGGCCATGGA AGACGCCCAG TACAATCTGG TCAAGTTCAC TTCCGCCCGC
CAGGACTGGC CGGCCGTCAT TGCCGCTGCC GACAAGTACC GCGAAAACAA GACCTATAAG
AAGAATCTGC CGGAAGTCCT CTATCTGCAG GGTGAAGCCT ACCTGAAGCA GAATGAGCTG
GACAAGGCGT TGATTAACTT CATGAACATC ACGGGTACGT ACAAGGGGCT CGTGAAGTGG
TCCGCCCCCG CCGTGCTGGC GCAGATGGAT ACGCTGTGGA AGAGGAATAC GATGTCCCAG
GGTGCGGGCA AGCAGCCTTC CGACAGGTAC GTTGCCTGGA AGGCCGGCAG CCAGTACGTG
CAGTTGCTGG ATACTCCCGC CAACCGCAAG AAGATGACGG CGGAGGACAG TGCCCTGGTC
AATGAGGTGA AGGATAAGAC GGCCAAGTTC GGTTCCGATC CTGCCGTCAG CCAGGAACGG
GCGGACATTG CCGCCTATGA AGCAGCCGTA CGCGCCGCCA AGGGCCAGAA ATAA
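
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any analysis. A minimal sketch of that cleaning step, together with a GC-fraction calculation that could be compared against the GC content field, is given below. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence dump and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as it appears in the record.
sample = "ATGATCAAAT CAAACTATAC TACTGCACTG GCTGCAGGCC TGGTTTCCGT GCTGAGTATC"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 3))
```

Applying the same two functions to the complete sequence gives a figure to compare with the 56% listed in the Gene Information section.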
 
Protein sequence
MIKSNYTTAL AAGLVSVLSI GAVLPSGAQH GPRDYQRTAL TAIKEGKWQE ALDAVDRCIR 
VYEPRIKMLG LDDGFGWFYY QKGVCLAQLK NYKEAVEAFK ACYTKFPSAK NQLVKMALFR
EGENYCRLGD FAKGAELLEK FLKEYRSDPV ARNVNAGEVQ GLLAQCYFKM SPPAFEKGME
NLTSCVTSRY KGRRITDAVI TNGFLAMVDA AIKTGKCSET VKFVENYPSV MNISPTRVAL
YTPRLVSYVA EVLEKSRSLL QDGKQKESED YASLAMVLMG LLPDQSGVMA DANYSLDRLG
RANGAVPGVT DFSHTLDRAK VTALIDQFNK MKEEGKVMDA FTFSFMGNQA LVHGSQRVAR
AAYQLINESY PDAPGREDNL YYLAMTTWQL GEADKGGELV AQHLKEFPNS KYAPMLNTLS
LEGLLKEKKF DLCVQQADKV MELHKDDPTH KFYELALYCK GASLFNLGAA DASRYKEAVP
VLERFVKEYR DSTYLKTAMY LLGETYTNLG NTDEAIRSFT NYIARFPDKG EANMAAVLYD
RAFNYLNRKN PGDEELAAKD AKEIVDNFKD HRLFPYANNL LANLCAGSKE HEQEAEGYFL
AALESAKKLG DKRPAAEAVY NLFINATKKP LPVEPKEAVE TARTARRDEV KKWYDEYWKD
SDQPGSRYSL QLAAAAMDFF KDDKEMFDPA SVKMQEIIVR EGKKDDPKMT VLLEEAVNSY
TKTYMAGNQA LGRNLDANAM RNHFYRFPGV DNDKDKTLSA MLRMAVIAQT QERYEKAPVE
TDEQRAEKAA LEGLVKQLFV ELKRDFKPSD LPPYTLVKLG MHLAGTSQPE ESISYFDEIL
DPSEPDPVRK KARINGMSKY RKNAVFGKAV ALGRSKDNAK VDTAIKMMRD ELSKEESSSN
PDRKAMEDAQ YNLVKFTSAR QDWPAVIAAA DKYRENKTYK KNLPEVLYLQ GEAYLKQNEL
DKALINFMNI TGTYKGLVKW SAPAVLAQMD TLWKRNTMSQ GAGKQPSDRY VAWKAGSQYV
QLLDTPANRK KMTAEDSALV NEVKDKTAKF GSDPAVSQER ADIAAYEAAV RAAKGQK
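
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline: it builds the codon table from the standard amino-acid ordering, which translation table 11 shares with the standard code for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are assumptions made for the example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGATCAAAT CAAACTATAC TACTGCACTG"))  # MIKSNYTTAL
```

The output matches the first ten residues of the protein sequence listed above.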