Gene Information |
Locus tag | Amuc_0544 |
Symbol | |
ID | 6275155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 642890 |
End bp | 646123 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612594 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_001877163 |
Protein GI | 187735051 |
COG category | [S] Function unknown |
COG ID | [COG1729] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
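The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive (the record does not state its convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 642890
end_bp = 646123
gene_length_bp = 3234
protein_length_aa = 1077

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(span_bp % 3 == 0)                          # CDS is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions the three checks agree with the table: 3234 bp is 1078 codons, or 1077 amino acids plus one stop codon.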
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000321228 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.11155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAAT CAAACTATAC TACTGCACTG GCTGCAGGCC TGGTTTCCGT GCTGAGTATC GGGGCGGTTC TTCCATCCGG CGCCCAGCAT GGCCCGCGCG ATTACCAGCG CACGGCCCTG ACGGCCATTA AGGAAGGGAA GTGGCAGGAA GCCCTGGATG CCGTGGATCG CTGCATCCGC GTTTATGAAC CCCGTATCAA GATGCTGGGG CTGGATGACG GCTTCGGCTG GTTCTATTAC CAGAAGGGCG TCTGTCTGGC CCAGCTGAAA AATTACAAGG AGGCTGTGGA AGCGTTCAAG GCTTGTTACA CCAAGTTTCC GAGCGCTAAA AACCAGCTCG TGAAAATGGC CCTGTTCCGG GAAGGGGAAA ACTACTGCCG TCTTGGGGAT TTTGCCAAGG GTGCGGAGCT GCTGGAAAAA TTCCTGAAAG AATACCGCAG CGATCCTGTC GCCAGAAACG TCAATGCTGG CGAAGTGCAG GGGCTTCTGG CCCAATGCTA TTTCAAGATG TCTCCGCCTG CTTTTGAAAA AGGGATGGAA AACCTTACCT CTTGCGTCAC GTCACGCTAC AAGGGCCGCC GCATTACGGA TGCCGTTATT ACCAATGGCT TTCTGGCGAT GGTGGATGCC GCCATTAAGA CCGGCAAATG CAGTGAGACG GTGAAGTTTG TGGAAAACTA CCCTTCCGTG ATGAATATCA GTCCCACGCG TGTGGCTTTG TACACCCCGC GGCTGGTGAG CTACGTTGCG GAAGTGCTGG AGAAATCCCG TTCCCTGCTT CAGGATGGAA AGCAGAAAGA GTCTGAAGAT TATGCTTCCC TGGCGATGGT GCTGATGGGT CTTCTTCCCG ATCAGTCCGG AGTAATGGCG GATGCCAATT ATTCCCTGGA TCGTCTGGGC CGTGCCAACG GGGCCGTGCC TGGCGTGACG GATTTTTCCC ATACGCTGGA CAGAGCGAAG GTGACGGCCC TGATCGACCA GTTCAACAAG ATGAAGGAGG AAGGAAAGGT CATGGACGCC TTCACGTTCA GCTTTATGGG CAACCAGGCC CTGGTGCATG GTTCCCAAAG GGTTGCCCGC GCCGCCTACC AGCTTATCAA TGAATCCTAC CCGGATGCTC CGGGCAGGGA GGATAACCTG TATTATCTGG CGATGACCAC CTGGCAGCTG GGAGAAGCGG ACAAGGGAGG CGAGCTTGTG GCGCAGCACC TGAAGGAATT CCCCAATTCC AAGTATGCCC CCATGCTTAA TACGCTGTCT TTGGAAGGGC TTCTGAAGGA AAAAAAATTC GATCTCTGCG TTCAGCAGGC GGACAAGGTC ATGGAGTTGC ATAAGGATGA CCCCACCCAT AAGTTCTATG AACTGGCCCT GTACTGCAAA GGAGCCTCCC TGTTCAACCT GGGGGCTGCC GACGCCTCCC GTTATAAGGA AGCGGTGCCG GTGCTGGAAC GCTTCGTGAA GGAATACCGT GACAGCACTT ATCTGAAAAC GGCCATGTAC CTTCTTGGTG AAACCTACAC GAACCTGGGC AATACGGATG AAGCCATCCG GTCCTTTACC AATTACATTG CCCGTTTCCC GGACAAGGGG GAGGCCAATA TGGCCGCCGT ATTGTATGAC CGGGCCTTCA ACTACCTGAA CCGCAAGAAC CCCGGAGACG AAGAGCTTGC CGCGAAAGAT GCGAAGGAAA TTGTGGACAA TTTCAAGGAC CACCGCCTGT TCCCGTATGC CAACAATTTG CTGGCTAATC TGTGTGCCGG CAGCAAGGAG CATGAGCAGG AAGCGGAAGG CTATTTCCTG 
GCCGCTCTGG AGTCCGCCAA GAAGCTGGGC GACAAGCGTC CCGCTGCGGA AGCCGTGTAC AACCTGTTTA TTAACGCTAC CAAGAAGCCT CTTCCGGTAG AACCGAAGGA AGCCGTGGAA ACGGCCAGGA CGGCGCGCCG GGACGAGGTC AAGAAATGGT ATGACGAGTA CTGGAAAGAC AGCGACCAGC CCGGCAGCCG CTACAGCCTC CAGCTGGCTG CCGCCGCCAT GGACTTCTTT AAGGATGACA AGGAGATGTT TGACCCGGCA TCCGTCAAGA TGCAGGAAAT TATTGTGAGG GAAGGCAAGA AGGACGATCC CAAGATGACC GTTCTTCTGG AAGAGGCCGT CAATTCCTAT ACCAAGACGT ACATGGCCGG CAATCAGGCC CTGGGCCGCA ATCTGGATGC CAATGCCATG CGCAACCACT TCTACCGGTT CCCCGGCGTG GACAATGATA AAGACAAGAC GCTGAGCGCC ATGCTTCGCA TGGCCGTTAT TGCCCAGACT CAGGAACGGT ATGAAAAGGC TCCTGTGGAG ACGGACGAAC AGCGTGCCGA GAAAGCCGCC CTGGAAGGTC TGGTCAAGCA GCTCTTCGTG GAGCTGAAGC GCGACTTCAA GCCTTCCGAT CTGCCCCCGT ACACGCTTGT GAAGCTTGGC ATGCACCTGG CCGGCACTTC CCAGCCTGAA GAAAGCATCT CCTACTTTGA TGAAATCCTG GACCCGTCGG AACCTGACCC GGTGCGTAAG AAGGCCCGCA TCAACGGCAT GTCCAAGTAC CGCAAGAATG CGGTCTTCGG GAAAGCCGTA GCTCTGGGGC GCAGCAAGGA TAACGCCAAG GTGGACACCG CCATCAAGAT GATGAGGGAT GAACTGAGCA AGGAAGAATC CAGCTCCAAC CCGGACCGCA AGGCCATGGA AGACGCCCAG TACAATCTGG TCAAGTTCAC TTCCGCCCGC CAGGACTGGC CGGCCGTCAT TGCCGCTGCC GACAAGTACC GCGAAAACAA GACCTATAAG AAGAATCTGC CGGAAGTCCT CTATCTGCAG GGTGAAGCCT ACCTGAAGCA GAATGAGCTG GACAAGGCGT TGATTAACTT CATGAACATC ACGGGTACGT ACAAGGGGCT CGTGAAGTGG TCCGCCCCCG CCGTGCTGGC GCAGATGGAT ACGCTGTGGA AGAGGAATAC GATGTCCCAG GGTGCGGGCA AGCAGCCTTC CGACAGGTAC GTTGCCTGGA AGGCCGGCAG CCAGTACGTG CAGTTGCTGG ATACTCCCGC CAACCGCAAG AAGATGACGG CGGAGGACAG TGCCCTGGTC AATGAGGTGA AGGATAAGAC GGCCAAGTTC GGTTCCGATC CTGCCGTCAG CCAGGAACGG GCGGACATTG CCGCCTATGA AGCAGCCGTA CGCGCCGCCA AGGGCCAGAA ATAA
|
Protein sequence | MIKSNYTTAL AAGLVSVLSI GAVLPSGAQH GPRDYQRTAL TAIKEGKWQE ALDAVDRCIR VYEPRIKMLG LDDGFGWFYY QKGVCLAQLK NYKEAVEAFK ACYTKFPSAK NQLVKMALFR EGENYCRLGD FAKGAELLEK FLKEYRSDPV ARNVNAGEVQ GLLAQCYFKM SPPAFEKGME NLTSCVTSRY KGRRITDAVI TNGFLAMVDA AIKTGKCSET VKFVENYPSV MNISPTRVAL YTPRLVSYVA EVLEKSRSLL QDGKQKESED YASLAMVLMG LLPDQSGVMA DANYSLDRLG RANGAVPGVT DFSHTLDRAK VTALIDQFNK MKEEGKVMDA FTFSFMGNQA LVHGSQRVAR AAYQLINESY PDAPGREDNL YYLAMTTWQL GEADKGGELV AQHLKEFPNS KYAPMLNTLS LEGLLKEKKF DLCVQQADKV MELHKDDPTH KFYELALYCK GASLFNLGAA DASRYKEAVP VLERFVKEYR DSTYLKTAMY LLGETYTNLG NTDEAIRSFT NYIARFPDKG EANMAAVLYD RAFNYLNRKN PGDEELAAKD AKEIVDNFKD HRLFPYANNL LANLCAGSKE HEQEAEGYFL AALESAKKLG DKRPAAEAVY NLFINATKKP LPVEPKEAVE TARTARRDEV KKWYDEYWKD SDQPGSRYSL QLAAAAMDFF KDDKEMFDPA SVKMQEIIVR EGKKDDPKMT VLLEEAVNSY TKTYMAGNQA LGRNLDANAM RNHFYRFPGV DNDKDKTLSA MLRMAVIAQT QERYEKAPVE TDEQRAEKAA LEGLVKQLFV ELKRDFKPSD LPPYTLVKLG MHLAGTSQPE ESISYFDEIL DPSEPDPVRK KARINGMSKY RKNAVFGKAV ALGRSKDNAK VDTAIKMMRD ELSKEESSSN PDRKAMEDAQ YNLVKFTSAR QDWPAVIAAA DKYRENKTYK KNLPEVLYLQ GEAYLKQNEL DKALINFMNI TGTYKGLVKW SAPAVLAQMD TLWKRNTMSQ GAGKQPSDRY VAWKAGSQYV QLLDTPANRK KMTAEDSALV NEVKDKTAKF GSDPAVSQER ADIAAYEAAV RAAKGQK
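The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below is an illustration rather than the database's own method. It shows one way to translate such a sequence and to compute its GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The codon mapping is the standard one, which translation table 11 shares for all sense codons; the two tables differ only in which codons may act as starts.

```python
# Amino acids for all 64 codons in NCBI order (bases T, C, A, G).
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate coding-strand DNA, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence, copied from the record.
first_blocks = "ATGATCAAAT CAAACTATAC TACTGCACTG"
last_blocks = "CGCGCCGCCA AGGGCCAGAA ATAA"

print(translate(first_blocks))  # MIKSNYTTAL
print(translate(last_blocks))   # RAAKGQK
```

The two printed fragments match the start and end of the protein sequence in the record. Passing the complete gene sequence to `gc_fraction` gives a value to compare against the GC content field.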