Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1631 |
Symbol | |
ID | 6273820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1965361 |
End bp | 1967607 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613691 |
Product | carboxyl-terminal protease |
Protein accession | YP_001878232 |
Protein GI | 187736120 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAA ACGCACCCTT TTCCGTTATG AACATGCACT CATTCCGTTG GATTAGACTC ACCGCATTCT CGGCCCTGGC CGCAGCCGCC ATTACTTCCT GCGCCTCTGC GGCTACGGAC TTCAACCAGG TGGGCAAGCA AATGTCCCTG CTGCTCCAGA ATTTCCACTT CTCCCGCAAA GAATTCAGCG ATGAACTATC CACTAAATTC CTGGAAACCT ACCTGCGCAA GGTAGACCCC AACAAAATAT TCTTCACCCA GCAGGACGTA GACGCCCTCA AAAGAAAATA CGGTAAGGAG CTGGACGACT ACCTTATGTC CGGCCAGATG ATGGATGCGG CCCAGGCCAT GCACGCCCTT TACCGCCAGC GCGCCATGCA GCGCATCTCC TATGCGCGGG ATTTGCTGAA AAAGGGAGGC TTCACCTTTG ACAAAGACAA GTCTATCGAA CGTTCCCGCC GCAAAACAGC CGCGTGGCCC AAGGATGAGG CGGAAATGCA GCAGGTCTGG AAAGACATGG TGGAGGAACA GCTCCTGTCC GAAATCCTGC GCCGTGAAAC CGTAGCGCGC CTGGCCAAGG AACAGAACAA GCCCGATCCC CTGGCCAATG AAAAACCCGC GGAGGAAAAA CTGCTTATGC GTTATGAACG CATTCAACGC AATATTCAGG AAACGGATCT GGAAGACGTA GCGGAAACAC TGCTCAGCGC CGTAGCCTTG ACGTATGACC CGCATACGGA TTACATGGGT GCGCGCCAGG TGGACCGTTT CAAAATCTCC ATGGGTACGG AACTCACCGG CATCGGCGCC CTGTTGGGCA GTGAAGACGA CGGTTCCACC AAAATTACCG GTATCGTTGT GGGAGGACCG GCTGACAAAT CCGGAGAATT GAAGCTGAAC GACCGCATCG TTGCCATTGA CTCCGACAAC TCCGGAGAAA TGGTGGATAT CCTGTTCATG AAGCTGGACA AAGTGGTGGA TATGATCCGC GGAGCCGAAA ATACCCAGAT GCGCCTGAAA GTAGAGCCGG CAGACGCCCC CGGACAGGCC AAAATCATTA CGCTGACCCG CTCCAAGGTA CCTCTGAAGG ATGAACTTGC CAAAGGTGAA ATCATTGAAC TTACCGGAGC TCCGGAAGGC AGGAACCGCA TTGGCGTGCT GAGCCTTCCC TCCTTCTACG CAGACATGGA AGGCGGAGAC CGCCGCTGTG CCAAGGATGT CAAAAAAATC CTGGAACGGA TGAACAAGGA AAATGTGGAT GGCCTGGTAA TTGACCTGCG TAGCAACGGC GGCGGTTCCC TGGAGGAAGT GCGCCTGATG ACGGGCTTCT TTACCGGAAA CGGCCCCGTG GTACAAATCA AGGACACCCG CGGCAACGTG GATATCAAAT CCGCCCACAA CCGCCAGAAA CTCTTCAATG GCCCCATTGT GGTGCTCATT AATAAACTCA GCGCATCCGC CTCTGAAATT CTGGCCGCGG CCCTTCAGGA TTACGGCCGC GCCGTGATTG TGGGGGATGA ATCCACCTTC GGGAAGGGGT CTGTGCAGCA GCCTGTGGAC ATCGGCCAAT ACCTGCCTTT CTTCGCGGCC AGAGACCGTG CGGGCCTGCT GAAAGTCACT ACCCAGAAAT TTTACCGTGT GGCGGGCGGC TCCACCCAGC TCAAAGGCGT GGAAAGCGAT ATCCAGCTTC CCACCGCTAC GGCGGCATTC GAGCTGGGAG AAGACATTCT GGACTACGCG ATGCCCTATG ACCAGATTAC GCCCTGCACC AACTACAAAA AGGACTCCTC CATCGCGGCC ATGCTGCCCG TGCTGAAAGA TGCCAGCGCG AAGCGCGTGG AAAAAGACCG CGACCTCCAG ATTGCCAGGG AAGATATCGC CATGATGAAA CAGCGCATCA AGGACAACAA GCTTTCCCTG AACAAGAAAA TCCGGGAACA GGAAAACTCC GCCCTGGAAG AACGCCGCAA ATCCATCAAC AAGGAACGTA AAATCCGCTT CGCGGAAATG GCCAGGGAAG ACGCAACCAA ATACAAAATT TACCGCCTGA CGCTGGATGA CGTCAACGCC AAGGAGCTGC CCCTGGCGGA TCCGGAAAAA GACAATGAAC AATTCATGCA CCTGGCGGAA GACCCCACGG CAGAACTGGA CGACTCCCCG GAATACCCCT CCGGCCTTGA TCCGGAACTC CGCGAAGGCA TCAACATCGT CCAGGATATG CTGAAGCTGG AATCCTCCGG AAAATAA
|
Protein sequence | MEKNAPFSVM NMHSFRWIRL TAFSALAAAA ITSCASAATD FNQVGKQMSL LLQNFHFSRK EFSDELSTKF LETYLRKVDP NKIFFTQQDV DALKRKYGKE LDDYLMSGQM MDAAQAMHAL YRQRAMQRIS YARDLLKKGG FTFDKDKSIE RSRRKTAAWP KDEAEMQQVW KDMVEEQLLS EILRRETVAR LAKEQNKPDP LANEKPAEEK LLMRYERIQR NIQETDLEDV AETLLSAVAL TYDPHTDYMG ARQVDRFKIS MGTELTGIGA LLGSEDDGST KITGIVVGGP ADKSGELKLN DRIVAIDSDN SGEMVDILFM KLDKVVDMIR GAENTQMRLK VEPADAPGQA KIITLTRSKV PLKDELAKGE IIELTGAPEG RNRIGVLSLP SFYADMEGGD RRCAKDVKKI LERMNKENVD GLVIDLRSNG GGSLEEVRLM TGFFTGNGPV VQIKDTRGNV DIKSAHNRQK LFNGPIVVLI NKLSASASEI LAAALQDYGR AVIVGDESTF GKGSVQQPVD IGQYLPFFAA RDRAGLLKVT TQKFYRVAGG STQLKGVESD IQLPTATAAF ELGEDILDYA MPYDQITPCT NYKKDSSIAA MLPVLKDASA KRVEKDRDLQ IAREDIAMMK QRIKDNKLSL NKKIREQENS ALEERRKSIN KERKIRFAEM AREDATKYKI YRLTLDDVNA KELPLADPEK DNEQFMHLAE DPTAELDDSP EYPSGLDPEL REGINIVQDM LKLESSGK
|
| |