Gene Amuc_1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1631 
Symbol 
ID6273820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1965361 
End bp1967607 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content55% 
IMG OID642613691 
Productcarboxyl-terminal protease 
Protein accessionYP_001878232 
Protein GI187736120 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAA ACGCACCCTT TTCCGTTATG AACATGCACT CATTCCGTTG GATTAGACTC 
ACCGCATTCT CGGCCCTGGC CGCAGCCGCC ATTACTTCCT GCGCCTCTGC GGCTACGGAC
TTCAACCAGG TGGGCAAGCA AATGTCCCTG CTGCTCCAGA ATTTCCACTT CTCCCGCAAA
GAATTCAGCG ATGAACTATC CACTAAATTC CTGGAAACCT ACCTGCGCAA GGTAGACCCC
AACAAAATAT TCTTCACCCA GCAGGACGTA GACGCCCTCA AAAGAAAATA CGGTAAGGAG
CTGGACGACT ACCTTATGTC CGGCCAGATG ATGGATGCGG CCCAGGCCAT GCACGCCCTT
TACCGCCAGC GCGCCATGCA GCGCATCTCC TATGCGCGGG ATTTGCTGAA AAAGGGAGGC
TTCACCTTTG ACAAAGACAA GTCTATCGAA CGTTCCCGCC GCAAAACAGC CGCGTGGCCC
AAGGATGAGG CGGAAATGCA GCAGGTCTGG AAAGACATGG TGGAGGAACA GCTCCTGTCC
GAAATCCTGC GCCGTGAAAC CGTAGCGCGC CTGGCCAAGG AACAGAACAA GCCCGATCCC
CTGGCCAATG AAAAACCCGC GGAGGAAAAA CTGCTTATGC GTTATGAACG CATTCAACGC
AATATTCAGG AAACGGATCT GGAAGACGTA GCGGAAACAC TGCTCAGCGC CGTAGCCTTG
ACGTATGACC CGCATACGGA TTACATGGGT GCGCGCCAGG TGGACCGTTT CAAAATCTCC
ATGGGTACGG AACTCACCGG CATCGGCGCC CTGTTGGGCA GTGAAGACGA CGGTTCCACC
AAAATTACCG GTATCGTTGT GGGAGGACCG GCTGACAAAT CCGGAGAATT GAAGCTGAAC
GACCGCATCG TTGCCATTGA CTCCGACAAC TCCGGAGAAA TGGTGGATAT CCTGTTCATG
AAGCTGGACA AAGTGGTGGA TATGATCCGC GGAGCCGAAA ATACCCAGAT GCGCCTGAAA
GTAGAGCCGG CAGACGCCCC CGGACAGGCC AAAATCATTA CGCTGACCCG CTCCAAGGTA
CCTCTGAAGG ATGAACTTGC CAAAGGTGAA ATCATTGAAC TTACCGGAGC TCCGGAAGGC
AGGAACCGCA TTGGCGTGCT GAGCCTTCCC TCCTTCTACG CAGACATGGA AGGCGGAGAC
CGCCGCTGTG CCAAGGATGT CAAAAAAATC CTGGAACGGA TGAACAAGGA AAATGTGGAT
GGCCTGGTAA TTGACCTGCG TAGCAACGGC GGCGGTTCCC TGGAGGAAGT GCGCCTGATG
ACGGGCTTCT TTACCGGAAA CGGCCCCGTG GTACAAATCA AGGACACCCG CGGCAACGTG
GATATCAAAT CCGCCCACAA CCGCCAGAAA CTCTTCAATG GCCCCATTGT GGTGCTCATT
AATAAACTCA GCGCATCCGC CTCTGAAATT CTGGCCGCGG CCCTTCAGGA TTACGGCCGC
GCCGTGATTG TGGGGGATGA ATCCACCTTC GGGAAGGGGT CTGTGCAGCA GCCTGTGGAC
ATCGGCCAAT ACCTGCCTTT CTTCGCGGCC AGAGACCGTG CGGGCCTGCT GAAAGTCACT
ACCCAGAAAT TTTACCGTGT GGCGGGCGGC TCCACCCAGC TCAAAGGCGT GGAAAGCGAT
ATCCAGCTTC CCACCGCTAC GGCGGCATTC GAGCTGGGAG AAGACATTCT GGACTACGCG
ATGCCCTATG ACCAGATTAC GCCCTGCACC AACTACAAAA AGGACTCCTC CATCGCGGCC
ATGCTGCCCG TGCTGAAAGA TGCCAGCGCG AAGCGCGTGG AAAAAGACCG CGACCTCCAG
ATTGCCAGGG AAGATATCGC CATGATGAAA CAGCGCATCA AGGACAACAA GCTTTCCCTG
AACAAGAAAA TCCGGGAACA GGAAAACTCC GCCCTGGAAG AACGCCGCAA ATCCATCAAC
AAGGAACGTA AAATCCGCTT CGCGGAAATG GCCAGGGAAG ACGCAACCAA ATACAAAATT
TACCGCCTGA CGCTGGATGA CGTCAACGCC AAGGAGCTGC CCCTGGCGGA TCCGGAAAAA
GACAATGAAC AATTCATGCA CCTGGCGGAA GACCCCACGG CAGAACTGGA CGACTCCCCG
GAATACCCCT CCGGCCTTGA TCCGGAACTC CGCGAAGGCA TCAACATCGT CCAGGATATG
CTGAAGCTGG AATCCTCCGG AAAATAA
 
Protein sequence
MEKNAPFSVM NMHSFRWIRL TAFSALAAAA ITSCASAATD FNQVGKQMSL LLQNFHFSRK 
EFSDELSTKF LETYLRKVDP NKIFFTQQDV DALKRKYGKE LDDYLMSGQM MDAAQAMHAL
YRQRAMQRIS YARDLLKKGG FTFDKDKSIE RSRRKTAAWP KDEAEMQQVW KDMVEEQLLS
EILRRETVAR LAKEQNKPDP LANEKPAEEK LLMRYERIQR NIQETDLEDV AETLLSAVAL
TYDPHTDYMG ARQVDRFKIS MGTELTGIGA LLGSEDDGST KITGIVVGGP ADKSGELKLN
DRIVAIDSDN SGEMVDILFM KLDKVVDMIR GAENTQMRLK VEPADAPGQA KIITLTRSKV
PLKDELAKGE IIELTGAPEG RNRIGVLSLP SFYADMEGGD RRCAKDVKKI LERMNKENVD
GLVIDLRSNG GGSLEEVRLM TGFFTGNGPV VQIKDTRGNV DIKSAHNRQK LFNGPIVVLI
NKLSASASEI LAAALQDYGR AVIVGDESTF GKGSVQQPVD IGQYLPFFAA RDRAGLLKVT
TQKFYRVAGG STQLKGVESD IQLPTATAAF ELGEDILDYA MPYDQITPCT NYKKDSSIAA
MLPVLKDASA KRVEKDRDLQ IAREDIAMMK QRIKDNKLSL NKKIREQENS ALEERRKSIN
KERKIRFAEM AREDATKYKI YRLTLDDVNA KELPLADPEK DNEQFMHLAE DPTAELDDSP
EYPSGLDPEL REGINIVQDM LKLESSGK