Gene Amuc_0917 details


Gene Information

Locus tag: Amuc_0917
Symbol:
ID: 6274250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1094295
End bp: 1096802
Gene Length: 2508 bp
Protein Length: 835 aa
Translation table: 11
GC content: 61%
IMG OID: 642612971
Product: peptidase U32
Protein accession: YP_001877530
Protein GI: 187735418
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
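
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1094295
end_bp = 1096802
gene_length_bp = 2508
protein_length_aa = 835

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = span_bp // 3
coding_residues = codon_count - 1

print(span_bp, span_bp % 3, coding_residues)
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and leaves a residue count equal to the listed protein length.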


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTTCCG ACGACATAAC CCTTTCCGCC CTCCGGCCCG GAGACCTGCG CCCCGAACTG 
CTGGCCCCCG CCGGGGACAT GGACTGCGCC CGCGCCGCCG TAGCCAATGG CGCAGACGCC
ATTTATTTCG GCCTGGACCG TTTCAACGCC CGTCTGCGCG CAGACAACTT CACGCTGGAT
TCCCTTCCGG AACTGATGCG TTTCCTTCAC GCACACGGCG TGAAGGGGTA CGTGACCATG
AATACGCTCA TTTTTACCTC CGAACTGGCG GACGCCCTCT CCTACCTGGG CCACCTGAAT
GCCGCCGGCG CGGACGGGGT CATCGTGCAG GACATCGGTC TTGCCCACTG CCTGACGGAA
TGGGGAAGGC AGGTTCCCGG CATGAAGCTG GAACTGCACG CCTCCACCCA GATGACACTT
TCCTCCCCGG ACGGGTTGGA ATTCGCCTCC GGATTCCTGG ACCTGAAGCA GGCCGTCCTG
GCCCGGGAAC TGAGCCTGAA GGAAATCGGG CAATGTGTAC GGCATACGGA CATTCCCCTG
GAAGTATTTG TACACGGAGC GCTCTGCGTG GCCTATTCCG GCCAGTGCCT TACATCGGAA
AGCCTGGGAC AGCGCAGCGC CAACAGGGGA GAATGCGCGC AGGCCTGCCG CCTGCCCTAC
TCCCTGGTTG TAGACGGCAA AACCGTTCCT CTGGGAGAAC GGCGCTACCT GCTCAGCCCG
CAGGACCTGT GCGCCCTTGA CCGCATCCCG GACCTGGTCC GCATGGGCGT GAAAAGCTAC
AAGATAGAGG GACGTCTGAA AAGCCCGGAA TACGTTGCGG CGGTAACGGC GGCCTATAGA
AAAGCTCTGG ACGCCGCCTG CGCTGGAATA CTTGTGGATG AAATGGTGAC CGCACGCGAC
CGTTATGCAC TGCAAATGGT ATTTTCACGC GGCTTCTCCA CCGGCTGGCT GGACGGGACG
GACCACCCGC GTCTGACCCA CGGCCGCTAC GGTAAAAAGA GGGGCGCCTA CGCAGGCGTC
ATTATGAACA GCGGTCAGGG CTGGCTGGAT ATCAGGCCGG AAGGGGGCAT TCCGCTCGCG
CCCGGGGACG GCTTCGTCAT TGATGCAGGG GAAGACCGCA ATGAAGAACA GGGCGGACGT
ATCTGGAAAG TGCAGAGAAA CCGCCTTTTC TTCCACGGAA AGGCATCCCG TATTGACTGG
AGCAGGGTTA AACCCGGGCA AAAACTCTGG AAAACGGATG ATCCGGCCCT GAACGCGGAG
TTGAAAAAAA TGCGGGAACA CTTGCAGGAA TCCACCATTC CCCTCCATCT GACATGTGCC
GGGTCCGTCG GGAACCCCCT GACGATTGCC TGTCCGGAAT ACGGATGTTC CGTGCAATCT
CCCCAGCCCC TTCAGACGGC AGAAAAACGC CCTCTGACTC CCGAAGTGCT GAAACAACAG
CTCGGCAGGC TGGGGGGAAC GGGATTCCGC CTGGCCTCCT GCGACTGCCC TCTGCCGGAG
GGCCTGATGC TTCCGCTCAG CATCCTCAAC CAGACAAGGC GTGCCCTGGT GGAACGAATT
CAAACGGCCC GGGAGGCAAA AGAATCCTCC GTTCCCTCTC ATCCTCCGGC CTCCTTCACC
CTTCCTCCGC TGCCGAAGGG AACAGCCGTT CCGGACATGC CTCCCCATTT GTCTGTTCTG
TGCCGCAAAA CGGAGCAAAT ACCCGCCGCG CTGGATGCCG GAGCGAATGC CGTTTATCTG
GACTTTGAAG ACCTCCGGGA CTATGCGGAA GGCATAAAAA CCGTCCGGGA ACACGGAAAA
AGCATTCCCG TCTTCCTGGC GACGCCCCGC ATCCAGAAAC CCTCGGAAGT CGGATACTTC
AAACTTATGG AACGGGCGGA ACCTGACGGC ATCCTTATTC GCAACCTGGG TGCGGCGCAA
TATTTCCGCC AGTCCCTCCT GCATCGCATC GGAGATTTTT CCCTGAACGT CGCCAATCCA
TACAGCGCGG CTATCCTGAA GAAACAGGGG AATCTGGCCC ATCTCACCAT CTCTTACGAC
CTGAACGCGG AACAGGTGGC GGACCTGCTC CGCGCCGCGC CGCCGGAATG GTTTGAACTG
ACGCTGCACC AGCACATGCC CATGTTTCAT ATGGAGCATT GCGTGTTCTG CACCTTCCTG
TCGAACGGCA CCAGTTATAA GAACTGCGGC CGCCCCTGCG AACGGCATCA TGTGCAGCTG
CGTGACCGCG TAGGACAGCT TCATCCCCTG CTGGCGGACG CCGGGTGCCG GAACACGCTG
TTCAACGGAC GCGCCCAGAC GGGAGCCGGC TTCCTCCAGG ACTTCCGCCG TCTGGGCCTT
TCCCGTTTCC GGCTGGAACT GCTGGATGAT CCCCCGGAAA AAGTGCGCCT TCTGGTGTCC
CGCTACCGGG AGCTGCTGGA CGGTTCCTGC ACTGCGGCCC GGCTTATCCG GGAATTGGAC
GTCGCAGAAC AGCTCGGCAC TACGGAGGGG ACGCTCCGGC CGGGATAA
 
Protein sequence
MISDDITLSA LRPGDLRPEL LAPAGDMDCA RAAVANGADA IYFGLDRFNA RLRADNFTLD 
SLPELMRFLH AHGVKGYVTM NTLIFTSELA DALSYLGHLN AAGADGVIVQ DIGLAHCLTE
WGRQVPGMKL ELHASTQMTL SSPDGLEFAS GFLDLKQAVL ARELSLKEIG QCVRHTDIPL
EVFVHGALCV AYSGQCLTSE SLGQRSANRG ECAQACRLPY SLVVDGKTVP LGERRYLLSP
QDLCALDRIP DLVRMGVKSY KIEGRLKSPE YVAAVTAAYR KALDAACAGI LVDEMVTARD
RYALQMVFSR GFSTGWLDGT DHPRLTHGRY GKKRGAYAGV IMNSGQGWLD IRPEGGIPLA
PGDGFVIDAG EDRNEEQGGR IWKVQRNRLF FHGKASRIDW SRVKPGQKLW KTDDPALNAE
LKKMREHLQE STIPLHLTCA GSVGNPLTIA CPEYGCSVQS PQPLQTAEKR PLTPEVLKQQ
LGRLGGTGFR LASCDCPLPE GLMLPLSILN QTRRALVERI QTAREAKESS VPSHPPASFT
LPPLPKGTAV PDMPPHLSVL CRKTEQIPAA LDAGANAVYL DFEDLRDYAE GIKTVREHGK
SIPVFLATPR IQKPSEVGYF KLMERAEPDG ILIRNLGAAQ YFRQSLLHRI GDFSLNVANP
YSAAILKKQG NLAHLTISYD LNAEQVADLL RAAPPEWFEL TLHQHMPMFH MEHCVFCTFL
SNGTSYKNCG RPCERHHVQL RDRVGQLHPL LADAGCRNTL FNGRAQTGAG FLQDFRRLGL
SRFRLELLDD PPEKVRLLVS RYRELLDGSC TAARLIRELD VAEQLGTTEG TLRPG
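
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record: it assumes NCBI translation table 11, whose codon-to-residue assignments match the standard code and in which an alternative start codon such as GTG is read as methionine at the initiation position. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for table 11; at the first position they code for M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3  # drop any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = cleaned[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
first_blocks = "GTGATTTCCG ACGACATAAC CCTTTCCGCC"
print(translate(first_blocks))  # prints MISDDITLSA
```

The output matches the first block of the protein sequence, which begins with M even though the gene starts with GTG rather than ATG.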