Gene Amuc_1558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1558 
Symbol 
ID6273639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1870950 
End bp1872395 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content56% 
IMG OID642613618 
Productpeptidase M50 
Protein accessionYP_001878160 
Protein GI187736048 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAGCA TTCTTTCCGT CCTTGCCACT ATCGCCATTA TTTTCGGCGT GGTGATGTTG 
TTTAACTTCA TGATTTTTAT TCATGAACTT GGTCATTTTT TCGCTGCCCG CTGGCGTGGT
CTGTATGTGG ACAGATTTCA AATCTGGTTT GGCCGTCCCA TCTGGAAAAA AACGGTTAAC
GGCGTGCAGT GGGGGCTGGG GTGGATTCCG GCCGGAGGTT TTGTATCCCT GCCGCAGATG
GCTCCTATGG AGGCCATTGA AGGACGTGCG GAGCTTCCCA AGGATTTGAA GCCGGTTACT
CCCTTGGACA AGATTATCGT GGCCGCCGCT GGGCCTGCCG CTTCTTTTCT GCTGGCTGTG
TTGTTTGCCG TGGCCGTCTG GATGGTGGGC AAGCCGGATG TGGAGATGGG CGTGACTACG
GTCGGTTTTG TTGCTCCGGA CAGCCCTGCG GCTCAGGCCG GCATTCTTCC CGGCGACAAG
ATTGTGAAAG TGGACGGCCA TCCTGTGGAC AAGTGGGCGG GCAATATGGA AGGCGTGCGG
GAATTGATCA TGCTGGGCGA GCATGACCGG GTGGTTTTTA CGGTGCAGAG ACCTGGACAT
GAAGGAGAGA TGGAAATTTC CTGCGGATTC CGGATTCCGG AAACGTCCTG GTGGCAGCGC
TCCGGCATGC GTCAGGTGGG GTTGATGCAG GCCATGCCCT GCGTGATTGG GGAAGTGATT
CCCAATTCTC CTGCCGCACT GGCCGGATTG AACCCGGGCG ATAAGGTTGT GGGTGCCAAT
GGAGAACGCC TCTGGAACCC TGCCGCACTG GATGTTCTGC TGAAGAAGAA TGAACCGCTC
TTGCTGGATG TGACGGACAG GGCCGGGGTT GCAAGGCAGG TGAATATCCA GGGGAAGCTT
CCGGAGAATT GGCACAATGG TGCGGACGGT TCCCTGCTCA AGGGGGCCCA GCCTATTCTG
GGCGTGAGCT GGGACCTGAG TTCCGTGGGC CGAGACGTTA CCGTCCATCC TTCTCCGTGG
GCGCAGATCA AGCAGAGTCT GAAATGGATG GGGGATACCC TGGCGAAGGT GGTGGCTCCC
GGCAGCAGCG TGGGCGTGGA GCATCTTTCC GGACCTGTGG GAATTGCCAA TCAGTTTTAC
AAGATGTTCT CTCTGGAGGA AGGGTGGAAG CTGGCCCTGT GGTTTTCCGT GGTGTTGAAC
GTGAACCTGG CGGTTCTGAA TATTCTGCCT CTTCCGGTAG TGGATGGCGG CCATGTGGTC
ATGAATGCCA TTGAATTGGT TTTCCGGCGT CCCCTGAATG TAAAAGTTCT GGAATTCGTC
CAGTTCGGCT TTGTGTTCCT GCTGATGGGG TTCTTCCTGT TTGTGACATT CAAGGATGTG
GGGGATTTCT TTGGCAAGAA GCCGGACAAG CTGCCCACCC CGGTATTCAA GGCCGTCGCG
GATTAG
 
Protein sequence
MDSILSVLAT IAIIFGVVML FNFMIFIHEL GHFFAARWRG LYVDRFQIWF GRPIWKKTVN 
GVQWGLGWIP AGGFVSLPQM APMEAIEGRA ELPKDLKPVT PLDKIIVAAA GPAASFLLAV
LFAVAVWMVG KPDVEMGVTT VGFVAPDSPA AQAGILPGDK IVKVDGHPVD KWAGNMEGVR
ELIMLGEHDR VVFTVQRPGH EGEMEISCGF RIPETSWWQR SGMRQVGLMQ AMPCVIGEVI
PNSPAALAGL NPGDKVVGAN GERLWNPAAL DVLLKKNEPL LLDVTDRAGV ARQVNIQGKL
PENWHNGADG SLLKGAQPIL GVSWDLSSVG RDVTVHPSPW AQIKQSLKWM GDTLAKVVAP
GSSVGVEHLS GPVGIANQFY KMFSLEEGWK LALWFSVVLN VNLAVLNILP LPVVDGGHVV
MNAIELVFRR PLNVKVLEFV QFGFVFLLMG FFLFVTFKDV GDFFGKKPDK LPTPVFKAVA
D