Gene Amuc_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1049 
Symbol 
ID6274061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1251501 
End bp1252583 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content59% 
IMG OID642613100 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_001877656 
Protein GI187735544 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00005889 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAAT CCCTTACCGT TCTGGGAATA GAATCCTCCT GTGATGAAAC GGCAGTCGCC 
ATCCTGCGTT CTGCCGGAGA GGAAAAAGCT CCGGAAATAC TCTCCTCCGT CATCTCCTCC
CAAATTGCCA TTCACCGCCA GCACGGCGGC GTAGTGCCGG AACTGGCTTC CCGCAACCAT
TCAGCGGATC TTCCCGGAAT CATCCGAACC GCGTGCCGCG AAGCCGGAAC AGCTCCTGCG
GACATTGACG TCTTCGGCGC TACGGGAGGC CCCGGCCTGG TAGCTGCACT TCTGGTAGGC
AACAGCACGG CCAAGGCTCT GGCTCTGGCA GCGGGCAGGC CCTTCGTCTC CGTCAATCAT
CTGGAAGGCC ATCTGCTTTC CCCCTTCCTC AAACGCCCCG GCGGTCCCGT TCCCCATCTG
GGCATGGTCG TTTCCGGAGG CCACACCCTT TTTGTGGATG TGCGCGGCGT AGGGAACTAC
CGCCTGCTGG GCCGCTCTCT GGACGACGCA GCAGGGGAAG CCTTTGACAA GGTAGGCAAA
ATGCTAGGCC TTCCCTATCC CGGAGGGCCG GAAATCGACC GCTTGGCGGC GGAAGGCGAC
CCGGAAGCCT TTTCTTTCCC CCGGGCCCTG ATGAAAGAGC ATACAGCCAA CGTATCTTTC
TCCGGCCTGA AAACGGCCGT TCTCTATACA CTGCCCAAAA TTACGAAAAA CGGCGATCCT
CACGGCCTGC CCCGGCAAAC TCTGCGCGAC CTCTGCGCTT CTTTCCAGCG GGCCGTGACG
GACGTCCTGA TTCACAAGGC GCTGAAGGCC TTGCGCGCCT CCGGTCACCG CACCCTTTCC
ATCTCCGGGG GCGTCTCCTG CAACAGGGAG CTGCGTTCCC GCCTGAAAAC CGCCTGTGAC
CGTGAAAAAG TGAAACTGGT TCTCCCGGAC TTCGACCTGA CGACGGATAA TGCCGCCATG
ATCGCTTATG TCACCTGTCT CAAAGCCCGA AGAGGACTGT TCCATTCTCT GGATGAAGAC
GTTGACCCCA ATCTTAAATT GACGGAGGAT TTAAACAGAT CCAAACATTC AACACATTCC
TGA
 
Protein sequence
MPESLTVLGI ESSCDETAVA ILRSAGEEKA PEILSSVISS QIAIHRQHGG VVPELASRNH 
SADLPGIIRT ACREAGTAPA DIDVFGATGG PGLVAALLVG NSTAKALALA AGRPFVSVNH
LEGHLLSPFL KRPGGPVPHL GMVVSGGHTL FVDVRGVGNY RLLGRSLDDA AGEAFDKVGK
MLGLPYPGGP EIDRLAAEGD PEAFSFPRAL MKEHTANVSF SGLKTAVLYT LPKITKNGDP
HGLPRQTLRD LCASFQRAVT DVLIHKALKA LRASGHRTLS ISGGVSCNRE LRSRLKTACD
REKVKLVLPD FDLTTDNAAM IAYVTCLKAR RGLFHSLDED VDPNLKLTED LNRSKHSTHS