Gene Information |
Locus tag | Amuc_1049 |
Symbol | |
ID | 6274061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1251501 |
End bp | 1252583 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642613100 |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_001877656 |
Protein GI | 187735544 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
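
The length fields in this record follow from its coordinates. The sketch below is a quick consistency check, assuming that Start bp and End bp are 1-based and inclusive and that the unspliced CDS ends in a single untranslated stop codon. These assumptions are not stated by the record itself, but they match its numbers. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1251501
end_bp = 1252583

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1083
print(protein_length)  # 360
```

Both results agree with the Gene Length (1083 bp) and Protein Length (360 aa) rows.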
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00005889 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAAT CCCTTACCGT TCTGGGAATA GAATCCTCCT GTGATGAAAC GGCAGTCGCC ATCCTGCGTT CTGCCGGAGA GGAAAAAGCT CCGGAAATAC TCTCCTCCGT CATCTCCTCC CAAATTGCCA TTCACCGCCA GCACGGCGGC GTAGTGCCGG AACTGGCTTC CCGCAACCAT TCAGCGGATC TTCCCGGAAT CATCCGAACC GCGTGCCGCG AAGCCGGAAC AGCTCCTGCG GACATTGACG TCTTCGGCGC TACGGGAGGC CCCGGCCTGG TAGCTGCACT TCTGGTAGGC AACAGCACGG CCAAGGCTCT GGCTCTGGCA GCGGGCAGGC CCTTCGTCTC CGTCAATCAT CTGGAAGGCC ATCTGCTTTC CCCCTTCCTC AAACGCCCCG GCGGTCCCGT TCCCCATCTG GGCATGGTCG TTTCCGGAGG CCACACCCTT TTTGTGGATG TGCGCGGCGT AGGGAACTAC CGCCTGCTGG GCCGCTCTCT GGACGACGCA GCAGGGGAAG CCTTTGACAA GGTAGGCAAA ATGCTAGGCC TTCCCTATCC CGGAGGGCCG GAAATCGACC GCTTGGCGGC GGAAGGCGAC CCGGAAGCCT TTTCTTTCCC CCGGGCCCTG ATGAAAGAGC ATACAGCCAA CGTATCTTTC TCCGGCCTGA AAACGGCCGT TCTCTATACA CTGCCCAAAA TTACGAAAAA CGGCGATCCT CACGGCCTGC CCCGGCAAAC TCTGCGCGAC CTCTGCGCTT CTTTCCAGCG GGCCGTGACG GACGTCCTGA TTCACAAGGC GCTGAAGGCC TTGCGCGCCT CCGGTCACCG CACCCTTTCC ATCTCCGGGG GCGTCTCCTG CAACAGGGAG CTGCGTTCCC GCCTGAAAAC CGCCTGTGAC CGTGAAAAAG TGAAACTGGT TCTCCCGGAC TTCGACCTGA CGACGGATAA TGCCGCCATG ATCGCTTATG TCACCTGTCT CAAAGCCCGA AGAGGACTGT TCCATTCTCT GGATGAAGAC GTTGACCCCA ATCTTAAATT GACGGAGGAT TTAAACAGAT CCAAACATTC AACACATTCC TGA
Protein sequence | MPESLTVLGI ESSCDETAVA ILRSAGEEKA PEILSSVISS QIAIHRQHGG VVPELASRNH SADLPGIIRT ACREAGTAPA DIDVFGATGG PGLVAALLVG NSTAKALALA AGRPFVSVNH LEGHLLSPFL KRPGGPVPHL GMVVSGGHTL FVDVRGVGNY RLLGRSLDDA AGEAFDKVGK MLGLPYPGGP EIDRLAAEGD PEAFSFPRAL MKEHTANVSF SGLKTAVLYT LPKITKNGDP HGLPRQTLRD LCASFQRAVT DVLIHKALKA LRASGHRTLS ISGGVSCNRE LRSRLKTACD REKVKLVLPD FDLTTDNAAM IAYVTCLKAR RGLFHSLDED VDPNLKLTED LNRSKHSTHS
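
The gene sequence above is listed in coding orientation (it starts with ATG and ends with TGA) even though the gene lies on the minus strand, so it can be translated directly. The following is a minimal sketch of that translation and of a GC-content calculation, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the conventional TCAG order; amino acid assignments are
# the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-orientation DNA string up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCCGGAAT CCCTTACCGT TCTGGGAATA"
print(translate(prefix))    # MPESLTVLGI
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would allow a check against the complete protein and the reported GC content.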