Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1801 |
Symbol | |
ID | 6274685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2188242 |
End bp | 2189321 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642613865 |
Product | peptidase S15 |
Protein accession | YP_001878400 |
Protein GI | 187736288 |
COG category | [R] General function prediction only |
COG ID | [COG1073] Hydrolases of the alpha/beta superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0921654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.010289 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGAT ACTCATTTAA AATATTTTGC GTGGCTTTTT CCGCCTGCTT TTGCTGTGCG GGGAGCCTGT TGGCAGAAGA TAATAACCCC CATACGGACA AGATGATGAA CGAGAAGCTG AATTTAACGC AGGAATGGGA CAAAGTGTTT CCTAAAAGCG ACAAGGTAAC CCACCGCAAG GTAAGTTTCC GCAACCGTTA TGGCATTATG CTGGCTGCGG ATTTGTATAT GCCCCGGAAT GTTAACGGGA AATTGCCGGC TATTGCCGTT TCCGGCCCTT TCGGGGCTGT AAAGGAACAA TCTGCGGGCC TTTATGCCCA GACGATGGCC GAACGAGGTT TTCTGACGAT CGCGTTTGAT CCCTCGTATA CGGGAGAAAG CGGAGGATTT CCCCGCTATG TCGCGTCTCC GGATATCAAT ACGGAGGATT TCTGCGCCGC CGTCGATTAC CTTTCCACCC GAGATGACGT GGATTCGGAA CGTATTGGAA TCATTGGCAT TTGCGGCTGG GGCGGCATGG CGGTCAATGC GGCGGCTATC GACACCCGCA TCAAGGCAAC GGTAACCTCC ACGATGTACG ATATGAGCCG CGTGAACGCG AACGGCTATT TCGACGCGAT GGATGCCGAC GCCCGTTATG AGCTTCGCAA ACAACTGAAT GCCCAGCGGA CGGCTGATGC GAAGAGCGGT TCTTATGCCC TCGCGGGGGG CGTGCCTGAT CCTCTGCCTG CGGATGCCCC CGGATTTGTG AAGGATTATT ACGATTATTA TAAGACGCCC CGCGGCTATC ACAGGCGTTC GCTCAATTCA AATGGCGGAT GGAATGTCAC TTCGGCGCTT TCCTTCATCA ATATGCCCCT GCTGGCGTAC AGCGGTGAAA TCCGCAGCGC CGTGCTCATG ATTCACGGGG AAAAAGCCCA TTCGCGCTAT TTCAGCGAAG ACGCCTTCAG GAAGTTGAAG GGGGACAATA AGGAGCTGAT GATTATTCCC GGTGCAAGCC ATGTGGATCT TTATGACAAT CAAGCCGGTG TCATTCCTTT CGACAGAATC GGACAGTTCT TTCTGGAGCA TCTGAAATAA
|
Protein sequence | MTRYSFKIFC VAFSACFCCA GSLLAEDNNP HTDKMMNEKL NLTQEWDKVF PKSDKVTHRK VSFRNRYGIM LAADLYMPRN VNGKLPAIAV SGPFGAVKEQ SAGLYAQTMA ERGFLTIAFD PSYTGESGGF PRYVASPDIN TEDFCAAVDY LSTRDDVDSE RIGIIGICGW GGMAVNAAAI DTRIKATVTS TMYDMSRVNA NGYFDAMDAD ARYELRKQLN AQRTADAKSG SYALAGGVPD PLPADAPGFV KDYYDYYKTP RGYHRRSLNS NGGWNVTSAL SFINMPLLAY SGEIRSAVLM IHGEKAHSRY FSEDAFRKLK GDNKELMIIP GASHVDLYDN QAGVIPFDRI GQFFLEHLK
|
| |