Gene Amuc_1801 details

Gene Information

Locus tag: Amuc_1801
Symbol:
ID: 6274685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2188242
End bp: 2189321
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 53%
IMG OID: 642613865
Product: peptidase S15
Protein accession: YP_001878400
Protein GI: 187736288
COG category: [R] General function prediction only
COG ID: [COG1073] Hydrolases of the alpha/beta superfamily
TIGRFAM ID:
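
The Start bp, End bp, Gene Length and Protein Length values agree with one another if the coordinates are read as 1-based inclusive positions and the stop codon is counted in the gene length but not in the protein length. Both readings are assumptions about this record's conventions, not something the page states. A minimal Python sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the Gene Information fields.
start_bp = 2188242
end_bp = 2189321

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")    # 1080 bp
print(protein_length, "aa")  # 359 aa
```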


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0921654
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.010289
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACGAT ACTCATTTAA AATATTTTGC GTGGCTTTTT CCGCCTGCTT TTGCTGTGCG 
GGGAGCCTGT TGGCAGAAGA TAATAACCCC CATACGGACA AGATGATGAA CGAGAAGCTG
AATTTAACGC AGGAATGGGA CAAAGTGTTT CCTAAAAGCG ACAAGGTAAC CCACCGCAAG
GTAAGTTTCC GCAACCGTTA TGGCATTATG CTGGCTGCGG ATTTGTATAT GCCCCGGAAT
GTTAACGGGA AATTGCCGGC TATTGCCGTT TCCGGCCCTT TCGGGGCTGT AAAGGAACAA
TCTGCGGGCC TTTATGCCCA GACGATGGCC GAACGAGGTT TTCTGACGAT CGCGTTTGAT
CCCTCGTATA CGGGAGAAAG CGGAGGATTT CCCCGCTATG TCGCGTCTCC GGATATCAAT
ACGGAGGATT TCTGCGCCGC CGTCGATTAC CTTTCCACCC GAGATGACGT GGATTCGGAA
CGTATTGGAA TCATTGGCAT TTGCGGCTGG GGCGGCATGG CGGTCAATGC GGCGGCTATC
GACACCCGCA TCAAGGCAAC GGTAACCTCC ACGATGTACG ATATGAGCCG CGTGAACGCG
AACGGCTATT TCGACGCGAT GGATGCCGAC GCCCGTTATG AGCTTCGCAA ACAACTGAAT
GCCCAGCGGA CGGCTGATGC GAAGAGCGGT TCTTATGCCC TCGCGGGGGG CGTGCCTGAT
CCTCTGCCTG CGGATGCCCC CGGATTTGTG AAGGATTATT ACGATTATTA TAAGACGCCC
CGCGGCTATC ACAGGCGTTC GCTCAATTCA AATGGCGGAT GGAATGTCAC TTCGGCGCTT
TCCTTCATCA ATATGCCCCT GCTGGCGTAC AGCGGTGAAA TCCGCAGCGC CGTGCTCATG
ATTCACGGGG AAAAAGCCCA TTCGCGCTAT TTCAGCGAAG ACGCCTTCAG GAAGTTGAAG
GGGGACAATA AGGAGCTGAT GATTATTCCC GGTGCAAGCC ATGTGGATCT TTATGACAAT
CAAGCCGGTG TCATTCCTTT CGACAGAATC GGACAGTTCT TTCTGGAGCA TCTGAAATAA
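
The gene sequence is displayed in space-separated blocks of 10 bases, so the layout has to be stripped before the sequence can be measured. The sketch below shows one way to do that and to compute a GC percentage that could be compared with the GC content field. The function names are hypothetical, and how the record rounds its reported value is not known. Only the first displayed line is used here; the full sequence would be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence exactly as displayed in the record.
first_line = "ATGACACGAT ACTCATTTAA AATATTTTGC GTGGCTTTTT CCGCCTGCTT TTGCTGTGCG"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```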
 
Protein sequence
MTRYSFKIFC VAFSACFCCA GSLLAEDNNP HTDKMMNEKL NLTQEWDKVF PKSDKVTHRK 
VSFRNRYGIM LAADLYMPRN VNGKLPAIAV SGPFGAVKEQ SAGLYAQTMA ERGFLTIAFD
PSYTGESGGF PRYVASPDIN TEDFCAAVDY LSTRDDVDSE RIGIIGICGW GGMAVNAAAI
DTRIKATVTS TMYDMSRVNA NGYFDAMDAD ARYELRKQLN AQRTADAKSG SYALAGGVPD
PLPADAPGFV KDYYDYYKTP RGYHRRSLNS NGGWNVTSAL SFINMPLLAY SGEIRSAVLM
IHGEKAHSRY FSEDAFRKLK GDNKELMIIP GASHVDLYDN QAGVIPFDRI GQFFLEHLK
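
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds a codon table from the compact TCAG-ordered amino acid string; it assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons. The names `CODON_TABLE` and `translate` are illustrative. Only the first and last stretches of the gene are translated here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: identical codon assignments for NCBI tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 9 bases of the gene sequence in the record.
gene_start = "ATGACACGAT ACTCATTTAA AATATTTTGC GTGGCTTTTT CCGCCTGCTT TTGCTGTGCG"
gene_end = "CTGAAATAA"

print(translate(gene_start))  # MTRYSFKIFCVAFSACFCCA
print(translate(gene_end))    # LK
```

Both outputs match the start and end of the protein sequence shown above.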