Gene Amuc_1818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1818 
Symbol 
ID6275773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2206455 
End bp2207756 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content61% 
IMG OID642613882 
ProductPUA domain containing protein 
Protein accessionYP_001878417 
Protein GI187736305 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.165222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.7114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATT TCCACAATTC CCGTGATTCC CGCCCCAGGC CCGTTTACCG GGACCGCCGT 
TCCCAGCCGG GCCGCCCGCG CACAGCTCCC AAAAAGAACC TTTTTGCCAA CAAGTCGCCG
GAAGGGAGCG AACAATGGCT CAAGCCCTGG GTGGAGCTGA AGTATTTCAC ATACAATCCG
GCCGTGTTCC CCCGCATGCT CGGCTCCGTA AGCGGGGAAA TAGCGCCGGG AAGCCTGGTA
AACGTCTATG ACAAGAACGG TGAACTGTTC GGCGCCGGAT TCTGGAATGA GTCCAGCCGC
ACGCCCCTGC GCATGGTGTA CCACGGGAAG GACGTTTTTG CCGAACGTGA CCTGGATGCT
GCGCTGGAAC GAGCCGTGAA GCTGCGCAGG GAAGTTCTGC GCCTGGATGA AACCACTAAT
GCGTACCGTG TGCTGCACGG AGATTCCGAC GGACTGGGCG GCCTGGTGGT GGACCGTTAC
GCGGACGTGC TGAGCCTGGA GGTGAGTACG CTGGCCGTCT GGCAGAGGCT GAACCGCTGG
CTGCCCCTGC TGCACCGCCT GTGCGGCACA AAACGCCATG TGGTGCAGGT GGATGAGGGC
ATTGCCCGCA TGGAGGGCAT CCGGGCGGAG GAGGCGCCGG CATCCCCAGC CCCCGTCAGG
CTGGTGAAGA TTGTGGAAAA CGGCATCACC TATGAAGTGG ACTTCGCCCA GGGGCACAAG
ACGGGCTTTT TCTGTGACCA GCGGGACAAC CGCCTCAAGT TCGCCTCCCT GGTGAAGGGG
GCCACGGTGC TGGACCTGTG CTGCTACAGC GGCGGCTTTT CCATTGCCGC CAAGATGCTG
GGCGGAGCGG CGGAGGTAAC GGCTGTGGAT TTGGATGAGA AAGCCGTCGC CATGGCTAAA
AGAAACGGAA ATATCAACCG TCAGCGCATC GACTTCGTGC ATGCGGATGC CTTTGTCTAT
GCGCGCCAGA TGGTGCGCAA CGGCCGCCTG TTTGACGCCG TGCTGCTGGA CCCGCCCAAA
TTCATCGTTG GCAGGGACGG ATATGAAGAG GGCATCAAAA AATACCATGA CCTCAATATG
TTGGGGCTGC AATGCGTGCG TCCCGGCGGC CTGTTTGTCA CCTGTTCCTG TTCCGGGCTG
CTTTCCCCCG CGGAGTTTGA GCATACCGTC ATCAAGGCCG CCCAGCGGCA GGGGCGCAAG
CTTCAGATTA TGGCGATGAC CGGCCCCGGT TGGGACCACC CTTTCCTGAG CACTTATCCG
GAGGGCCGTT ATCTGAAGGT TCTGTGGGCA ATTGCCCTGT AG
 
Protein sequence
MNDFHNSRDS RPRPVYRDRR SQPGRPRTAP KKNLFANKSP EGSEQWLKPW VELKYFTYNP 
AVFPRMLGSV SGEIAPGSLV NVYDKNGELF GAGFWNESSR TPLRMVYHGK DVFAERDLDA
ALERAVKLRR EVLRLDETTN AYRVLHGDSD GLGGLVVDRY ADVLSLEVST LAVWQRLNRW
LPLLHRLCGT KRHVVQVDEG IARMEGIRAE EAPASPAPVR LVKIVENGIT YEVDFAQGHK
TGFFCDQRDN RLKFASLVKG ATVLDLCCYS GGFSIAAKML GGAAEVTAVD LDEKAVAMAK
RNGNINRQRI DFVHADAFVY ARQMVRNGRL FDAVLLDPPK FIVGRDGYEE GIKKYHDLNM
LGLQCVRPGG LFVTCSCSGL LSPAEFEHTV IKAAQRQGRK LQIMAMTGPG WDHPFLSTYP
EGRYLKVLWA IAL