Gene Amuc_1821 details


Gene Information

Locus tag: Amuc_1821
Symbol:
ID: 6275762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2210941
End bp: 2212125
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 58%
IMG OID: 642613885
Product: putative SAM-dependent methyltransferase
Protein accession: YP_001878420
Protein GI: 187736308
COG category: [R] General function prediction only
COG ID: [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:
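
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (the variable names are illustrative only):

```python
# Values copied from the Gene Information fields above.
start_bp = 2210941
end_bp = 2212125

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")   # 1185 bp
print(protein_length, "aa")  # 394 aa
```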


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00015725
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGTT TGATTATTTC TCCAAGAGCA CGCATTTTTC AAGGGCACGA CTGGGTTTAC 
GGCACGGAAG TGCGCAAAAT TTTCGGCAAT CCGCAGCCGG GGGACGTCGT GGCCCTGAAG
GATTTCAAGG ACCGCTTTCT GGGTTCCGCC ATGTTCAATC CCCATTCCCA GATCGTCGCC
AGGCGCTTTT CCCGCCGCAA ACAGGAACTG AACGGAGATT TCTTTTCCAG GCGCATCAGC
CAGGCAGTAG AACTGCGCCG CCGCCGGCTT CCGGAAGAAA CTCTCACCCG GCTCGTTTGG
AGCGAATCCG ACGGGCTTCC CGGCCTCATT GTGGACCGTT ACGCGGATTA TCTGGTCGTC
CAGACGCTGA CGATCGCCAT GGAATGCCGC CTCCCCATCA TCCTGAATGT TTTGGAAGAC
CTTCTTTCTC CCCGCGGGAT TATTGTCAGG AATGATTCAC CCATGCTGGC GGCAGAAGGT
ATTTCCCCCT CCGTCCGGGT GGCACGAGGA CAGCAACCGG AACCTTTTGC CGCACGCAGC
GGCAGCGTGC AATTCATGAT TGACCTTCAG ACGGGACAAA AAACCGGCCT GTATCTGGAC
CAGCTTGACA ATTATGCCGC CGTGGCTCGC TTCGCCCGCG GACGCCGCGT GCTGGACTGC
TTCTGCAACC AGGGCGGTTT CGCCCTGGCC TGCGCCCTTG CCGGTGCCTC GGAGGTAACG
GCCGTGGACG TTTCCCAGGA TGCTATGGAC GCCGTAGCGC GGAACGCCCG CCTGAACGGA
GTCTCCGTGC AGTGCGTCAC GGATAACGCG TTTGACTTCC TGAAAAAGGA AGCGGCCCTT
GTCCGGGACG GAGGAGAACA CAAATGGGAT TTAATTATCC TGGATCCGCC CTCTTTTACC
AGAAACAAAA AATCCGTGCA TGACGCCATG CGCGGATATA AGGAAATCCA CCTCCGCGCC
ATGAAGCTTC TGGCCCCGGG AGGCATCCTT TCCACCTTCT GCTGTTCCCA CCACGCCGGA
GCGGACCTGT TCCGGGAGAG CGTGCTTGAC GCCGCCATTG ATGCTCCGGC CACCCTGCGT
CTGATGCAGC AACACGGCCA AAGAGCGGAT CATCCGGTTT TATTGAATAT TCCGGAAACG
GAATACCTGA AGGGGTTCAC GTATGAACTG CTTCCCGGAA GATGA
 
Protein sequence
MAGLIISPRA RIFQGHDWVY GTEVRKIFGN PQPGDVVALK DFKDRFLGSA MFNPHSQIVA 
RRFSRRKQEL NGDFFSRRIS QAVELRRRRL PEETLTRLVW SESDGLPGLI VDRYADYLVV
QTLTIAMECR LPIILNVLED LLSPRGIIVR NDSPMLAAEG ISPSVRVARG QQPEPFAARS
GSVQFMIDLQ TGQKTGLYLD QLDNYAAVAR FARGRRVLDC FCNQGGFALA CALAGASEVT
AVDVSQDAMD AVARNARLNG VSVQCVTDNA FDFLKKEAAL VRDGGEHKWD LIILDPPSFT
RNKKSVHDAM RGYKEIHLRA MKLLAPGGIL STFCCSHHAG ADLFRESVLD AAIDAPATLR
LMQQHGQRAD HPVLLNIPET EYLKGFTYEL LPGR
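
The protein sequence can be cross-checked against the gene sequence using translation table 11, as listed in the gene information. The following is a minimal sketch rather than a reference implementation: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the codon table is built by hand on the assumption that table 11 shares its amino-acid assignments with the standard code, and input is assumed to contain only A, C, G, and T plus whitespace.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGGCAGGTT TGATTATTTC TCCAAGAGCA"
print(translate(first_blocks))  # MAGLIISPRA
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the protein sequence and the GC content field.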