Gene Amuc_1160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1160 
Symbol 
ID6273855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1391587 
End bp1392735 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content58% 
IMG OID642613211 
Productaldo/keto reductase 
Protein accessionYP_001877766 
Protein GI187735654 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.0226313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGT ATAAACACTA TTCTGACGGC CGCCGCCGCT TTCTCAAAAT GTCCGCCTTG 
TTCGGTACGT CTGCGCTGAT TTCTCCGGCC ATTGCCCGGG CAGAGGGCGG GGGTGCCGAT
TCTCCGGAAA ACGGCGGCTT TGCCGCCTCC CGGAGCCGGA CGCTTGGCAA GGGCTCCTGT
GCCTTGGAAG CTTCTCCCCT TGGCTTCGGA GTCATGGGAA TGACCTACAA CCGCAGCCAG
TCTCCCTCGC GTGAGCAGTG CATCCGGCTT CTTCATGAGG CCGTGGAGCG CGGAGTGACT
CTTTTTGATA CGGCCATCAT CTACGGCCCC CTGAGCAATG AACTTCTGGC CGGGGAAGCT
CTTTCCCCCT TCAGGGGGAA GATCAGCGTC ACTACCAAGT TCGGGCACGA AGTTATTAAC
GGAAAGGGGA CGGGCCGCCA GGACAGCCGC CCGGAAACCA TCCGGCGCTA CTGCGACGAG
TCGCTGCGCC GTTTGAAGGT GGACGCTATT GAATTATTCT ACCAGCACCG CTTTGATCCC
AGGATTCCCG TTGAAGATGT AGCGGGAACC ATCTCCGAAC TGGTCAAGGA AGGCAAGGTG
CGGCGCTGGG GCATGTGCGA AGTCACTCCC GGTACAATTC GCAGGGCCCA CGCCGTCCAT
CCCCTGACAG CCATTCAGAG TGAGTACCAT CTCATGCACC GGGATGTTGA AAACAATGGC
GTTCTGGATG TCTGCCGTGA ACTGGGTATA GGCTTTGTCC CCTACAGCCC GCTCAACAGG
GGATTCCTGG GGGGCTGCAT CAATGAATAC ACGCAATTTG ACCCGAACAA CGACAACCGC
CAGACCCTGC CGCGTTTCAC GCCGGAGGCC ATGAGAGCCA ATATGCGTAT TGTCAATATT
CTGCAACAGT TTGGGAGGAC GCGCGGCATG ACTTCCTCCC AGGTTGCCCT GGGATGGCTG
CTGCAAAAGG CTCCTTATAT CGTACCCATT CCCGGCACCA CTAAACTGTC CCATCTGGAA
GAAAACCTGC ACACGCTTGA TTTTACCTGC TCTCCGCAGG AGTGGGCGGA ACTGGAGAAC
GCCGTAGCCG CCACACCCGT GACCGGAGCA CGCTACAACG CGGAACAGCA AAGGCAAGTA
GGGCATTGA
 
Protein sequence
MSMYKHYSDG RRRFLKMSAL FGTSALISPA IARAEGGGAD SPENGGFAAS RSRTLGKGSC 
ALEASPLGFG VMGMTYNRSQ SPSREQCIRL LHEAVERGVT LFDTAIIYGP LSNELLAGEA
LSPFRGKISV TTKFGHEVIN GKGTGRQDSR PETIRRYCDE SLRRLKVDAI ELFYQHRFDP
RIPVEDVAGT ISELVKEGKV RRWGMCEVTP GTIRRAHAVH PLTAIQSEYH LMHRDVENNG
VLDVCRELGI GFVPYSPLNR GFLGGCINEY TQFDPNNDNR QTLPRFTPEA MRANMRIVNI
LQQFGRTRGM TSSQVALGWL LQKAPYIVPI PGTTKLSHLE ENLHTLDFTC SPQEWAELEN
AVAATPVTGA RYNAEQQRQV GH