Gene Amuc_1911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1911 
Symbol 
ID6275390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2317939 
End bp2319099 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content56% 
IMG OID642613972 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_001878506 
Protein GI187736394 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.0885736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCAGC CATTTCAATT TTTCATGCCC GCGCAAATCT TTTTTGGCGC GGGTTCTTTG 
GACAATCTTG GTTCCGCTCC CCTGCCCGGC ACCAAGGCCC TGATCGTCAT CGGCGGGTCG
TCCGTCAAAC GCCTCGGGTA TCTGGACCGC GTACAGGCTC TTCTGAAAAA ACAGGGAGTG
GAAAGCGTTG TTTTCGATAA AGTGCAGCCC AACCCCGTGG TGGAGCACGT AATGGAAGCC
TCCTCCCTGG CCAGGGAAAC GGGCTGTGAT TTCGTCATCG GCCTGGGCGG GGGCAGCAGC
ATGGATTCCG CCAAGAGCAT CGCCGTGATG GCGGCCAATC CAGGAACCTA CTGGGATTAC
ATCCAGGGAG GTTCCGGCAA GGGGCTTCCC ATTCCCTGCA AACCTCTTCC CATCGTCTGC
ATCACCACTA CGGCGGGAAC CGGAACGGAG GCGGATCCGT GGACCGTCAT CACGAAAGAG
GACACGCAGG AGAAGATCGG TTTCGGGTTC AAGGGTACTT TCCCCACCAT GTCTATCGTA
GATCCGGAGT TGATGCTTTC CGTACCTCCC AAATTAACGG CATACCAGGG GTTTGACGCT
TTGTTCCATG CCGTGGAGGG ATATATGGCT ACAATCGCCT CCCCCATGGG GGACATGTTC
GCGCTCCAGG CTATTGAATA CATTGCCAAA TATCTTCCGC GCGCCGTAAA TAACGGGGAT
GATCTGGAAG CGCGCGCCTA TGTGGCGCTG GCCAATACCT ATTCCGGGTT TGTGGAAACC
ATTTCCTGCT GTACGTCGGA ACATTCCATT GAACATGCCC TCAGCGCCTT CCATCCTTCC
CTGCCCCATG GCGCGGGGCT AATTATGATT TCCTGGGCCT ACCATGAAGC CTATGCTCCC
TCCTGCCCGG AACGTTACGC AAGAGTTGCC GCAGCCATGG GACAGGAAGC CTCCGTGGAC
GGTTTCCTGA ACGGCTTGAA CAGCCTGAAG GAAGCCTGCG GCGTAGACAA GCTGAAGATG
TCCGAATTCG GCATTACACC GGATTTATTT GACGAATACG CCAAAACGGC TTTTTCCACC
ATGGGCAATC TGTTTGAGCT GGACCGTTGC AAGTTGACTC CGGCGGACGT GGTCAGCATC
CTGGAGAAAT CCTATTCCTA G
 
Protein sequence
MYQPFQFFMP AQIFFGAGSL DNLGSAPLPG TKALIVIGGS SVKRLGYLDR VQALLKKQGV 
ESVVFDKVQP NPVVEHVMEA SSLARETGCD FVIGLGGGSS MDSAKSIAVM AANPGTYWDY
IQGGSGKGLP IPCKPLPIVC ITTTAGTGTE ADPWTVITKE DTQEKIGFGF KGTFPTMSIV
DPELMLSVPP KLTAYQGFDA LFHAVEGYMA TIASPMGDMF ALQAIEYIAK YLPRAVNNGD
DLEARAYVAL ANTYSGFVET ISCCTSEHSI EHALSAFHPS LPHGAGLIMI SWAYHEAYAP
SCPERYARVA AAMGQEASVD GFLNGLNSLK EACGVDKLKM SEFGITPDLF DEYAKTAFST
MGNLFELDRC KLTPADVVSI LEKSYS