Gene Amuc_1190 details


Gene Information

Locus tag: Amuc_1190
Symbol:
ID: 6273832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1426146
End bp: 1427147
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 58%
IMG OID: 642613241
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_001877796
Protein GI: 187735684
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
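
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1426146
end_bp = 1427147

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1002
print(protein_length)  # 333
```

Under these assumptions the results agree with the listed Gene Length (1002 bp) and Protein Length (333 aa).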


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000452539
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGA ATCATTATGC AGAGTTTTCC GAATGCAGTA TGAAGCCCCA GGAGGTGCTG 
GAATATGTTT CCGGCCCCAT TCCCGTTCCG GAGGAGGGAG AAGTTCTGGT CCGGATGAAG
GCGGCTCCGA TCAACCCGGC GGACATCAAT TTTGTACAGG GAGTTTATGG CCTGAAGCCC
GTGCTGCCGC ACTCCCGCGC CGGCCTGGAA GGCTGCGGCG TGGTACAGGA ATCTCGCGCA
GCGGGATTTC GAGAGGGAGA TGAAGTGATT CTCCTGCGCG GCGTGGGTTC CTGGAGCGAG
TATGTGGCGG TTCCCTCCGT GAATGTCATG AAGCTCCCGG TGAAGGTAGA TCCCGTCCAG
GCGGCCATGC TGAAGGTGAA TCCCCTGACC GCTCTGCGCA TGCTGGAAGG GTTCGTTTCC
CTGGAACCGG GGGATTGGCT GGTGCAGAAT GCCGCCAATT CCGGAGTGGG AAGGTGCATT
ATTCAACTGG CCCGTGAAAT GGGCGTGAAG ACAGTGAATT TTGTGAGAAG GCCGGATGAA
TTGAGGGATG AATTGACTGC GCTGGGCGCC GATCTGGTGG TGGGAGAGGA TGACGGGGAT
GTGGTGAAGA ATACGCTGGC CCGCCTGGAT GGAAAGAGGC CTGTGCTGGC TTCCAATGCC
GTGGGCGGGG AAAGCGCCCT GCGCCTGATG GATATGCTGG CTCCCGGTGG AAGCATGGTG
ACGTACGGAG CCATGAGCCG GAAGAGCATC AAGGTGCCGA ACGGTTTTCT GATTTTCAAG
GGTATTAAAC TGGAGGGCCT GTGGGTGACG CAGTGGCTTA AGAATGCCCC TGTTTCAGAG
ATTGAGGCCG CCTATGAGAA ACTGGCGCGC CTGATGGCGG ACGGCAGGTT GAAGCAGGCT
GTGGATACCG TTTATCCGCT AAGCGATGTG CGGAAGGCTG TGGAGAAGGC GCAGGAGGAG
TTCCGCAGCG GCAAGGTGGT GCTTAGCATG GATTGCGCCT GA
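
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that step; `join_blocks` and `gc_percent` are hypothetical helper names, and only the first and last listing lines are used as a demonstration.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence listing, copied from above.
first_line = "ATGAGTGAGA ATCATTATGC AGAGTTTTCC GAATGCAGTA TGAAGCCCCA GGAGGTGCTG"
last_line = "TTCCGCAGCG GCAAGGTGGT GCTTAGCATG GATTGCGCCT GA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 42 TGA
```

Passing the complete listing through `join_blocks` should yield a string whose length can be compared with the Gene Length field, and `gc_percent` on that string can be compared with the GC content field.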
 
Protein sequence
MSENHYAEFS ECSMKPQEVL EYVSGPIPVP EEGEVLVRMK AAPINPADIN FVQGVYGLKP 
VLPHSRAGLE GCGVVQESRA AGFREGDEVI LLRGVGSWSE YVAVPSVNVM KLPVKVDPVQ
AAMLKVNPLT ALRMLEGFVS LEPGDWLVQN AANSGVGRCI IQLAREMGVK TVNFVRRPDE
LRDELTALGA DLVVGEDDGD VVKNTLARLD GKRPVLASNA VGGESALRLM DMLAPGGSMV
TYGAMSRKSI KVPNGFLIFK GIKLEGLWVT QWLKNAPVSE IEAAYEKLAR LMADGRLKQA
VDTVYPLSDV RKAVEKAQEE FRSGKVVLSM DCA