Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1190 |
Symbol | |
ID | 6273832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1426146 |
End bp | 1427147 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613241 |
Product | Alcohol dehydrogenase zinc-binding domain protein |
Protein accession | YP_001877796 |
Protein GI | 187735684 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000452539 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGA ATCATTATGC AGAGTTTTCC GAATGCAGTA TGAAGCCCCA GGAGGTGCTG GAATATGTTT CCGGCCCCAT TCCCGTTCCG GAGGAGGGAG AAGTTCTGGT CCGGATGAAG GCGGCTCCGA TCAACCCGGC GGACATCAAT TTTGTACAGG GAGTTTATGG CCTGAAGCCC GTGCTGCCGC ACTCCCGCGC CGGCCTGGAA GGCTGCGGCG TGGTACAGGA ATCTCGCGCA GCGGGATTTC GAGAGGGAGA TGAAGTGATT CTCCTGCGCG GCGTGGGTTC CTGGAGCGAG TATGTGGCGG TTCCCTCCGT GAATGTCATG AAGCTCCCGG TGAAGGTAGA TCCCGTCCAG GCGGCCATGC TGAAGGTGAA TCCCCTGACC GCTCTGCGCA TGCTGGAAGG GTTCGTTTCC CTGGAACCGG GGGATTGGCT GGTGCAGAAT GCCGCCAATT CCGGAGTGGG AAGGTGCATT ATTCAACTGG CCCGTGAAAT GGGCGTGAAG ACAGTGAATT TTGTGAGAAG GCCGGATGAA TTGAGGGATG AATTGACTGC GCTGGGCGCC GATCTGGTGG TGGGAGAGGA TGACGGGGAT GTGGTGAAGA ATACGCTGGC CCGCCTGGAT GGAAAGAGGC CTGTGCTGGC TTCCAATGCC GTGGGCGGGG AAAGCGCCCT GCGCCTGATG GATATGCTGG CTCCCGGTGG AAGCATGGTG ACGTACGGAG CCATGAGCCG GAAGAGCATC AAGGTGCCGA ACGGTTTTCT GATTTTCAAG GGTATTAAAC TGGAGGGCCT GTGGGTGACG CAGTGGCTTA AGAATGCCCC TGTTTCAGAG ATTGAGGCCG CCTATGAGAA ACTGGCGCGC CTGATGGCGG ACGGCAGGTT GAAGCAGGCT GTGGATACCG TTTATCCGCT AAGCGATGTG CGGAAGGCTG TGGAGAAGGC GCAGGAGGAG TTCCGCAGCG GCAAGGTGGT GCTTAGCATG GATTGCGCCT GA
|
Protein sequence | MSENHYAEFS ECSMKPQEVL EYVSGPIPVP EEGEVLVRMK AAPINPADIN FVQGVYGLKP VLPHSRAGLE GCGVVQESRA AGFREGDEVI LLRGVGSWSE YVAVPSVNVM KLPVKVDPVQ AAMLKVNPLT ALRMLEGFVS LEPGDWLVQN AANSGVGRCI IQLAREMGVK TVNFVRRPDE LRDELTALGA DLVVGEDDGD VVKNTLARLD GKRPVLASNA VGGESALRLM DMLAPGGSMV TYGAMSRKSI KVPNGFLIFK GIKLEGLWVT QWLKNAPVSE IEAAYEKLAR LMADGRLKQA VDTVYPLSDV RKAVEKAQEE FRSGKVVLSM DCA
|
| |