Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1802 |
Symbol | |
ID | 6274556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2189364 |
End bp | 2190482 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613866 |
Product | aldo/keto reductase |
Protein accession | YP_001878401 |
Protein GI | 187736289 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.246141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.022853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCA AGGATTTTTT GAAGATAACA TCCGGTTTGG CCTTATCCCT GGTTTCCCGG GGATGGGCCG GGGTGGGTTC TTCTTTGCTG TCTGACGGTC CGGGGTCTTC TTCCGGGGTG CCGAAGAAGG GTTTTCTTGG GGAATCCCGG CGCCTTGGAG GCCTGGAAGT TTCCTCTATC GGGCTGGGAT GCCTGCCGAT GGTGGGTTAT TACGGCGGCA AGTATGATAA ACAGGAGATG ATTGCCCTGA TACGCCGGGC TTTTGACAAA GGAGTTACTT TTTTTGATAC GGCGGAAGTG TACGGGCCTT ATACCAGTGA GGAATGGGTG GGGGAGGCTC TCGCCCCTGT CCGCAACCAG GTCAGGATAG GAACCAAATT CGGTTTTGGC GTGGAGGAAG GCCGTCCTTC TTCCCTGAAC AGCAGGCCCG ACCATATCCG GCGTGCGGTA GAAGGTTCCC TCAGGCGTTT GCGTACCGAC CACATTGACC TGTTTTACCA GCACCGGGTG GACCCGGATG TTCCGATGGA GGAGGTGGCA GGTACGGTGA AGGAACTGAT GCAGGAGGGA AAAGTGCTGC ATTTCGGCCT GTCCGAAGCC GGCGCCCGTT CCATCAGGAG GGCTTATGCC GAGTGTCCGG TGAGCGCCGT CCAGAGCGAA TACGCTATCT GGTGGAGGGA ACCGGAGACG AAGATTTTTC CCACGTTGGA AGAGTTGGGC ATCGGTTTTG TTCCGTATTG TCCGCTGGGG CGCGCCTTTC TGGCAGGAGC CGTCCGGGAG GACAGCCGTT TTCAAAAGCG GGACCGCCGC GCCACTTTGC CCCGGTTTAC TCCGGAAGCC CTCAGATTCA ACATGCCGCT GACTGTTCTT GTCCGGGAAT GGGCGGAACG CAGGGGCATG ACTCCGGCCC AGTTCGCCCT GTCCTGGATG CTTTCCCGGA AACCGTGGAT TGCGCCTGTT CCCGGAACAA CCAATCCAGC CCATCTGGAT GATTTTCTGG GAGGGGCTTC CGTCCGCCTG TCCGAATCGG AACTCAAGGA ATTCGACCTT GCCTGTTCCA GAATTCCCCT GATGGGGCAC CGGGCGGATC CGTTTACGGA GAGCCAGATT GACAAGTAG
|
Protein sequence | MKRKDFLKIT SGLALSLVSR GWAGVGSSLL SDGPGSSSGV PKKGFLGESR RLGGLEVSSI GLGCLPMVGY YGGKYDKQEM IALIRRAFDK GVTFFDTAEV YGPYTSEEWV GEALAPVRNQ VRIGTKFGFG VEEGRPSSLN SRPDHIRRAV EGSLRRLRTD HIDLFYQHRV DPDVPMEEVA GTVKELMQEG KVLHFGLSEA GARSIRRAYA ECPVSAVQSE YAIWWREPET KIFPTLEELG IGFVPYCPLG RAFLAGAVRE DSRFQKRDRR ATLPRFTPEA LRFNMPLTVL VREWAERRGM TPAQFALSWM LSRKPWIAPV PGTTNPAHLD DFLGGASVRL SESELKEFDL ACSRIPLMGH RADPFTESQI DK
|
| |