Gene Information |
Locus tag | Amuc_1160 |
Symbol | |
ID | 6273855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1391587 |
End bp | 1392735 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613211 |
Product | aldo/keto reductase |
Protein accession | YP_001877766 |
Protein GI | 187735654 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
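The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a small sketch. It assumes 1-based, inclusive coordinates (an assumption, though it is the convention that reproduces the 1149 bp listed) and a complete CDS whose single terminal stop codon is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1391587, 1392735)
print(length)                  # 1149
print(protein_length(length))  # 382
```

Both values match the Gene Length and Protein Length rows of the record.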
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0226313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATGT ATAAACACTA TTCTGACGGC CGCCGCCGCT TTCTCAAAAT GTCCGCCTTG TTCGGTACGT CTGCGCTGAT TTCTCCGGCC ATTGCCCGGG CAGAGGGCGG GGGTGCCGAT TCTCCGGAAA ACGGCGGCTT TGCCGCCTCC CGGAGCCGGA CGCTTGGCAA GGGCTCCTGT GCCTTGGAAG CTTCTCCCCT TGGCTTCGGA GTCATGGGAA TGACCTACAA CCGCAGCCAG TCTCCCTCGC GTGAGCAGTG CATCCGGCTT CTTCATGAGG CCGTGGAGCG CGGAGTGACT CTTTTTGATA CGGCCATCAT CTACGGCCCC CTGAGCAATG AACTTCTGGC CGGGGAAGCT CTTTCCCCCT TCAGGGGGAA GATCAGCGTC ACTACCAAGT TCGGGCACGA AGTTATTAAC GGAAAGGGGA CGGGCCGCCA GGACAGCCGC CCGGAAACCA TCCGGCGCTA CTGCGACGAG TCGCTGCGCC GTTTGAAGGT GGACGCTATT GAATTATTCT ACCAGCACCG CTTTGATCCC AGGATTCCCG TTGAAGATGT AGCGGGAACC ATCTCCGAAC TGGTCAAGGA AGGCAAGGTG CGGCGCTGGG GCATGTGCGA AGTCACTCCC GGTACAATTC GCAGGGCCCA CGCCGTCCAT CCCCTGACAG CCATTCAGAG TGAGTACCAT CTCATGCACC GGGATGTTGA AAACAATGGC GTTCTGGATG TCTGCCGTGA ACTGGGTATA GGCTTTGTCC CCTACAGCCC GCTCAACAGG GGATTCCTGG GGGGCTGCAT CAATGAATAC ACGCAATTTG ACCCGAACAA CGACAACCGC CAGACCCTGC CGCGTTTCAC GCCGGAGGCC ATGAGAGCCA ATATGCGTAT TGTCAATATT CTGCAACAGT TTGGGAGGAC GCGCGGCATG ACTTCCTCCC AGGTTGCCCT GGGATGGCTG CTGCAAAAGG CTCCTTATAT CGTACCCATT CCCGGCACCA CTAAACTGTC CCATCTGGAA GAAAACCTGC ACACGCTTGA TTTTACCTGC TCTCCGCAGG AGTGGGCGGA ACTGGAGAAC GCCGTAGCCG CCACACCCGT GACCGGAGCA CGCTACAACG CGGAACAGCA AAGGCAAGTA GGGCATTGA
|
Protein sequence | MSMYKHYSDG RRRFLKMSAL FGTSALISPA IARAEGGGAD SPENGGFAAS RSRTLGKGSC ALEASPLGFG VMGMTYNRSQ SPSREQCIRL LHEAVERGVT LFDTAIIYGP LSNELLAGEA LSPFRGKISV TTKFGHEVIN GKGTGRQDSR PETIRRYCDE SLRRLKVDAI ELFYQHRFDP RIPVEDVAGT ISELVKEGKV RRWGMCEVTP GTIRRAHAVH PLTAIQSEYH LMHRDVENNG VLDVCRELGI GFVPYSPLNR GFLGGCINEY TQFDPNNDNR QTLPRFTPEA MRANMRIVNI LQQFGRTRGM TSSQVALGWL LQKAPYIVPI PGTTKLSHLE ENLHTLDFTC SPQEWAELEN AVAATPVTGA RYNAEQQRQV GH
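The protein sequence is the gene sequence translated under translation table 11. A minimal sketch of that relationship follows; it relies on the fact that table 11 assigns codons to amino acids exactly as the standard code does and differs only in which codons may act as start codons. Because this gene begins with ATG, no special start-codon handling is included (an alternative start such as GTG or TTG would need to be read as M at position 1). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the sequences are whitespace-stripped because the record prints them in blocks of ten.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last 15 bases of the gene sequence above.
print(translate("ATGAGCATGT ATAAACACTA TTCTGACGGC"))  # MSMYKHYSDG
print(translate("CAAGTA GGGCATTGA"))                  # QVGH
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the Protein sequence and GC content rows.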
|
| |