Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1111 |
Symbol | |
ID | 6273966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1326464 |
End bp | 1328140 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613162 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001877718 |
Protein GI | 187735606 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGCG ATAGAGTTAA AGCAGGGTTC GAGCGTGCGC CGCACCGCAG TTTGATGCGT GCCACCGGAA TGACGGATGA GGATTTAAGC CGTCCTTTCA TTGCCATTTG CAATTCTTTT AATGAAGTGA TTCCGGGCCA TGTCCACCTG AACAGGGTGG CCGCCCTCAT CAAGGAGGAG GTACGCAAGG CCGGAGGAAC TCCCGTGGAA TTCAACCTTC CCGGCGTTTG TGACGGCATT GCCATGGGCC ACGGCGGCAT GAAGTTTTCC CTGGCCAGCC GTGAGCTGAT CGCGGACAGC GTAGAGACGA TGCTGAGCGC CCATGCGTTT GATGCTATGA TCTGCATCCC TAATTGCGAC AAGATTGTTC CCGGCATGAT TATGGGCGCC CTGCGCTGCA ATATTCCCAC TATTTTCTGC AGCGGCGGTC CGATGGCCGC CGGCATGGCG GAGGACGGCA CGGTGCTGGA CCTGAACAGC GTGTTTGAGG CTGTCGCCCG CTTTAAGGCA GGCAAGATTA ATGAGGAGGA ACTTCATTCC CTGGAATGCC GCGCCTGCCC CGGCGCCGGT TCCTGCTCCG GCATGTTTAC AGCCAATTCC ATGAATTGCC TGAGCGAGGT GATCGGCCTG GCCCTTCCCG GCAACGGTTC CCTGCTGGCT ACTTCAGAGG AACGAAAGGA GTTCTGGAAG CAGACTGCCC GCCGCGCCGT GGAGATGGCG AAGGCGGACG GCCCCCTGCC GCGAGACATC GTAACCCGTG ACGCTATCGA CAATGCTTTC ACAATTGATA TGGCCATGGG CGGCAGTTCC AATACCGTGC TCCATACGCT GGCTATCGCC AGGGAAGCCG GCGTGGAGTA TGACCTCCAG CGCATCAATG ATATTTCTAG GCGAACCCCG AATATTTGCA AGGTGGCTCC TTCCTCCCGC TTCCACATGC AGGATGTTCT GCGTGCAGGC GGGGTGAGTG CCATCATTCA TGAAATTGCC AGAATTCCCG GAGCCCTTCA TCTGGACGCC ATGACCGTCA GCGGGAAAAC GCTGGGTGAA ACAGTGGAGG GATGCGGCAT TGCGGATGAA ACCGTGATTC ATCCGTTGGA AAATGCCTAT TCCCGTGATG GCGGCCTGGC GATTCTGTTC GGCAATCTGG CTGAGGAGGG CGCTGTGGTG AAAAAGGCGG GTGTGCATCC GAATATGATG AGTTTCCGCG GGCCCGCCGT GATTTTCGAG TCTCAGGAAG AGGCCTGCGA AGGCATCCTT GCCGGGAAGG TGAAATCAGG CGATGTGGTT GTCATACGCA ATGAAGGCCC CAAGGGCGGC CCCGGCATGC AGGAAATGCT GGCTCCCACT TCCTATATTA TGGGGCAGGG CCTTGGCGCG GAAGTGGCGC TTATTACGGA CGGCCGTTTT TCCGGAGCCA CGCACGGAGC CTGTATTGGC CATATTTCCC CGGAAGCGGC GGAAGGCGGC CTGATCGGCC TGCTGAGGAA CGGGGATATT ATTGAGTATT CCATTCCGGA CCGCACGCTG AACGTCTGTT TGAGCGAGGA GGAGATCGCA CGCCGCCGTG CGGATTGGAA ACCTACCTAT AACAGGGTTT CCTCCTCCTG GCTGAGCCGT TACCGCCAGC TTGCCACGAA TGCCAGCAAG GGGGCAGTCC TCCGGCGCGG GGAATAA
|
Protein sequence | MRSDRVKAGF ERAPHRSLMR ATGMTDEDLS RPFIAICNSF NEVIPGHVHL NRVAALIKEE VRKAGGTPVE FNLPGVCDGI AMGHGGMKFS LASRELIADS VETMLSAHAF DAMICIPNCD KIVPGMIMGA LRCNIPTIFC SGGPMAAGMA EDGTVLDLNS VFEAVARFKA GKINEEELHS LECRACPGAG SCSGMFTANS MNCLSEVIGL ALPGNGSLLA TSEERKEFWK QTARRAVEMA KADGPLPRDI VTRDAIDNAF TIDMAMGGSS NTVLHTLAIA REAGVEYDLQ RINDISRRTP NICKVAPSSR FHMQDVLRAG GVSAIIHEIA RIPGALHLDA MTVSGKTLGE TVEGCGIADE TVIHPLENAY SRDGGLAILF GNLAEEGAVV KKAGVHPNMM SFRGPAVIFE SQEEACEGIL AGKVKSGDVV VIRNEGPKGG PGMQEMLAPT SYIMGQGLGA EVALITDGRF SGATHGACIG HISPEAAEGG LIGLLRNGDI IEYSIPDRTL NVCLSEEEIA RRRADWKPTY NRVSSSWLSR YRQLATNASK GAVLRRGE
|
| |