Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0785 |
Symbol | |
ID | 6274397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 921168 |
End bp | 922277 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642612836 |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_001877400 |
Protein GI | 187735288 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0473] Isocitrate/isopropylmalate dehydrogenase |
TIGRFAM ID | [TIGR00169] 3-isopropylmalate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.537709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAC ATCACCATCA TATTGCTGTC TTGGCCGGCG ACGGTATCGG TCCCGAGGTC ATGGCACAGG CTCTGAAGGT TCTGGATGCC GTAAGCGGCA AGTTTGGCTT TACCGTAAGC CGCAAGGAGG CATTTGTAGG GGGTGCGGGC ATTGACCACT GCGGCAAGGC TCTTCCCGAA GAAACCATTC GCGCCTGCGA AGAGGCGGAC GCCGTTCTGT TCGGCTCCGT GGGCGGTCCC AAATGGGAGC ATCTCCCCGC CAATGAACAA CCGGAACGCG GAGCCCTGCT GCCTCTGAGA AAACATTTCG GCTTGTATGC CAACCTGCGT CCGGGCGTAT GCCTTCCGGC CTTGACCCAT GCCTCCCCCA TCAAGAATGA ACTGATTGAA GGCGGATTCG ATATTTTATG TGTCCGGGAA TTGACTGGCG GCCTGTATTT CGGCCAGCCC CGTTTCCGCG AACGGGAGGG AGACGACGAA GTCGTCGTGG ATACCATGCG CTACCATAAA AGCGAGATGG TGCGTATCGC CAAAGTGGCG TTCGAGGCTG CGCGCGGCCG CCGTAAGCGC GTCACCAGCG TGGACAAGGC CAATGTGCTG ACCAATTCCC TGCTGTGGCG CGAGACAATG ATTGAAGTTT CCAAGGATTA TCCCGACGTG GAATTGCTGC ATATGTACGT GGACAATGCA GCCATGCAGC TGGTGCGCAA TCCCCGCCAG TTCGACGTGC TGGTGACGGA AAACCTGTTC GGGGACATTC TTTCCGATGA AATGGCCATG ATTTGCGGCT CCCTGGGCAT GCTGCCCAGC GCCAGCCTGT GCCAAGGCGC GCAGGACAAC GGCCTGTTCT TCGGCCTTTA CGAACCCTCC GGAGGCTCCG CCCCGGATAT TGCCGGCAAG GGCATTGCAA ACCCAATCGC CCAGATTCTT TCCCTTTCCA TGCTGTTGCG CTACTCCCTG GGAGAAAAAA CAGCGGCAGA CGCCATTGAC TCCGCCGTTC GCCGCGTGAT TGACCAGGGC TGCCGCACGG GCGACCTGGC TACGGGAGCT CCCGGAGAAA TCCGCGTGAA TACGGCGGAA ATGGGTGACG CCATCATCGC CGCCCTGTAA
|
Protein sequence | MSEHHHHIAV LAGDGIGPEV MAQALKVLDA VSGKFGFTVS RKEAFVGGAG IDHCGKALPE ETIRACEEAD AVLFGSVGGP KWEHLPANEQ PERGALLPLR KHFGLYANLR PGVCLPALTH ASPIKNELIE GGFDILCVRE LTGGLYFGQP RFREREGDDE VVVDTMRYHK SEMVRIAKVA FEAARGRRKR VTSVDKANVL TNSLLWRETM IEVSKDYPDV ELLHMYVDNA AMQLVRNPRQ FDVLVTENLF GDILSDEMAM ICGSLGMLPS ASLCQGAQDN GLFFGLYEPS GGSAPDIAGK GIANPIAQIL SLSMLLRYSL GEKTAADAID SAVRRVIDQG CRTGDLATGA PGEIRVNTAE MGDAIIAAL
|
| |