Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2099 |
Symbol | |
ID | 7408808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2224996 |
End bp | 2226063 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643716465 |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_002573948 |
Protein GI | 222530066 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0473] Isocitrate/isopropylmalate dehydrogenase |
TIGRFAM ID | [TIGR00169] 3-isopropylmalate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000153808 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAGGA TAGCTGTAAT TCCTGGAGAT GGAATAGGTC CTGAAGTAAT TGAACAAGCG CTAATTGTTC TTGATAAGAT ATCTTCTAAA TTTGGAGTTA AATTTGAATA TATCTTCATT GACGCTGGCG GATGTGCAAT CGACAAGTAT GGTGTGCCAA TTAGAGAGGA AGATTTAGAA CTTGTTAAAA AATGTGAGGC AACATTATTG GGTGCTGTTG GTGGACCAAA GTGGGATAAT CTTCCCGGAA ATTTGAGACC TGAACAAGCT CTTTTGAAGC TCAGAGGAGG ACTAAAAGTT TATGCAAACC TGCGTCCTGC TGTGCTGTAT GATGAGTTAA GAGATTCATC ACCTCTCAAA AAAGAGATTG TATCAAGAGG CATTGATATT CTGGTTGTAA GAGAGCTAAT TGGTGGCATG TACTTTGGTC CAAAGGGAAG AGAAGTAAAA GATGGCGATG AAGTGGCTTA TGACACAGAG GTGTATTCAA AAAGTGAAGT TAGAAGAATC GCAAAGGTTG CGTTTGAATC TGCCAGAAAG AGAAGGAAAA AGGTAACCTC TGTTGACAAG GCAAACATAT TAGAATCATC AAGGCTCTGG AGAGAAACTG TTGAGGAGGT TGCAAAAGAT TATCCGGATG TGGAGCTTTC TCACATGTAT GTTGACAATG CATCAATGCA GCTTGTAAAA GACCCATCAC AGTTTGATGT TATACTTACT TCCAACATGT TCGGTGACAT TTTGTCTGAT GAGGCATCAA TGATAGTAGG GTCGATTGGT ATGCTTGCCT CAGCTTCACT TGGCGAGGGC AGTGTGGGAC TTTACGAGCC AATACACGGC ACAGCACCTG ACATTGCAGG TCAGGATTTG GCAAACCCGA TTGCAACAAT TTTGTCTGCT GCGATGATGC TGCGCTACAG CTTTGACATG GAAGATGCTG CAAAGGCTAT AGAAAATGCT GTGAAGATTG CTCTCAAAGA AGGGTATAGA ACAAGAGATA TCTACACAGA AAATTGTAAG CTTGTTGGAA CAAAGCAAAT GGGAAAAATT ATTTGTGAAA ATATCTAA
|
Protein sequence | MHRIAVIPGD GIGPEVIEQA LIVLDKISSK FGVKFEYIFI DAGGCAIDKY GVPIREEDLE LVKKCEATLL GAVGGPKWDN LPGNLRPEQA LLKLRGGLKV YANLRPAVLY DELRDSSPLK KEIVSRGIDI LVVRELIGGM YFGPKGREVK DGDEVAYDTE VYSKSEVRRI AKVAFESARK RRKKVTSVDK ANILESSRLW RETVEEVAKD YPDVELSHMY VDNASMQLVK DPSQFDVILT SNMFGDILSD EASMIVGSIG MLASASLGEG SVGLYEPIHG TAPDIAGQDL ANPIATILSA AMMLRYSFDM EDAAKAIENA VKIALKEGYR TRDIYTENCK LVGTKQMGKI ICENI
|
| |