Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2254 |
Symbol | |
ID | 3830749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2357722 |
End bp | 2358987 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637830174 |
Product | 3-isopropylmalate dehydratase large subunit |
Protein accession | YP_431084 |
Protein GI | 83591075 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR01343] homoaconitate hydratase family protein [TIGR02083] 3-isopropylmalate dehydratase, large subunit [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATGA CCATCACCGA AAAGATCCTG GCCGCCCACG CCGGATTAAA GGCGGTAGAG CCTGGCCAGC TCATCAACGC TAAAGTAGAC CTGGCCCTGG GGAACGATAT CACCGCCCCC CTGGCCATCC AGGAGTTCAA AAAGCTCGGA GTGAAAAAAG TCTTCGACCC GGAGCGGGTC GTCCTGGTAC CGGACCACTT CACCCCGGCC AAGGACATTA AGTCCGCCGA GCAGGCGAAA ATCTTGCGCG ATTTCGCCCG GGAGCAGGGC TTGACCCACT ACTTTGAAAT CGGCCGTATG GGCATCGAGC ACTGCCTGCT GCCGGAGGCG GGCCTGGTCG GTCCCGGAGA CCTGGTCATC GGTGCCGACT CCCATACCTG CACCTACGGG GCCCTCGGGG CCTTCGCCAC CGGCGTCGGC TCCACGGACC TCGCCGCCGC CATGGCCACC GGCGAGCTAT GGTTTAAAGT ACCCGAGACC ATTCTCTTCC GTTACCACGG GAAGCTTAAG CCCTGGGTAG GCGGCAAAGA CCTCATCCTC TACACCATCG GCCGGATCGG CGTCGACGGC GCCCGCTATA TGGCCATGGA ATTCACCGGC GAAGCCATTA CCAACCTCTC CATGGAAGGC CGCTTCACCA TGGCCAACAT GGCCATCGAA GCCGGCGGCA AGAACGGCAT CTTCCCGGTG GACGAAAAGA CCGTAGAATA CATCCGGGGC CGGCTGCAAA GGGACTACCG CATCTACCAG AGCGACCCCG ACGCCCGTTA CAATCAGGAG ATAGACATCG ACGCCAGTAA GATCGAACCC CAGGTAGCCC TGCCCCACCT GCCCGAAAAC GCCCGGAGTG TAAAAGAAAT AGGAGAGATC AAAATCGACC AGGTAGTCAT CGGCAGCTGC ACCAACGGCC GCCTGGAGGA CCTGCGGGTG GCGGCGCAAA TCCTAAAGGG GCAAAAGGTC CATCCCGAAG TAAGACTCAT TGTCATCCCC GGCACCCAGC AGATCTACGC CGCAGCCCTG GCCGAAGGGT TGATAGCAAC CTTTATCGAA GCCGGGGCGG CCGTATCCAC CCCCACCTGC GGCCCCTGCC TGGGCGGGCA TATGGGGATC CTGGCTAAAG GGGAGCGGGC CCTGGCCACC ACCAACCGCA ACTTCGTCGG CCGCATGGGC CATCCCGAGA GCGAAGTCTA TCTCGCCGGC CCGGCGGTAG CCGCCGCCAG CGCCGTTAAA GGGCGCATCG CCGCCCCAGA GGAGGTAGTA AAATGA
|
Protein sequence | MGMTITEKIL AAHAGLKAVE PGQLINAKVD LALGNDITAP LAIQEFKKLG VKKVFDPERV VLVPDHFTPA KDIKSAEQAK ILRDFAREQG LTHYFEIGRM GIEHCLLPEA GLVGPGDLVI GADSHTCTYG ALGAFATGVG STDLAAAMAT GELWFKVPET ILFRYHGKLK PWVGGKDLIL YTIGRIGVDG ARYMAMEFTG EAITNLSMEG RFTMANMAIE AGGKNGIFPV DEKTVEYIRG RLQRDYRIYQ SDPDARYNQE IDIDASKIEP QVALPHLPEN ARSVKEIGEI KIDQVVIGSC TNGRLEDLRV AAQILKGQKV HPEVRLIVIP GTQQIYAAAL AEGLIATFIE AGAAVSTPTC GPCLGGHMGI LAKGERALAT TNRNFVGRMG HPESEVYLAG PAVAAASAVK GRIAAPEEVV K
|
| |