Gene Information |
Locus tag | Mthe_0990 |
Symbol | |
ID | 4462866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1074144 |
End bp | 1075415 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639700008 |
Product | 3-isopropylmalate dehydratase large subunit |
Protein accession | YP_843415 |
Protein GI | 116754297 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0065] 3-isopropylmalate dehydratase large subunit |
TIGRFAM ID | [TIGR01343] homoaconitate hydratase family protein; [TIGR02086] 3-isopropylmalate dehydratase, large subunit |
| |
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGTGA TGGGCCTGAG CATGGCTGGT CAAACCCTCT CCGAGAAGAT ATTCTCAAGG GCCGCGAACA AGGAGGCCAG GGCTGGCGAG TTCGTCATGG CCTCGATAGA TTGCGCGATG ATACATGACA TAACCGGACC CCTTGCTGTC AGGGGCTTCT ATGAGATTGC AGGCAAAGGG GCCAGGGTCT GGAATCCCTC CAGGATCGTG ATCCTCTTCG ATCATCAGGT GCCTGCAGAC AGCATAAAGG CAGCTGAGAA CCATCAGATG CTGAGGGCAT TTGCAAAAGA GCAGGGCATC ATAAACTACG ATGTGTTTAG CGGAATATGT CATCAGGTCA TGCCGGAGAA CGGCCATGTG CTTCCAGGAC AGCTCATTGT CGGCACTGAT TCTCACACAT GCACCTACGG CGCGCTGGGT GCATTCGCTA CCGGCATTGG CTCAACAGAT ATGGCCAGCG TCTTCGCCAC AGGAAAGCTC TGGTTTATGG TTCCTCAGAC CCTCAGGCTT GTGATAGACG GGCGCCTACG TAGGAGAGTC ACATCAAAGG ATGTGATTCT CAGGATCATC GGCGACATCG GCGCAGATGG TGCGAACTAC CTCGCATGCG AGTTCGCCGG ATCTGCGGTC GAGAGGATGA GCATCGCAGA GAGAATGACC ATGACCAACA TGTCGATAGA GATGGGCGCG AAGGCAGGGC TCGTGGAGCC TGACAGGGTG ACCATGACAT ACCTAAAGGA GTGGCTCACA GAGGAGCCGA TCAGGGGCGA TGAGGACGCA ATCTTCGAAG AAAAACACTG GGATGTGAAC GATCTCGAAC CACAGGTGGC CATGCCACAT CGTGTTGATA ATGTGGTGCC TGTCAGCAGG CTCCCTCATG TGAAGATTGA CCAGATCTTC CTCGGGTCAT GCACGAACGG ACGATTTGAG GATCTGAAGC TCGCCGCAGA GGTGATGGGT GATGAGCCGG TCGCACGGGG AGTCAGGATG ATAGTAATCC CTGCGAGCAG GAAGGAATAC ATGAGGGCAC TCAGGGCAGG ACTCATCGAG AAGTTCATGG AGGCGGGCGC GATCGTCGAG TCTCCCTGCT GCGGCCCGTG CATGGGTGGA AGCTTTGGGC TGATCGGGCC TGGAGAGGTC TCCCTGTCAA CATCGAACAG AAATTTCGTC GGAAGGCAGG GCAGCCCGAA GGGCGAGGTT TATCTCTGCT CTCCGGCAGT CGCAGGGGCG AGCGCCATAA CAGGAGAGAT CACAGATCCG AGGGAGATCT GA
Protein sequence | MGVMGLSMAG QTLSEKIFSR AANKEARAGE FVMASIDCAM IHDITGPLAV RGFYEIAGKG ARVWNPSRIV ILFDHQVPAD SIKAAENHQM LRAFAKEQGI INYDVFSGIC HQVMPENGHV LPGQLIVGTD SHTCTYGALG AFATGIGSTD MASVFATGKL WFMVPQTLRL VIDGRLRRRV TSKDVILRII GDIGADGANY LACEFAGSAV ERMSIAERMT MTNMSIEMGA KAGLVEPDRV TMTYLKEWLT EEPIRGDEDA IFEEKHWDVN DLEPQVAMPH RVDNVVPVSR LPHVKIDQIF LGSCTNGRFE DLKLAAEVMG DEPVARGVRM IVIPASRKEY MRALRAGLIE KFMEAGAIVE SPCCGPCMGG SFGLIGPGEV SLSTSNRNFV GRQGSPKGEV YLCSPAVAGA SAITGEITDP REI
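
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the source database: the helper names (`gene_length`, `protein_length`, `strip_blocks`) are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon, which is consistent with the values listed above.

```python
# Hypothetical helpers for sanity-checking the record's numbers.
# Assumptions: coordinates are 1-based and inclusive; the CDS length
# includes the stop codon, which is not counted in the protein length.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1

def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon

def strip_blocks(sequence: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(sequence.split())

length_bp = gene_length(1074144, 1075415)
print(length_bp)                  # 1272
print(protein_length(length_bp))  # 423
print(len(strip_blocks("ATGGGCGTGA TGGGCCTGAG CATGGCTGGT")))  # 30
```

The strand does not affect these lengths; for a gene on the minus strand, as here, the listed start and end are still the lower and upper replicon coordinates.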