Gene Mthe_0990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0990 
Symbol 
ID4462866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1074144 
End bp1075415 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content56% 
IMG OID639700008 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_843415 
Protein GI116754297 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTGA TGGGCCTGAG CATGGCTGGT CAAACCCTCT CCGAGAAGAT ATTCTCAAGG 
GCCGCGAACA AGGAGGCCAG GGCTGGCGAG TTCGTCATGG CCTCGATAGA TTGCGCGATG
ATACATGACA TAACCGGACC CCTTGCTGTC AGGGGCTTCT ATGAGATTGC AGGCAAAGGG
GCCAGGGTCT GGAATCCCTC CAGGATCGTG ATCCTCTTCG ATCATCAGGT GCCTGCAGAC
AGCATAAAGG CAGCTGAGAA CCATCAGATG CTGAGGGCAT TTGCAAAAGA GCAGGGCATC
ATAAACTACG ATGTGTTTAG CGGAATATGT CATCAGGTCA TGCCGGAGAA CGGCCATGTG
CTTCCAGGAC AGCTCATTGT CGGCACTGAT TCTCACACAT GCACCTACGG CGCGCTGGGT
GCATTCGCTA CCGGCATTGG CTCAACAGAT ATGGCCAGCG TCTTCGCCAC AGGAAAGCTC
TGGTTTATGG TTCCTCAGAC CCTCAGGCTT GTGATAGACG GGCGCCTACG TAGGAGAGTC
ACATCAAAGG ATGTGATTCT CAGGATCATC GGCGACATCG GCGCAGATGG TGCGAACTAC
CTCGCATGCG AGTTCGCCGG ATCTGCGGTC GAGAGGATGA GCATCGCAGA GAGAATGACC
ATGACCAACA TGTCGATAGA GATGGGCGCG AAGGCAGGGC TCGTGGAGCC TGACAGGGTG
ACCATGACAT ACCTAAAGGA GTGGCTCACA GAGGAGCCGA TCAGGGGCGA TGAGGACGCA
ATCTTCGAAG AAAAACACTG GGATGTGAAC GATCTCGAAC CACAGGTGGC CATGCCACAT
CGTGTTGATA ATGTGGTGCC TGTCAGCAGG CTCCCTCATG TGAAGATTGA CCAGATCTTC
CTCGGGTCAT GCACGAACGG ACGATTTGAG GATCTGAAGC TCGCCGCAGA GGTGATGGGT
GATGAGCCGG TCGCACGGGG AGTCAGGATG ATAGTAATCC CTGCGAGCAG GAAGGAATAC
ATGAGGGCAC TCAGGGCAGG ACTCATCGAG AAGTTCATGG AGGCGGGCGC GATCGTCGAG
TCTCCCTGCT GCGGCCCGTG CATGGGTGGA AGCTTTGGGC TGATCGGGCC TGGAGAGGTC
TCCCTGTCAA CATCGAACAG AAATTTCGTC GGAAGGCAGG GCAGCCCGAA GGGCGAGGTT
TATCTCTGCT CTCCGGCAGT CGCAGGGGCG AGCGCCATAA CAGGAGAGAT CACAGATCCG
AGGGAGATCT GA
 
Protein sequence
MGVMGLSMAG QTLSEKIFSR AANKEARAGE FVMASIDCAM IHDITGPLAV RGFYEIAGKG 
ARVWNPSRIV ILFDHQVPAD SIKAAENHQM LRAFAKEQGI INYDVFSGIC HQVMPENGHV
LPGQLIVGTD SHTCTYGALG AFATGIGSTD MASVFATGKL WFMVPQTLRL VIDGRLRRRV
TSKDVILRII GDIGADGANY LACEFAGSAV ERMSIAERMT MTNMSIEMGA KAGLVEPDRV
TMTYLKEWLT EEPIRGDEDA IFEEKHWDVN DLEPQVAMPH RVDNVVPVSR LPHVKIDQIF
LGSCTNGRFE DLKLAAEVMG DEPVARGVRM IVIPASRKEY MRALRAGLIE KFMEAGAIVE
SPCCGPCMGG SFGLIGPGEV SLSTSNRNFV GRQGSPKGEV YLCSPAVAGA SAITGEITDP
REI