Gene Mthe_0788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0788 
Symbol 
ID4461975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp839074 
End bp840222 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content57% 
IMG OID639699799 
Product3-isopropylmalate dehydratase 
Protein accessionYP_843217 
Protein GI116754099 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCACG ACGGCACAAG CGTCCTTGCC ATAAAGGCAT TCAGGGAGAT GGGGTCAGAG 
AAGGTCTGGG ATAAAAGCAG GATAGTAATA CCGTTCGATC ACATCGTGCC CGCAAACAAT
GAGACCGCTG CGACGCTTCA GGCGGAGGTG AGAAGATGGG CGAGGGCTCA GGGGATTGAG
AACTTCTACG ACTGCGGTCA TGGCATATGC CACCAGGTCT TCTGCGAGAT GGGTTTCGCT
CTTCCTGGGG CGCTTGTCGT GGGCGCCGAC TCTCATTCCT GTACTTATGG TGCACTCGGC
GCATTCGGAA CAGGTGTGGG CGCCACGGAC ATGGCTGAGA TCTATTCCCG CGGGAGGCTA
TGGTTCAGAG TGCCGGAGAC GATATGCATG CGCCTTGAGG GCACTCTGGG TGATATGGTA
TCAGCAAAGG ATCTCGCCCT CTTCGTGGTG AAGGAGATGG GCGCGGATGG CGCCAACTAC
ATGTCCGTGG AGTTCGTCGG CGGGGCTGTG GAGAGGCTGA GCATATCAGG CAGGATGACT
CTGTGCAACA TGGGTGTTGA GATGGGAGCA AAGGCTGCGA TCGTCCCGCC GGATGAGAGC
GTCGACGCAT ACCTCGCTAG AAGAGCCAGA CGTCCATACA CGCACATCCA CTCAGACCCG
GGATCATACT ACAGAGAGAT CGAGTACGAT GTGAGCGATA TTCCTCCAAT GATTGCGGCT
CCATACCGCG TTGACAATGT TCATCCAGTC AGGGATCTGG CAGGCATCGA GGTGGACCAG
GTATTCATCG GCACATGTAC CAACGGAAGG CTGGAGGATC TGGAGATGGC AGCCCGGATC
GTGAAGGGCA AAAGGGTTAA GATCAGAACG CTTGTGATCC CCGCCTCCAG AGAGATATAT
CTTGGTGCTC TGAGATCTGG GGTAATTGAG ACCCTTGTCG AGGCCGGCGC GATGATCGGC
CCGCCGGGAT GCGGTCCATG CCTTGGCGCA CACATGGGAG TTCTGGGCGA CGGAGAGGTC
TGTTTGTCCA CATCAAACAG AAACTTCCCG GGAAGGATGG GCAGAAACGG AAAGGTCTAC
CTGGCATCGC CTGCAACTGC CGCAGCCACG GCGATCACAG GAAAGATCAC AGATCCAAGG
GACGTATGA
 
Protein sequence
MSHDGTSVLA IKAFREMGSE KVWDKSRIVI PFDHIVPANN ETAATLQAEV RRWARAQGIE 
NFYDCGHGIC HQVFCEMGFA LPGALVVGAD SHSCTYGALG AFGTGVGATD MAEIYSRGRL
WFRVPETICM RLEGTLGDMV SAKDLALFVV KEMGADGANY MSVEFVGGAV ERLSISGRMT
LCNMGVEMGA KAAIVPPDES VDAYLARRAR RPYTHIHSDP GSYYREIEYD VSDIPPMIAA
PYRVDNVHPV RDLAGIEVDQ VFIGTCTNGR LEDLEMAARI VKGKRVKIRT LVIPASREIY
LGALRSGVIE TLVEAGAMIG PPGCGPCLGA HMGVLGDGEV CLSTSNRNFP GRMGRNGKVY
LASPATAAAT AITGKITDPR DV