Gene Mthe_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0903 
Symbol 
ID4462367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp981817 
End bp983205 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content57% 
IMG OID639699922 
Productdihydrolipoamide dehydrogenase 
Protein accessionYP_843331 
Protein GI116754213 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGACA AATACGATGT AATAGTGATC GGATCCGGAG CCGGGCTCAT AGTTGCGCAG 
AGGGCTCTCT TTGAGGGGCT CAGGGTGGCT CTCATCGAGC ACGGCCCTCT TGGCGGAACA
TGCCTCAACA CCGGATGCAT CCCCTCGAAG ATGCTGATAC ATCCAGCTGA TATCGTAAGG
ATGGTGGAGG ATGGCAGCAG GCTCGGCATA AAGGCACACG TCGACTCGAT AGATTTTGAA
TTCATCATGA AACGAATGAG GGATCTGATC GAGGCCGAGC GGGGGGAGAT GGAGAAGGCG
ATCGGGGAGG AGGAGCAGCT CCGCTGGTAC CGCAGCACAG GTGTTTTTGT CGGTGATCAT
CTCATCAGGG TAGGTGAGGA GGAGATAACA GCTCCCTGGA TCGTCATCGC AGCTGGTGCC
AGAACGCTTG TTCCTCCTGT GGCAGGTCTT GGCGAGGCGG GGTATCTGGA TAACGTCTCC
GTCTTCTCAT TAGCGAAACC CCCTGAAAGC CTGATCATCC TGGGAGGGGG ATACATCGCA
TGCGAGTTCG GGCACTTCTT CTCCGCGATG GGCTCTGATG TCACGATAGT GGGGCGGAAC
CCCCGGCTTC TCAAATCAGA GGATCACGAG ATATCGGATA TGGCGCTGAA GGCGTTATCC
AAACACATGC GAATCCACAC GAACATGGAG GCGATACGTG TGGATCTGGA GGGCGGGAAG
AAGGTCGTGA CAGCGATCGA TAGATCCAGG GGTGAGACAG TGAGCTTTGT GGGTGATGAG
ATACTCCTGG CAGCCGGCAG GCGTCCGAAC ACCGACATGC TGCAGCCGGA AAAGAGCGGA
GTGGAGATTG ACAGGGCCGG CTGGGTCAGG GTGAATGAGC ACCTGGAGAC CACAGCTCCC
GGGATATGGG CCCTCGGAGA CATCACTGGA AAGCACATGT TCAGGCACAC CGCGAACTAC
GAGGCATCGA TCGTGGCCCA CAATCTCATC AATGCTGCCA GAGGAGAAAA GGAGAAGGTA
AAGGTGGATT ACCATGCAGT GCCGCATGCC ATCTTCACAT ATCCCCAGAT CGCGGGCGTC
GGAATGACAG AGGAGCAGGC GAGGGCCAAC GGGTACGATA TCCTCGTCGG GCGTGCTTAC
TACAAACAGA CAGCCATGGG TTATGCGATG GATGAGGACG GCATGGCCAA GGCGATAGTC
GATGCCAGAA GCGGGCGGAT CCTGGGGTTC CACGTCATAG GATCCTCTGC ACCGGAGCTT
GTGCAGCAGG TCACGTATCT GATGAACGCC GAGAATCAGG ATGTAACGCC GATGGCAAGA
TCGCAGGTTA TCCACCCGGC GATAAGCGAG GTTGTGGCGA GAGCCTTCGG GAACCTGCGT
GGTGAATGA
 
Protein sequence
MDDKYDVIVI GSGAGLIVAQ RALFEGLRVA LIEHGPLGGT CLNTGCIPSK MLIHPADIVR 
MVEDGSRLGI KAHVDSIDFE FIMKRMRDLI EAERGEMEKA IGEEEQLRWY RSTGVFVGDH
LIRVGEEEIT APWIVIAAGA RTLVPPVAGL GEAGYLDNVS VFSLAKPPES LIILGGGYIA
CEFGHFFSAM GSDVTIVGRN PRLLKSEDHE ISDMALKALS KHMRIHTNME AIRVDLEGGK
KVVTAIDRSR GETVSFVGDE ILLAAGRRPN TDMLQPEKSG VEIDRAGWVR VNEHLETTAP
GIWALGDITG KHMFRHTANY EASIVAHNLI NAARGEKEKV KVDYHAVPHA IFTYPQIAGV
GMTEEQARAN GYDILVGRAY YKQTAMGYAM DEDGMAKAIV DARSGRILGF HVIGSSAPEL
VQQVTYLMNA ENQDVTPMAR SQVIHPAISE VVARAFGNLR GE