Gene Mthe_1173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1173 
Symbol 
ID4462638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1257208 
End bp1258491 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content57% 
IMG OID639700190 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_843595 
Protein GI116754477 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAATGC TTGAGGATGC CGCAGCTGGA AGGCTGAATG ACGAGATGAG ACTAGTGGCA 
CAGGCTGAGG GAAAGAGTCC TGAGTTCATA CGCAGAGGCA TTGCAAGCGG AAGAATAGTG
ATACCCATCT CTCCATACAG AGAGACCAGG CCGGTGGGTA TAGGAAAGGG CCTGCGCACC
AAGGTGAACG CATCCATAGG CACGAGCTCT GACATCGTGG ATGTGGATAT GGAGGTCGAG
AAGGCGCGTG TTGCGGAGAG CGCAGGGGCT GACACGCTGA TGGAGCTCTC AACAGGCGGT
GACCTGCGCG AGATCCGGAG GAGGGTGATA GAGGTGACGA GCCTCAGCGT CGGCAGCGTT
CCGCTCTATC AGGCATTCAT CGAGGCGATA AGGAAGCACG GCGCAGGCGT TGATATGACA
GAGGACGAGC TGTTCCGGGC TGTGGATGAG CAGGCGCGGA TGGGCACCAA CTTCATGGCG
ATACATACAG GAATAAACAG AATCTGCCTG GAGCGCCTGA AGGCGCAGGG CGGCAGGTTC
GGAGGGCTCT GCAGCCGTGG GGGCGCGTTC ATGATAGCCT GGATGCTTCA TAACGAAAAG
GAGAACCCTC TGTATAGCGA ATTCGACTAC CTTCTTGAGA TACTGAAGGA GCATGAGGTG
ACTCTTAGCC TCGGGAACGG CATGCGTGCG GGCGCGATTC ACGACTCGAC GGATAGAGCT
CAGATACAGG AGCTTGTAAT CAATGCGGAG CTCGCGGACA GAGCGCAGGC GGCAGGCGTC
CAGACGATCG TCGAGGGGCC GGGGCACATA CCTGTTGATG AGATAGAGGC GAACATAAGG
ATCATGAAGC GCATGACCGA TGAGCGGCCG TTTTACATGC TGGGTCCTCT GGTGACAGAT
ATAGCTCCCG GCTACGATCA TATCGTGGCT GCTATTGGGG CGAGCCTGTC CAGCGCATAC
GGTGCAGACT TCATCTGCTA CGTCACACCT GCGGAGCACC TTGCGCTCCC CACTCCGGAG
GATGTCAGGG AGGGTGTAAT CGCTGCAAGA ATCGCTGCTT ATATCGGGGA TATGATCAAG
CTCGGCAGAA GAGACAGGGA TCTGGAGATG GGGAGGGCAA GAAGAGATCT GCTCTGGGAT
ATGCAGTTTC ACCTGGCACT GGACCCGCAG AGGGCCAGGC AGATCAGGGC TGAGAGAGAG
CCTGCTGATA GCAGGGTCTG CACGATGTGC GGCGATTACT GCGCTCTGAA GATAATAAAG
AGCAGCATCA ACCTGAGCAA GTAG
 
Protein sequence
MGMLEDAAAG RLNDEMRLVA QAEGKSPEFI RRGIASGRIV IPISPYRETR PVGIGKGLRT 
KVNASIGTSS DIVDVDMEVE KARVAESAGA DTLMELSTGG DLREIRRRVI EVTSLSVGSV
PLYQAFIEAI RKHGAGVDMT EDELFRAVDE QARMGTNFMA IHTGINRICL ERLKAQGGRF
GGLCSRGGAF MIAWMLHNEK ENPLYSEFDY LLEILKEHEV TLSLGNGMRA GAIHDSTDRA
QIQELVINAE LADRAQAAGV QTIVEGPGHI PVDEIEANIR IMKRMTDERP FYMLGPLVTD
IAPGYDHIVA AIGASLSSAY GADFICYVTP AEHLALPTPE DVREGVIAAR IAAYIGDMIK
LGRRDRDLEM GRARRDLLWD MQFHLALDPQ RARQIRAERE PADSRVCTMC GDYCALKIIK
SSINLSK