Gene Mthe_0315 details

Gene Information

Locus tag: Mthe_0315
Symbol:
ID: 4463293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 314665
End bp: 315945
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 59%
IMG OID: 639699320
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_842750
Protein GI: 116753632
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
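
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 314665
END_BP = 315945
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed gene length and protein length.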


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.326801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCTGA TGGAAGATGC AAAGAACGGG AGGATTCCCG ATGCGCTCAG GAGTGCCGCT 
GAGATCGAGG GAGTGGATCC GGAAGCGCTG AGGAGGCTCC TGGCGGCAGG GCGTGTGGTG
GTACCCATGA ACGCCCGGCG GATGAGAGAG AGGGCTGTGG GCATAGGAGA GCTTCTCTGT
ACAAAGGTCA ACGTCAATGT CGGCACATCC CCATCGCTTT CCAACATTGA GGAGGAGGTT
GAGAAGGCGC TGGCTGCAGT GAACGCGGGA GCTGATACGA TCATGGACCT CTCGACTGGA
GGGGATCTCG ATGAGATCAG GAGATCGATC CTCAAGAAGG TCGATGTGCC AGTTGGAACC
GTCCCGATAT ACCAGGCGGC TGTTGAGGAG AATCTCACGT CGCAGGGCAT GTTCAACGCT
CTTGAGAAGC ATGCAAAAGA TGGTGTCGAT TTCGTCACGG TCCATGTCGG CGTGAACAAA
GAGTCGATGA GACGCCTGTG CAGGGATCCG AGGCTCATGG GCGTTGTCTC ACGCGGCGGC
TCTCTGACAA TGAAGTACAT AACCGAGACC GGAGAGGAGA ACCCATACTA CGAGGAGTTC
GATTACCTGC TGGAGATAGC GAAGGAGCAC GACCTGACGC TGAGCCTGGG CGATGGCCTC
AGGCCTGGGT GCATAGAGGA TGCGAGCGAC CGGGCGAAGT ATATGGAGTT CATACTGCTT
GGAGAGATGG TTGCGCGCGC CAGGGAGGCA GGCGTCCAGG CGATGGTGGA GGGGCCAGGC
CACGTGCCCG CGGACGAGAT AGAGACCAGC GTGCGCGCGA TGAAGCACCT CACTGACGGC
GCCCCGCTAT ATCTTCTGGG TCCCATTGTC ACAGATGTTG CGCCAGGATA CGATCACATC
ACGGCTGCGA TGGGCGGGCT GATCGCCGGC ATGGCAGGTG CCGACTTCCT TTGCGCCACA
ACGCCAAGCG AGCACCTAGA TCTGCCCACT TTAGAGGATA TCATCGAGGG TACGGTGGTG
ACGAAGATCG CAGCTCATGC AGCCGATCTC ACAAAACCCG GAGTCAGGGA GCGCGCGAGG
GCATGGGACC GGAGGATGGC AGATGCCAGG GCGAACCTCG ACTGGGAGGC GCAGTTCAGA
GAGGCGATAG ATCCCACAAA GGCGAGAAGA ATCCGCCACA GAAGGGGAAT CGATGTGGAG
ACCTGCACGA TGTGCAGCGA ACTCTGCGCG ATAAGGATCG CGAGAGAGGC CCTGAAGCAT
GAGCGCTCAG ACGAATCTTG A
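
The gene sequence is displayed in space-separated groups of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of that cleaning step plus a GC-fraction helper; the helper names are hypothetical, and only the first and last displayed lines are embedded here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First and last display lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGACTCTGA TGGAAGATGC AAAGAACGGG AGGATTCCCG ATGCGCTCAG GAGTGCCGCT"
LAST_LINE = "GAGCGCTCAG ACGAATCTTG A"

if __name__ == "__main__":
    head = clean_sequence(FIRST_LINE)
    tail = clean_sequence(LAST_LINE)
    print(len(head), head[:3])   # bases per full display line, start codon
    print(len(tail), tail[-3:])  # bases on the final line, stop codon
```

Applying `gc_fraction` to the full cleaned sequence, rather than to these two excerpts, is what would be compared against the GC content field.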
 
Protein sequence
MTLMEDAKNG RIPDALRSAA EIEGVDPEAL RRLLAAGRVV VPMNARRMRE RAVGIGELLC 
TKVNVNVGTS PSLSNIEEEV EKALAAVNAG ADTIMDLSTG GDLDEIRRSI LKKVDVPVGT
VPIYQAAVEE NLTSQGMFNA LEKHAKDGVD FVTVHVGVNK ESMRRLCRDP RLMGVVSRGG
SLTMKYITET GEENPYYEEF DYLLEIAKEH DLTLSLGDGL RPGCIEDASD RAKYMEFILL
GEMVARAREA GVQAMVEGPG HVPADEIETS VRAMKHLTDG APLYLLGPIV TDVAPGYDHI
TAAMGGLIAG MAGADFLCAT TPSEHLDLPT LEDIIEGTVV TKIAAHAADL TKPGVRERAR
AWDRRMADAR ANLDWEAQFR EAIDPTKARR IRHRRGIDVE TCTMCSELCA IRIAREALKH
ERSDES
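
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds the codon table from the standard amino-acid assignment string, which translation table 11 shares with the standard code (the tables differ in which codons may act as starts, which this sketch ignores). The function names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace and upper-case the bases."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGACTCTGA TGGAAGATGC AAAGAACGGG AGGATTCCCG ATGCGCTCAG GAGTGCCGCT"
LAST_LINE = "GAGCGCTCAG ACGAATCTTG A"

if __name__ == "__main__":
    print(translate(clean_sequence(FIRST_LINE)))  # MTLMEDAKNGRIPDALRSAA
    print(translate(clean_sequence(LAST_LINE)))   # ERSDES
```

The two excerpts translate to the first twenty and the last six residues of the protein sequence listed above. Each full display line holds 60 bases, a multiple of three, so the last line starts in frame.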