Gene Cthe_0681 details

Gene Information

Locus tag: Cthe_0681
Symbol:
ID: 4810299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 837592
End bp: 839085
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 43%
IMG OID: 640106098
Product: inosine 5-monophosphate dehydrogenase
Protein accession: YP_001037109
Protein GI: 125973199
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0516] IMP dehydrogenase/GMP reductase
TIGRFAM ID: [TIGR01302] inosine-5'-monophosphate dehydrogenase
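
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
start_bp, end_bp = 837592, 839085
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1494
print(protein_length(length_bp))  # 497
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.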


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000239611
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTATA TCTATGAAGA GGTGTCAAGA ACTTTCAGCG AATACTTACT AATACCAAAT 
CTTACAACGG AAAAATGTAC TCCGGATAAT ATAGACCTTA GCACTCCTTT GGTAAAATTC
AAAAAAGATG AAGAGTGTAG TTTAAAACTC AATATTCCTA TAGTATCCGC CATTATGCAG
TCGGTGTCCA ACGACACACT GGCAATAGCT CTTGCCAGAT GCGGAGGATT ATCTTTTATA
TATGCTTCTC AGCCAATTGA AAGCCAGGCT GAAATGGTTA AAAGAGTAAA AAAGTACAAG
TCGGGATTTG TTGTCAGCGA CTCCAATCTC ACCATCGACA GCACATTGAA GGATGTCATC
GAGTTAAAGA ACAGAACCGG CCATTCCACT ATCGCAATAA CGGACGACGG TACAGCTTCC
GGAAAGCTTC TTGGATTGGT CACTACAAGG GACTATAGAA TAAGCAGGGA TCCTTTGGAT
AAAAAAGTAA AGGATTTCAT GACACCCTTC TCAAAACTCG TTGTAGGTAA ATTGGGTATC
AGCTTAAGCG AAGCAAATGA CATAATATGG GAAAACAAGC TTAATTGTCT TCCTATTGTT
GACGATGAGC AAAGGCTTCA CTATTTAGTT TTCAGAAAAG ACTATGACGA CCATAAGCAG
AACCCTTATG AACTGCTTGA CAGCAACAAA AGACTTAGAG TCGGAGCAGG AATAAACACA
AGAGACTACA AAGAAAGAGT GCCGGCGCTG GTAGATGCGG GAGTTGACGT CCTGTGTATT
GATTCGTCCG ATGGATTTTC CGTATGGCAG AAATATACCC TGGATTATAT AAAATCAAAT
TACAATATAA AAGTCGGTGC GGGAAATGTG GTTGACAGGG AAGGTTTCTT GTACCTTGCC
GAGGCCGGAG CCGACTTTGT AAAGGTTGGA ATCGGTGGAG GCTCCATTTG TATAACCCGT
GAACAAAAAG GAATCGGAAG AGGACAAGCC ACGGCCGTTA TTGAAGTGGC AAAAGCCAGA
GATGAATATT TTGAGAAAAC CGGCGTTTAC ATTCCGATTT GCTCAGACGG CGGTATTGTT
CACGACTATC ACATTGTTCT GGCCCTGGCA ATGGGTGCTG ATTTCGTAAT GATGGGAAGA
TATTTTGCAA GATTCGATGA AAGTCCCACG AAGAAAGTTA AGAGCGGAAA CGGTTATGTT
AAAGAATATT GGGGGGAAGG CTCAAACAGA GCCAGGAACT GGCAGCGTTA CGACCATGGA
GGGGAAAGTA CCAATCTGAA ATTTGAAGAA GGTGTTGACA GCTACGTACC TTATGCGGGT
AAACTGAGAG ACAACCTTGA AATTACACTG AGCAAAATAA AAGCTACAAT GTCAAGCTGC
GGCGCAGCTT CCATAAGCGA GCTTCAGAAA ACCGCAAGGC TGACTGTGGT ATCTTCCACA
AGCATAATAG AAGGCGGGGC TCACGACGTT ATATTAAAGG ACAAGGATTA TTAA
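
The gene sequence above is shown in blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGGCGTATA TCTATGAAGA GGTGTCAAGA ACTTTCAGCG AATACTTACT AATACCAAAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full 1494 bp sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.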
 
Protein sequence
MAYIYEEVSR TFSEYLLIPN LTTEKCTPDN IDLSTPLVKF KKDEECSLKL NIPIVSAIMQ 
SVSNDTLAIA LARCGGLSFI YASQPIESQA EMVKRVKKYK SGFVVSDSNL TIDSTLKDVI
ELKNRTGHST IAITDDGTAS GKLLGLVTTR DYRISRDPLD KKVKDFMTPF SKLVVGKLGI
SLSEANDIIW ENKLNCLPIV DDEQRLHYLV FRKDYDDHKQ NPYELLDSNK RLRVGAGINT
RDYKERVPAL VDAGVDVLCI DSSDGFSVWQ KYTLDYIKSN YNIKVGAGNV VDREGFLYLA
EAGADFVKVG IGGGSICITR EQKGIGRGQA TAVIEVAKAR DEYFEKTGVY IPICSDGGIV
HDYHIVLALA MGADFVMMGR YFARFDESPT KKVKSGNGYV KEYWGEGSNR ARNWQRYDHG
GESTNLKFEE GVDSYVPYAG KLRDNLEITL SKIKATMSSC GAASISELQK TARLTVVSST
SIIEGGAHDV ILKDKDY
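
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI layout (bases in T, C, A, G order); translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model. That simplification is an assumption that holds here because the gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (first base varies slowest,
# bases ordered T, C, A, G). These codon assignments apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence from this record.
print(translate("ATGGCGTATA TCTATGAAGA GGTGTCAAGA"))
# MAYIYEEVSR
print(translate("AGCATAATAG AAGGCGGGGC TCACGACGTT ATATTAAAGG ACAAGGATTA TTAA"))
# SIIEGGAHDVILKDKDY
```

Both outputs match the corresponding start and end of the protein sequence listed above. The final line can be translated on its own because the preceding 24 lines hold 1440 bases, a multiple of 3.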