Gene Cthe_2664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2664 
Symbol 
ID4808832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3143384 
End bp3145105 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content40% 
IMG OID640108079 
Product2-octaprenylphenol hydroxylase 
Protein accessionYP_001039056 
Protein GI125975146 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID[TIGR01982] 2-polyprenylphenol 6-hydroxylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.948881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGT CGGTGGATGA CATGGTTATA AATAAAAGGG TCAATATAAA AAGATATAAA 
GAAATAATAT CAGTTTTTGC GAAACATGGC TTTGGTCTGC TGTTTGACCA ACTAGGTATT
TTTGACTATT TAAAGATGGG AAAGAAGTTT TCATCCGATG AGGGAGAAGA CGACAGCGCA
AAACTCTCCA TGGGTGAGCG AATCCGGCTG TCCTGTGAAG AACTTGGCCC AACCTTTGTA
AAAATGGGTC AGATACTCAG TACAAGGCCC GATGTTATTC CATCCGATAT TATTGAAGAA
TTAAAAAAAC TTCAGGACTC AGTGCACCCG TTTTCCTTTG ATGAGGTCAG GACGGTAATA
GAAAGCGAGT TTGAAGACAA AATTGAGAAT ATTTTTATGG AGTTTTCAGA AGAACCCATT
GCTGCGGCTT CCATATCCCA GGTTCATTGC GCAAGGTTGA AATCCGGTGA AAAAGTTGCG
GTTAAGGTAC AAAGACCGGG TATAGAAAGA AACATAGCTT TGGACTTAAA TATTCTTAAA
GACCTTGTGC ATTTTATTGA GAATCACACA AAATACGGTA AGATTTATGA TCTTGTCGGA
ATTGTGGTAG ACTTTGAAAA CACGATAAAG AACGAGCTGG ATTTTACCAA AGAAGGGGAA
AACGCCGAAA CTTTCAGAAA AAATTTTGCC AAAGACGGAA TTGCAAGAGT TCCGGAGATT
AAGTGGACGT ATACCACACG GCGTGTACTT ACAATGGAAT ACGTTGAAGG TATTCAGATT
GACGACCTGG AGGGACTTGA AAAGGCGGGA ATTGACAAAG TGGAGCTTGC CAGAAACCTT
GCTACGTCAA TATGCAATCA AATACTTACA GACGGCTTTT ATCATGCTGA CCCGCACCCG
GGAAATATCA GAGTCCTTGC GGACGGAACG ATAGTTTTTT TAGATTTGGG AATGGTTGGA
ATCATCAATG AGTCGCGAAA GAAAATGATA TCAAATTTTT TTATAGGCGT TTCCACAAAA
AACAGCAAAA TAGTGGCCCG TTCAATTGTT GACCTGGGAG AAATGTCGGA GAGGCGTAAT
TTAAAAAGGT TTGAAAAAGA TATAGACAAA ATGATTGATA AATATATTGC CATGCCTTGG
AGTGAAATAA AAGTAGTTGA CCTGTTTTAT GAGGTGTTTA ATATTGCCTT TGTAAATGAA
ATAAAGATAC CGCGGGAATT TGCAATGCTG GCCAAAACCC TTGGTACTGC CCAGGGAGTT
TTGGAAAAGC TGGCACCGGA TCTTAACACC ATAGAAATCG CAAAACCCAT AGCGAAGAAG
CTTATGTATC GGTCCTTTTC GATAAAAAAT GTTACCGACA GCATAAAAAA GAATGCCTTA
AATTACCGGG ATGCGATGGA TCTGCTTAAT GAGTTTCCGT CCATTGTTTT AGGCATTCTG
GAAAAATTCG AAGACAGGGA CTACACTCTT CAGCTGGGGA TTAAAGACAT TGACAAAATT
ATGAAGAGGC TTGACAGAAA CTTCAACAGG ATGTCTTTGA GTGTGGTGCT TCTTGCCGTA
AGTATTATTA TTACGGGTGT TATTATAGGT TCCAGCCAGA GTGCAAGTCC GGGAAGTGAA
ATGTATTTTG TAAATGTTAC GGCCCTCAGA ATTGGGCTGG GAATTGTGGT GGCCATAATA
TTGGGCCTGG TTATTTCCAT GTTTCGGTCA AATCACTTTT AA
 
Protein sequence
MAKSVDDMVI NKRVNIKRYK EIISVFAKHG FGLLFDQLGI FDYLKMGKKF SSDEGEDDSA 
KLSMGERIRL SCEELGPTFV KMGQILSTRP DVIPSDIIEE LKKLQDSVHP FSFDEVRTVI
ESEFEDKIEN IFMEFSEEPI AAASISQVHC ARLKSGEKVA VKVQRPGIER NIALDLNILK
DLVHFIENHT KYGKIYDLVG IVVDFENTIK NELDFTKEGE NAETFRKNFA KDGIARVPEI
KWTYTTRRVL TMEYVEGIQI DDLEGLEKAG IDKVELARNL ATSICNQILT DGFYHADPHP
GNIRVLADGT IVFLDLGMVG IINESRKKMI SNFFIGVSTK NSKIVARSIV DLGEMSERRN
LKRFEKDIDK MIDKYIAMPW SEIKVVDLFY EVFNIAFVNE IKIPREFAML AKTLGTAQGV
LEKLAPDLNT IEIAKPIAKK LMYRSFSIKN VTDSIKKNAL NYRDAMDLLN EFPSIVLGIL
EKFEDRDYTL QLGIKDIDKI MKRLDRNFNR MSLSVVLLAV SIIITGVIIG SSQSASPGSE
MYFVNVTALR IGLGIVVAII LGLVISMFRS NHF