Gene Cthe_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3041 
Symbol 
ID4811113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3566571 
End bp3567533 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content37% 
IMG OID640108462 
Product1,4-dihydroxy-2-naphthoate octaprenyltransferase 
Protein accessionYP_001039430 
Protein GI125975520 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1575] 1,4-dihydroxy-2-naphthoate octaprenyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000195685 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTAG GCAGCTTTCT GAAACTGGTT GAGATTAAGA CCAAAATTGC CAGCATGGTA 
CCGTTCATGC TGGGTACGAT ATATGCAATA TACCGTTTTA ATGCTTTTAA CGTTAAAAAT
TTTTTATTGA TGTTTATATC CCTCTTGTCC TTTGATATGG TGACAACGGC TCTGAACAAC
TATTTTGATT ATAAAAGAGC GAGAAAAAAA GAGGGATATA ATTATGAACA GCATAATGCA
ATAGTACGGG ACAAGCTTAC AGAGCCTATG GTAATTACGG TTATACTTGT TCTTTTGGGT
ATAGCCATAT TATTTGGAGT ATTACTTTAT TTAAATACAA ATATTATTGT ATTGTTGGTG
GGGGCAATAT CTTTTGCCGT GGGAATTGTT TATTCCTTTG GCCCCCTTCC AATCTCCAGA
ATGCCCCTGG GGGAAGTGTT TTCGGGATTT TTCATGGGAT TTGTAATAAT ATTTGTTTCC
GCATTTGTAC ATATTTATGA CCGGAATATC ATCTTGCTGA CTCTTGAAGG GCAATGGCTG
TCCTTGCGGC TGAATGCCAT GGAGGTGTTG GCTCTTTTTG CCTTTGCTGT CCCTGCGGTA
TGCGGAATTG CAAACATAAT GCTTGCCAAC AATATATGTG ATGTGGATGA TGACATGGAG
AACAAGCGGT ACACACTCCC GATATACATT GGAAAGGAAA AGGCGCTGTG GTTGTTTGAA
ACACTTTATT ATATCGCATT TGTTGATATA ATCATACTTG CTGTTTTCAG GATTGTTTCA
CCAATAGTGT TATTGACATT GCTTGTATTT ATACCGGTAA GAAGGAATAT AGGACTTTTT
AGGAAAAAGC AGACCAAGAA GGACACCTTT GAACTTGCTG TCAAAAATTT TGTGGCGATA
TGTGGTTCGC AATTTATGCT GGTGGGTATT TCCATAATTT TTTCACTTAT AAATTTGTTT
TAA
 
Protein sequence
MRLGSFLKLV EIKTKIASMV PFMLGTIYAI YRFNAFNVKN FLLMFISLLS FDMVTTALNN 
YFDYKRARKK EGYNYEQHNA IVRDKLTEPM VITVILVLLG IAILFGVLLY LNTNIIVLLV
GAISFAVGIV YSFGPLPISR MPLGEVFSGF FMGFVIIFVS AFVHIYDRNI ILLTLEGQWL
SLRLNAMEVL ALFAFAVPAV CGIANIMLAN NICDVDDDME NKRYTLPIYI GKEKALWLFE
TLYYIAFVDI IILAVFRIVS PIVLLTLLVF IPVRRNIGLF RKKQTKKDTF ELAVKNFVAI
CGSQFMLVGI SIIFSLINLF