Gene Cthe_2792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2792 
Symbol 
ID4810109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3292404 
End bp3293726 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content47% 
IMG OID640108212 
Productphenylacetate--CoA ligase 
Protein accessionYP_001039184 
Protein GI125975274 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTATGA GAAGATATTG GAATGAAGAA ATAGAGACCA TGTCAAGAAA GGACCTGGAG 
GATTATCAGT TTAGGCTTTT ATCGGAGCAT CTTGCGCTGG CATACGAAAA ATCTCAATAT
TACAGACAGT CTTTTGACGA GGCGGGGGTA AAACCGTCGG ATTTTAAAAA GCTTTCTGAC
ATTAGCAAAT TTCCTTTTGT GAACAAACAT ATAGAGCGGG AAAGACAGCA AAAAAAGCCT
TTGCTTGGCG ACATGACGGC TGTGGCCGAG GAGGAAGTGG TGTTTGTATC CGCTTCCAGC
GGCTCAACGG GAGTTCCTAC GCTAAGTCCC TTTACAAAGA AGGATTTTGA AGAATTTCAG
GATGTTCAAA GCAGGTTGTT TTGGGCGGCA GGAATGAGAC CCAACGACCG TTATGTTCAT
GCCCTCAATT TCACATTATT TGTGGGAGGT CCGGACGTTA TAGGCGCTCA AAATCTAGGG
GCTTTGTGCA TTTGGGCAGG AGCCATTCCT TCCGACAGGC TGCTCTTTAT CCTTAAAGAG
TTTCAGCCTA CCGTTATATG GACGACACCT TCCTATGCAT GGTACCTGGG GGAAACTGCG
AAAAAACAGG GAATTGACCC TGCAAAGGAC CTTTCCATCA ACAAAATCAT TGTGGCAGGA
GAGCCGGGAG GCTCTATTGA TGCCACAAGG CAAGCCATTG AGGAGCTTTG GGATGCAAAA
GTCTACGATT TCTACGGAAT TTCGGACATT TTCGGAGCAT GCGCGGGAAT GTGCAGCGAG
AGAAACGGTC TTCATTTGGT GGAGGACCAT ATTCTGGTTG AAGTAATCAA TCCCGATACT
TTAGAGCCGG TTGCGGAAGG AGAAAGAGGG GAACTGGTAT TTACCACTTT AAGAAAAACT
GCAAGGCCGA TGATTCGATT CCGGACGGGA GATATCGGCA CGGTAAACAG GGAGAAATGC
GCCTGCGGAC GTACCCATGC CCGCATAAAC ATTACAGGGC GCCTGGATGA TATGCTGATT
GTATCTGGAG TAAATGTGTT CCCCAGTGAT ATTGAGTATG TTGTACGCAA CATGGAAGAA
CTTTCGGGAG AATACAGGAT TACTGCCATA ACAGAAAACT TTACCACAAA ATTTAAGCTT
GAAGTGGAGA GGGCGCTCGG AAACCAGGAG CCCAAAGAAG TGCTTGCAGA GAAAGTATCA
GCCAGAATAA AGGCGCGCTT AGGTGTCAGG CCAAGAGAAG TCATTGTTCT GGAGAACGGT
GAACTTCCCA GGGCCACCCA CAAAGCAAAA AGGTTGATTG ATGAGAGAAA CGGGGGATTT
TAA
 
Protein sequence
MSMRRYWNEE IETMSRKDLE DYQFRLLSEH LALAYEKSQY YRQSFDEAGV KPSDFKKLSD 
ISKFPFVNKH IERERQQKKP LLGDMTAVAE EEVVFVSASS GSTGVPTLSP FTKKDFEEFQ
DVQSRLFWAA GMRPNDRYVH ALNFTLFVGG PDVIGAQNLG ALCIWAGAIP SDRLLFILKE
FQPTVIWTTP SYAWYLGETA KKQGIDPAKD LSINKIIVAG EPGGSIDATR QAIEELWDAK
VYDFYGISDI FGACAGMCSE RNGLHLVEDH ILVEVINPDT LEPVAEGERG ELVFTTLRKT
ARPMIRFRTG DIGTVNREKC ACGRTHARIN ITGRLDDMLI VSGVNVFPSD IEYVVRNMEE
LSGEYRITAI TENFTTKFKL EVERALGNQE PKEVLAEKVS ARIKARLGVR PREVIVLENG
ELPRATHKAK RLIDERNGGF