Gene Cthe_0354 details

Gene Information

Locus tag: Cthe_0354
Symbol:
ID: 4808503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 444274
End bp: 445263
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 44%
IMG OID: 640105768
Product: putative CoA-substrate-specific enzyme activase
Protein accession: YP_001036785
Protein GI: 125972875
COG category: [I] Lipid transport and metabolism
COG ID: [COG1924] Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain)
TIGRFAM ID: [TIGR00241] CoA-substrate-specific enzyme activase, putative
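
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the record: it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(444274, 445263)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the values agree with the listed Gene Length (990 bp) and Protein Length (329 aa).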


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTGAAAA CCTCGTATTA TCTGGGAATT GATGTCGGTT CGGTCAGCAC GAACCTTGTG 
GTAATCAATG AAAATGATGA GATAGTTGAA AAATTGTACT TAAGAACAAG CGGACAGCCT
ATAAATGCAC TGAAAAATGG TATGAAGACA CTGTACCAAA GGCTTGGCAA GGACGTGAAA
ATAAGAGGTG TGGGAACTAC AGGAAGCGGC AGGCAGCTTG CAAGCGTAAT TGTGGGAGCT
GATATTGTCA AAAACGAGAT TACAACTCAC GCTATTGCGG CACAGAAACT TGTGCCTGAG
GTAAGAACCA TAATAGAAAT AGGCGGGCAG GATTCAAAGA TAATTATTCT GAAAAACGGT
ATTATTTATG ATTTTGCCAT GAATACCGTT TGTGCGGCGG GAACAGGCTC TTTCCTTGAC
AGGCAGGCCG CAAGGCTTGA AATACCGATT GAAGAGTTCG GCTCGTTTGC ACTAAGGTCC
AAGACTCCTG TAAGAATTGC AGGACGGTGT GCTGTATTTG CGGAATCGGA TATGATACAC
AAACAGCAGA CCGGACACAG CGTTGAAGAT ATTATCAACG GTCTTTGCGA GGCATTGGTG
AGGAATTATC TGAATAACCT GGCAAAAGGC AAAGACATCG AAGAACCCAT AGTCTTTCAG
GGCGGAGTTG CCGCGAATGT GGGAATTGTA GCTGCTTTTG AAAGAGCAAT AGGAAAAAAG
ATAATTATAC CTCAGCACTA TGATGTAATG GGAGCGTACG GAGCTGCGCT TATAGCAAAA
GAAGAAATGA TGAAAAACGG CAAAAACACC AACTTCTTTG GTTTTGATAA TATCCACAAT
GACTTTAGAG CCAGAAGCAT AGAATGTAAC GGCTGCTCCA ACATGTGCGA AGTAATTGAA
ATAGTATCAA ATGATGCGGT TGTGGCATGC TGGGGAGACC GATGCGGAAA ATGGTCCGCG
GTGAAAAAGG AAAATCAAAG TGTGGGGTGA
 
Protein sequence
MLKTSYYLGI DVGSVSTNLV VINENDEIVE KLYLRTSGQP INALKNGMKT LYQRLGKDVK 
IRGVGTTGSG RQLASVIVGA DIVKNEITTH AIAAQKLVPE VRTIIEIGGQ DSKIIILKNG
IIYDFAMNTV CAAGTGSFLD RQAARLEIPI EEFGSFALRS KTPVRIAGRC AVFAESDMIH
KQQTGHSVED IINGLCEALV RNYLNNLAKG KDIEEPIVFQ GGVAANVGIV AAFERAIGKK
IIIPQHYDVM GAYGAALIAK EEMMKNGKNT NFFGFDNIHN DFRARSIECN GCSNMCEVIE
IVSNDAVVAC WGDRCGKWSA VKKENQSVG
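
The gene sequence starts with TTG while the protein starts with M, which is consistent with translation table 11, where several alternative start codons are read as methionine at the initiation position. The sketch below is one way to translate a sequence written in the 10-base blocks used above and to compute its GC percentage. It is an illustration, not the database's own pipeline; the function names are hypothetical, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Codon assignments for translation table 11 (same amino acids as the
# standard code), listed in NCBI order with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of table 11; at position 0 they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if index == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "TTGCTGAAAA CCTCGTATTA TCTGGGAATT GATGTCGGTT CGGTCAGCAC GAACCTTGTG"
print(translate_cds(first_line))
```

Run on the first line of the gene sequence, this gives the first 20 residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` lets the result be compared with the listed protein sequence and GC content.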