Gene Cthe_1653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1653 
Symbol 
ID4808903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1978551 
End bp1979909 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content44% 
IMG OID640107068 
ProductSNF2-related protein 
Protein accessionYP_001038069 
Protein GI125974159 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGTGCA GCGCTCCTGA TGGAAATGGG CTGAAAAGTA CCGGGAAAAC TCTTGTCTCA 
ATCGCCATTA TAGGAGCATT GCTTGCTGCA GGAAGAATAA AACGTGTGCT GATTGTTGCG
CCACTTTCCA TTCTGGGAGT GTGGGAAGAC GAGTTTAAAC GGTTCGCAGA TTTCCCATAT
CAGCTTATAG TCCTTAATGG CACGATTCAA AAGAAAATCC AACAGCTAAG ATTTTTAACT
GGCGAAGGTG TCCATGTGGT GGTAGTCAAC TATGAATCTG CATGGAGAAT GGAAAAGGAA
CTGGCTAATT GGCATCCTGA CCTTATCATT GCTGATGAAG GACACAAAAT TAAAACTCAT
AATACTTCTG TTTCAAAGGC AATGCACCGG TTAGGCTTGC TTGCCCGGTA TCGGCTCTTA
CTGACAGGAA CGGTTATAAC CAACAAAGCC ATAGATGTAT TCAGCCAATA TAAGTTTCTC
GATCCACGCA TCTTTGGTAA CAGCTTCTAT GCCTTCAGAA ACCGGTATTT CAATATGGTT
GGCTATGGCA ACCATACGCC AGTGCTGAAA AAATCAATGG AACAGGATTT GATGAAAAGG
ATTCACAGCA TTGCATTCCG GGCGACCAAA GCGGAGTGTC TGGATTTGCC GGAAACCACC
GATATTATTC GCCATATTGA GCTTGAGCCT GTCACTTTAA AGAAATATAA AGAGCTTGTC
AAACAAAGCT ATACTGAGCT GTCAGCAGGA GAAGTAACAG CTACAAACAT ACTGACACGC
TTGCTTCGTC TTTCGCAATT AACCGGCGGC TTCATCGGAA GCGATGACGG TGGGAAAATC
GAGCAAGTCA GTGATGCCAA GTTGAAAGCT CTTGAAGATA TCCTTGAAAG CAGTATTCAA
GAAGGACATA AGCTGGTTGT CATAGCAAGG TTTATCCCTG AAATTCATGC TATATGCAGG
TTGCTGGAGA AAAAGAACAT CGGCTATGCG TGTATTTATG GTGCGACTAA GGATCGCCAA
GAACAAGTAA ACCGGTTTCA ATATGATCCC GACTGCATGG TGTTTGTAGG CCAGATTGCA
ACCGCTGGAC TCGGTATTAC GCTGACTGCT GCAAGCACAA TGGTATTTTA CTCCCTTGAT
TATTCCATGT CGAATTTCGA GCAGACAAAG GCCCGCATCC ATAGAGTTGG ACAGAAGAAT
GGCTGCACAT ATATCTACCT TATTGCCAAG GGTACTGTGG ATTCAAAAAT CCTGACTGCC
CTACGCAATA AGGCAGATCT TGCAAAAATG CTGATAGACG ACTACCGCAA AGGAGCAAAT
CCTTTTGCCC CAGAGGGAGG TGAAAGCTAT GAGCGATAA
 
Protein sequence
MGCSAPDGNG LKSTGKTLVS IAIIGALLAA GRIKRVLIVA PLSILGVWED EFKRFADFPY 
QLIVLNGTIQ KKIQQLRFLT GEGVHVVVVN YESAWRMEKE LANWHPDLII ADEGHKIKTH
NTSVSKAMHR LGLLARYRLL LTGTVITNKA IDVFSQYKFL DPRIFGNSFY AFRNRYFNMV
GYGNHTPVLK KSMEQDLMKR IHSIAFRATK AECLDLPETT DIIRHIELEP VTLKKYKELV
KQSYTELSAG EVTATNILTR LLRLSQLTGG FIGSDDGGKI EQVSDAKLKA LEDILESSIQ
EGHKLVVIAR FIPEIHAICR LLEKKNIGYA CIYGATKDRQ EQVNRFQYDP DCMVFVGQIA
TAGLGITLTA ASTMVFYSLD YSMSNFEQTK ARIHRVGQKN GCTYIYLIAK GTVDSKILTA
LRNKADLAKM LIDDYRKGAN PFAPEGGESY ER