Gene Cthe_0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0289 
Symbol 
ID4808507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp359967 
End bp362324 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content42% 
IMG OID640105701 
ProductDEAD_2 
Protein accessionYP_001036721 
Protein GI125972811 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00121679 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAT ATAAAAAGGA AATAAAGATA TCGGTAAGGA ATCTGGTTGA GTTTGTGTTA 
AGGACCGGTG ACATTGACAG CTCTTTTACC GGAAGCAGCA GGGCCGTCGA AGGTACAAGG
CTTCATAAAA AAATACAGAA AACCCAGGGC AAGGAGTATA GCCCGGAAGT GTTTCTGAAA
ACCACTGTTG AGTTTGACGA CTTTTTCCTC ACAGTTGAAG GACGGGCGGA CGGTGTAATA
AATGAGGACG GCTGTTTTAT AATTGATGAG ATAAAAACAA CAACGGCACC TTTGGAGTTG
ATAGATGAAT TCTACAATCC CCTGCACTGG GCACAGGCCA AATGCTACGC ATATATCCAT
GCGTTAAATG AAAATTTGGA GAAAATAAAG ATAAGGCTTA CCTATTGCCA TTTGGAAACA
GAAGAGATAA AATACCTGGT CAGTGAATTC GATTTTGCAG AGCTTAGCCG GTTTTTCGAG
GAACTTGTTG AAAAATATTA TGTATGGGCA AAGCTTGCCT GTGACTGGCA GGTTAAAAGG
GACTGTTCGA TTAAAGTTCT TGAGTTTCCT TTTGAGAAAT ACAGAAAAGG TCAGAGAAAA
CTTGCTGTGG CTGTTTACAA GACCGTTACG GAGGGTAAAA AACTTTATGT GAAGGCGCCC
ACAGGTATTG GAAAGACCAT TTCAACCCTG TTTCCGGCAG TCAAGGCAAT AGGGGAGGGA
CATGCCTCAA AAATTTTTTA CCTTACGGCA AAGACCGTTA CAGGCGGTGT CGCCAAAGAA
GCTTTTGCAA AAATGAGGCA AAAGGGACTT TTATTTAAAA CGGTGACTCT CACTGCAAAG
GAAAAAATAT GTTTTATGGA AAAAGCCGTA TGCAAACCGG AAAAATGCGA GTATGCCAAG
GGGCATTTTG ACAGAGTGAA CGAGGCAATA ATGGATATAC TGACCAATGA GGATGAAATC
AAAAGAGAAG TTATAGAAGA GTATGCAAAG GCCCACAGAG TTTGTCCCTT TGAGCTGGCG
CTGGATCTTA CCATTTGGGC CGATGCGGTA ATTTGTGATT ACAATTATGT GTTTGATCCG
AGGGTGTACC TGAAAAGATT TTTCTCCGAT GCGGGCGGTG ACTATATTTT TCTGGTGGAT
GAGGCGCACA ACCTTGTGGA CAGGGCAAGA GAAATGTTTT CGGCGCAGCT TTCGAAAAAG
AGCTTTCTTG AGCTGAAAAA GGCGATGAAA GAGGAAAGTC CCAAAATATC GAAAACACTG
CAAAAGCTTA ACACATTTAT GCTGGGTATG AAAAAACTTT GCGGCGATAA CGACTACTTT
GTAAGCAAGG AGGAGCAAAG TGAAATATAC CTGCTTTTAA GAAGACTTAT CGGTGAATGC
GAAGAATATT TGACGGACAG GGCAAAGAAC GGAATTGAAA ATGAGGATTT GCTGCAGCTT
TATTTTGATG CCCTTATGTA TGTCAGGATA GCCGAGTTTT ATGATGACAG GTATGTTACC
TTTGTGGAAA AATCCGATAA CGATGTTAGA ATAAAGCTTT TTTGTATCGA CCCTTCCCAT
CTTTTAAGCG AAGCTTTAAA AAGAGGAAAG GCAGCGGTTT TCTTTTCGGC CACGTTGCTT
CCTTTGAGTT ATTTCAAGGA AGTTTTGGGA GGAGGGCCGG ATGATTACAC GATGTGTTTG
GATTCACCTT TTGAAGTGAA TAACAGATGC CTCATGATAG CCGACAGAAT ATCCACCCGT
TATCAGGACA GGGGCAAAAG CTGCGATGAG GTGGTGCAGT GCATAAAATC CATCGTTTGT
GCCAAAAAGG GAAATTACAT TGCTTTTTTT CCGTCCTATC AGTATATGAA CATGATTTAT
GAATTGTTTG AAAAGGAATG CGGTGATATT AAGCTTTATG TTCAGTCTTC CTCCATGACG
GAAAAGGAAA GGGAGGATTT TCTTGAGCGT TTTAAAGCGG ACCCTCAGGA GACGGTATTG
GGTTTTTGCG TGCTGGGAGG GATTTTTTCC GAAGGAATTG ATCTTAGGGA CGACAGGCTG
ATAGGGGCGA TTATTGTTGG TGTAGGTCTT CCCCAGATAT GTATTGAAAG AGATATCATA
AGGGATTATT ATCAGAATAA AAACCGGCTC GGATTTGAAT ACTCTTACAT GTATCCCGGC
ATGAACAAGG TTATGCAGGC GGCGGGAAGA GTTATAAGGT CGGAGAATGA CAAGGGGGTT
ATACTTTTAA TTGATGACAG GTTTACAAAC CCAAGTTATC TTGCCCTTTT TCCAAATGAG
TGGTTCCCAT ATATCAGAGT TACGGGGAAT AATATATCAG AGCATGTAAA GAAGTTTTGG
AGCCGACATG GGGCTTGA
 
Protein sequence
MNEYKKEIKI SVRNLVEFVL RTGDIDSSFT GSSRAVEGTR LHKKIQKTQG KEYSPEVFLK 
TTVEFDDFFL TVEGRADGVI NEDGCFIIDE IKTTTAPLEL IDEFYNPLHW AQAKCYAYIH
ALNENLEKIK IRLTYCHLET EEIKYLVSEF DFAELSRFFE ELVEKYYVWA KLACDWQVKR
DCSIKVLEFP FEKYRKGQRK LAVAVYKTVT EGKKLYVKAP TGIGKTISTL FPAVKAIGEG
HASKIFYLTA KTVTGGVAKE AFAKMRQKGL LFKTVTLTAK EKICFMEKAV CKPEKCEYAK
GHFDRVNEAI MDILTNEDEI KREVIEEYAK AHRVCPFELA LDLTIWADAV ICDYNYVFDP
RVYLKRFFSD AGGDYIFLVD EAHNLVDRAR EMFSAQLSKK SFLELKKAMK EESPKISKTL
QKLNTFMLGM KKLCGDNDYF VSKEEQSEIY LLLRRLIGEC EEYLTDRAKN GIENEDLLQL
YFDALMYVRI AEFYDDRYVT FVEKSDNDVR IKLFCIDPSH LLSEALKRGK AAVFFSATLL
PLSYFKEVLG GGPDDYTMCL DSPFEVNNRC LMIADRISTR YQDRGKSCDE VVQCIKSIVC
AKKGNYIAFF PSYQYMNMIY ELFEKECGDI KLYVQSSSMT EKEREDFLER FKADPQETVL
GFCVLGGIFS EGIDLRDDRL IGAIIVGVGL PQICIERDII RDYYQNKNRL GFEYSYMYPG
MNKVMQAAGR VIRSENDKGV ILLIDDRFTN PSYLALFPNE WFPYIRVTGN NISEHVKKFW
SRHGA