Gene Cthe_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0521 
Symbol 
ID4808270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp638258 
End bp641527 
Gene Length3270 bp 
Protein Length1089 aa 
Translation table11 
GC content37% 
IMG OID640105936 
Producthelicase-like protein 
Protein accessionYP_001036951 
Protein GI125973041 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.036679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCCAC CTAAGGTTTT GGATAATAAA AAGTATAGGG TTATAGATGA GTTAAAGGCG 
GAATTACGTA AAGGTTCAAA GTTATCAATC ATATCAGCTT ACTTTACCAT CTATGCTTAT
GAAGAACTTA AAAAGGAACT TAGTAAGATA GATAGTATGA GGTTCATTTT TACCGAGCCT
ACTTTTGTAC GCAAGGACCA AGAACTTTAT AGGGAATATT ACATAGTTCG TCACCCTGAA
AAAAAGATAT CTGGCAATGA GTTTGAAATA AAGCTAAGGA ATGAAATGAA GCAGGCGGCC
ATAGCCAAGG AATGTGCCGA GTGGCTGGAA AAGAAAGCAG AAATAAAATC TCTAAGACGG
CCAAATCCTG CTCAACCAAG ACTGGTTTAT ATAGATAATC CTGAAGATAA TGTTTTAATT
CATGGAACTG TTGACTTTAC TACTGATGGT TTAGGTATCA CTCCTTCTAA TAGATTAGAC
TATAACATGT GTATGTATGG TAAGGAATAC ACTATTGATT TTTTACAGTC CTTCAATGAA
CTATGGGAGG ATGATACTGC TGTTCAGGAT GTTAAAGATA AAGTTCTTGA GCAAATGAGA
ATTCTATATA AAGAAAATAC ACCTGAGTTT ATTTACTTTG TTACTCTTTA TAATATATTC
TATGATTACC TTGATGAGCT AACCGAAGAT AACATTGTGA AAAGCCGTAC AGGCTTTAAA
GAAACCCTTA TCTGGAATAA GCTTTATAGA TTCCAGAAGG ATGCCGTTAT GGGAGCAATT
GATAAGCTCG AGAAGTACAA TGGCTGCATT ATAGCTGACA GTGTGGGACT TGGAAAAACT
TTTACTGCAC TGGCTGTAAT TAAATACTAT GAACTAAGAA ACGATAGGGT TTTGGTACTC
GTACCCAAGA AACTTCGTGA AAACTGGACC ATTTATACCC AAAATGATAA ACGCAATATT
TTTGCTGCAG ACAGGTTTAA TTACGATGTT CTGAACCACA CTGATCTTAG CAGGACAAGC
GGCTATTCCG GTGAGATTAA CCTTGCTACC TTAAACTGGT CAAACTATGA CCTTGTTGTA
ATAGATGAGA GCCACAACTT CAGGAATAAT CCGCCGGTCA AAGGGCGTGT AACTCGTTAT
CAGCGCTTGA TGAATGACAT TATAAAATCC GGTGTTAAAA CAAAGGTGCT TATGTTGTCA
GCAACTCCTG TGAACAACCG GATGAATGAT ATCAAGAACC AAATAGCTTT TATAACTGAG
GGCAGGGATG ATGCTTTTAA GGACGTGGGT CTTGATAGTA TTGAAATAAT CCTCAGAAAA
GCCCAAGCTG TATTCAACAA ATGGTCTGAA CTGCCTGAAC ATGAAAGAAC CACTGAAACT
TTTATTAACA TGATGGATAC AGATTATTTT AAACTCCTGG ATACGGTGAC TATTGCCCGT
TCCAGAAAGC ATATAGAAAA ATATTATAAC TTAGAGGAGA TAGGCAAGTT TCCCACCAGG
CTACCCCCTG AGAACAGATA TCCGGCAATT GATGCCAAAG GAGAGTTTCC TCCTATAGAA
GAAATCAACA GACTTATAAA AAAACTTACC TTGTGTATCT ATTCGCCATT GGCCTATGTG
CGTCCTGAAA AAAGAGCTGC TTATGAAGAA AAATATGATA TGGTAGTAGG CTCAAGTAAA
AGTATCTTCA GACAGACAGA CAGGGAACAA AGCTTAGTTG GCCTTATGCG TGTAAACATT
TTAAAAAGAT TGGAAAGCTC CATCAATTCC TTCGCAATAA CTGTTGAGAA TATTCTCCAT
AAAATAGATA AGGCCCTTGA AGCCATAGAA AAAAGGCAGT TTGATTATGA TGCGGAATTG
GATATCAATG ATATTGATAT AGACGACCCG GAGCTTGAAA GCCTCATGTT TGGCAACAAT
GTAAAGGTAT TGCTCCAAGA TATGGACCTT ATCAAATGGA AACAAGACCT GCTGTCGGAT
AAGGACAAGC TCGAAACCAT TTTATTGGAA GCTATGAATA TTACACCAGA TAGGGACGCC
AAATTAATTG AACTTAGGAC TATTATAGAA AACAAAATAC AAAATCAAAT AAACCCAGGG
AACAAAAAGG TTATAGTATT TACTGCCTTT GCTGATACAG CCAGATATTT GTATGAAAAT
TTGGCTGATT ATTTTGCTAA AAAAGGTATT TATTCAGCTG TTGTAACAGG AAGTGGCGAC
AATCATTCCA ATTTACCTGT ACCAAAGGAA TTAAGAAAAA CGGTTAAAAT GTCAGATATA
AATACCATTT TGACATTGTT CTCTCCAGTT TCAAAGGAGT GTTCCAAGAT ATATCCCGAA
GTTGATAGGT ATATAGATAT ACTTATTGCT ACGGATTGTG TATCTGAAGG ACAGAACCTT
CAGGACTGTG ATTATTTGAT AAATTACGAT ATTCATTGGA ATCCGGTTAG AATTATACAA
AGGTTCGGGC GTATTGACCG TATAGGTTCA AAAAACCAAT GTGTTAAATT AGTAAACTTT
TGGGCTACCA AAGACTTGGA CGAATATATA AACCTTCAGC AGCGTGTCAG GGGCAGGATG
GTTTTATTGG ATGTGTCTGC CACAGGTGAG GAAAATATTA TAGAAACAGA CTCTGTTAAA
GAGATGAGAG ACCTTGAATA CCGTAAGAAG CAGCTCGAAA GACTGCAGAA AGAGGTTGTG
GACCTGGAAG ATATATCTGG AGGTATATCG ATAACAGACC TGACCTTTAA TGATTTTAAG
ATTGAACTTA TGGAATATAT GAAAAAGAAC AGAAAGCTTT TGGATGAAGC ACCAAATGGA
ATGTATGCTG TTGCAAAGAT AGACGAAAGC GTAAAAGATA TTATAAAGCC AGGAGTGATT
TTTACCCTTA GACAAGTCAA AGGGAAACAG CAAAGCAAAG AGCAAAACCC TTTATTTCCC
TACTATATGG TTTATATCGC AGATGATGGG GAAGTAAAGC TATCTTTCCT TCATGCAAAG
AAGATACTGG ATTATTATAA AAAACTATGT TCCGGACAAA AAGAAGTATT TAAAGAGTTG
GTAGAGGAGT TTAATAAAGA GACCAATGAT GGTCGTAAAA TGAAACATTA TTCCGATTTG
TTGGAAACTT CTATTGAAAA CATTATAGGC AAGAAGCAGG AGATTGGAGT GGCCAGCCTC
TTTAGCAAAG GCGGCACCAC CATGCCGAAA AAAATTTTTG ATGGTATTGA AGATTTTGAA
TTGATTACTT TTTTGGTTAT AAAGAGCTAG
 
Protein sequence
MRPPKVLDNK KYRVIDELKA ELRKGSKLSI ISAYFTIYAY EELKKELSKI DSMRFIFTEP 
TFVRKDQELY REYYIVRHPE KKISGNEFEI KLRNEMKQAA IAKECAEWLE KKAEIKSLRR
PNPAQPRLVY IDNPEDNVLI HGTVDFTTDG LGITPSNRLD YNMCMYGKEY TIDFLQSFNE
LWEDDTAVQD VKDKVLEQMR ILYKENTPEF IYFVTLYNIF YDYLDELTED NIVKSRTGFK
ETLIWNKLYR FQKDAVMGAI DKLEKYNGCI IADSVGLGKT FTALAVIKYY ELRNDRVLVL
VPKKLRENWT IYTQNDKRNI FAADRFNYDV LNHTDLSRTS GYSGEINLAT LNWSNYDLVV
IDESHNFRNN PPVKGRVTRY QRLMNDIIKS GVKTKVLMLS ATPVNNRMND IKNQIAFITE
GRDDAFKDVG LDSIEIILRK AQAVFNKWSE LPEHERTTET FINMMDTDYF KLLDTVTIAR
SRKHIEKYYN LEEIGKFPTR LPPENRYPAI DAKGEFPPIE EINRLIKKLT LCIYSPLAYV
RPEKRAAYEE KYDMVVGSSK SIFRQTDREQ SLVGLMRVNI LKRLESSINS FAITVENILH
KIDKALEAIE KRQFDYDAEL DINDIDIDDP ELESLMFGNN VKVLLQDMDL IKWKQDLLSD
KDKLETILLE AMNITPDRDA KLIELRTIIE NKIQNQINPG NKKVIVFTAF ADTARYLYEN
LADYFAKKGI YSAVVTGSGD NHSNLPVPKE LRKTVKMSDI NTILTLFSPV SKECSKIYPE
VDRYIDILIA TDCVSEGQNL QDCDYLINYD IHWNPVRIIQ RFGRIDRIGS KNQCVKLVNF
WATKDLDEYI NLQQRVRGRM VLLDVSATGE ENIIETDSVK EMRDLEYRKK QLERLQKEVV
DLEDISGGIS ITDLTFNDFK IELMEYMKKN RKLLDEAPNG MYAVAKIDES VKDIIKPGVI
FTLRQVKGKQ QSKEQNPLFP YYMVYIADDG EVKLSFLHAK KILDYYKKLC SGQKEVFKEL
VEEFNKETND GRKMKHYSDL LETSIENIIG KKQEIGVASL FSKGGTTMPK KIFDGIEDFE
LITFLVIKS