Gene Information |
Locus tag | Cthe_0521 |
Symbol | |
ID | 4808270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 638258 |
End bp | 641527 |
Gene Length | 3270 bp |
Protein Length | 1089 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640105936 |
Product | helicase-like protein |
Protein accession | YP_001036951 |
Protein GI | 125973041 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
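The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 638258
end_bp = 641527

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the results match the listed Gene Length (3270 bp) and Protein Length (1089 aa).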
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.036679 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCCAC CTAAGGTTTT GGATAATAAA AAGTATAGGG TTATAGATGA GTTAAAGGCG GAATTACGTA AAGGTTCAAA GTTATCAATC ATATCAGCTT ACTTTACCAT CTATGCTTAT GAAGAACTTA AAAAGGAACT TAGTAAGATA GATAGTATGA GGTTCATTTT TACCGAGCCT ACTTTTGTAC GCAAGGACCA AGAACTTTAT AGGGAATATT ACATAGTTCG TCACCCTGAA AAAAAGATAT CTGGCAATGA GTTTGAAATA AAGCTAAGGA ATGAAATGAA GCAGGCGGCC ATAGCCAAGG AATGTGCCGA GTGGCTGGAA AAGAAAGCAG AAATAAAATC TCTAAGACGG CCAAATCCTG CTCAACCAAG ACTGGTTTAT ATAGATAATC CTGAAGATAA TGTTTTAATT CATGGAACTG TTGACTTTAC TACTGATGGT TTAGGTATCA CTCCTTCTAA TAGATTAGAC TATAACATGT GTATGTATGG TAAGGAATAC ACTATTGATT TTTTACAGTC CTTCAATGAA CTATGGGAGG ATGATACTGC TGTTCAGGAT GTTAAAGATA AAGTTCTTGA GCAAATGAGA ATTCTATATA AAGAAAATAC ACCTGAGTTT ATTTACTTTG TTACTCTTTA TAATATATTC TATGATTACC TTGATGAGCT AACCGAAGAT AACATTGTGA AAAGCCGTAC AGGCTTTAAA GAAACCCTTA TCTGGAATAA GCTTTATAGA TTCCAGAAGG ATGCCGTTAT GGGAGCAATT GATAAGCTCG AGAAGTACAA TGGCTGCATT ATAGCTGACA GTGTGGGACT TGGAAAAACT TTTACTGCAC TGGCTGTAAT TAAATACTAT GAACTAAGAA ACGATAGGGT TTTGGTACTC GTACCCAAGA AACTTCGTGA AAACTGGACC ATTTATACCC AAAATGATAA ACGCAATATT TTTGCTGCAG ACAGGTTTAA TTACGATGTT CTGAACCACA CTGATCTTAG CAGGACAAGC GGCTATTCCG GTGAGATTAA CCTTGCTACC TTAAACTGGT CAAACTATGA CCTTGTTGTA ATAGATGAGA GCCACAACTT CAGGAATAAT CCGCCGGTCA AAGGGCGTGT AACTCGTTAT CAGCGCTTGA TGAATGACAT TATAAAATCC GGTGTTAAAA CAAAGGTGCT TATGTTGTCA GCAACTCCTG TGAACAACCG GATGAATGAT ATCAAGAACC AAATAGCTTT TATAACTGAG GGCAGGGATG ATGCTTTTAA GGACGTGGGT CTTGATAGTA TTGAAATAAT CCTCAGAAAA GCCCAAGCTG TATTCAACAA ATGGTCTGAA CTGCCTGAAC ATGAAAGAAC CACTGAAACT TTTATTAACA TGATGGATAC AGATTATTTT AAACTCCTGG ATACGGTGAC TATTGCCCGT TCCAGAAAGC ATATAGAAAA ATATTATAAC TTAGAGGAGA TAGGCAAGTT TCCCACCAGG CTACCCCCTG AGAACAGATA TCCGGCAATT GATGCCAAAG GAGAGTTTCC TCCTATAGAA GAAATCAACA GACTTATAAA AAAACTTACC TTGTGTATCT ATTCGCCATT GGCCTATGTG CGTCCTGAAA AAAGAGCTGC TTATGAAGAA AAATATGATA TGGTAGTAGG CTCAAGTAAA AGTATCTTCA GACAGACAGA CAGGGAACAA AGCTTAGTTG GCCTTATGCG TGTAAACATT TTAAAAAGAT TGGAAAGCTC CATCAATTCC TTCGCAATAA CTGTTGAGAA TATTCTCCAT 
AAAATAGATA AGGCCCTTGA AGCCATAGAA AAAAGGCAGT TTGATTATGA TGCGGAATTG GATATCAATG ATATTGATAT AGACGACCCG GAGCTTGAAA GCCTCATGTT TGGCAACAAT GTAAAGGTAT TGCTCCAAGA TATGGACCTT ATCAAATGGA AACAAGACCT GCTGTCGGAT AAGGACAAGC TCGAAACCAT TTTATTGGAA GCTATGAATA TTACACCAGA TAGGGACGCC AAATTAATTG AACTTAGGAC TATTATAGAA AACAAAATAC AAAATCAAAT AAACCCAGGG AACAAAAAGG TTATAGTATT TACTGCCTTT GCTGATACAG CCAGATATTT GTATGAAAAT TTGGCTGATT ATTTTGCTAA AAAAGGTATT TATTCAGCTG TTGTAACAGG AAGTGGCGAC AATCATTCCA ATTTACCTGT ACCAAAGGAA TTAAGAAAAA CGGTTAAAAT GTCAGATATA AATACCATTT TGACATTGTT CTCTCCAGTT TCAAAGGAGT GTTCCAAGAT ATATCCCGAA GTTGATAGGT ATATAGATAT ACTTATTGCT ACGGATTGTG TATCTGAAGG ACAGAACCTT CAGGACTGTG ATTATTTGAT AAATTACGAT ATTCATTGGA ATCCGGTTAG AATTATACAA AGGTTCGGGC GTATTGACCG TATAGGTTCA AAAAACCAAT GTGTTAAATT AGTAAACTTT TGGGCTACCA AAGACTTGGA CGAATATATA AACCTTCAGC AGCGTGTCAG GGGCAGGATG GTTTTATTGG ATGTGTCTGC CACAGGTGAG GAAAATATTA TAGAAACAGA CTCTGTTAAA GAGATGAGAG ACCTTGAATA CCGTAAGAAG CAGCTCGAAA GACTGCAGAA AGAGGTTGTG GACCTGGAAG ATATATCTGG AGGTATATCG ATAACAGACC TGACCTTTAA TGATTTTAAG ATTGAACTTA TGGAATATAT GAAAAAGAAC AGAAAGCTTT TGGATGAAGC ACCAAATGGA ATGTATGCTG TTGCAAAGAT AGACGAAAGC GTAAAAGATA TTATAAAGCC AGGAGTGATT TTTACCCTTA GACAAGTCAA AGGGAAACAG CAAAGCAAAG AGCAAAACCC TTTATTTCCC TACTATATGG TTTATATCGC AGATGATGGG GAAGTAAAGC TATCTTTCCT TCATGCAAAG AAGATACTGG ATTATTATAA AAAACTATGT TCCGGACAAA AAGAAGTATT TAAAGAGTTG GTAGAGGAGT TTAATAAAGA GACCAATGAT GGTCGTAAAA TGAAACATTA TTCCGATTTG TTGGAAACTT CTATTGAAAA CATTATAGGC AAGAAGCAGG AGATTGGAGT GGCCAGCCTC TTTAGCAAAG GCGGCACCAC CATGCCGAAA AAAATTTTTG ATGGTATTGA AGATTTTGAA TTGATTACTT TTTTGGTTAT AAAGAGCTAG
|
Protein sequence | MRPPKVLDNK KYRVIDELKA ELRKGSKLSI ISAYFTIYAY EELKKELSKI DSMRFIFTEP TFVRKDQELY REYYIVRHPE KKISGNEFEI KLRNEMKQAA IAKECAEWLE KKAEIKSLRR PNPAQPRLVY IDNPEDNVLI HGTVDFTTDG LGITPSNRLD YNMCMYGKEY TIDFLQSFNE LWEDDTAVQD VKDKVLEQMR ILYKENTPEF IYFVTLYNIF YDYLDELTED NIVKSRTGFK ETLIWNKLYR FQKDAVMGAI DKLEKYNGCI IADSVGLGKT FTALAVIKYY ELRNDRVLVL VPKKLRENWT IYTQNDKRNI FAADRFNYDV LNHTDLSRTS GYSGEINLAT LNWSNYDLVV IDESHNFRNN PPVKGRVTRY QRLMNDIIKS GVKTKVLMLS ATPVNNRMND IKNQIAFITE GRDDAFKDVG LDSIEIILRK AQAVFNKWSE LPEHERTTET FINMMDTDYF KLLDTVTIAR SRKHIEKYYN LEEIGKFPTR LPPENRYPAI DAKGEFPPIE EINRLIKKLT LCIYSPLAYV RPEKRAAYEE KYDMVVGSSK SIFRQTDREQ SLVGLMRVNI LKRLESSINS FAITVENILH KIDKALEAIE KRQFDYDAEL DINDIDIDDP ELESLMFGNN VKVLLQDMDL IKWKQDLLSD KDKLETILLE AMNITPDRDA KLIELRTIIE NKIQNQINPG NKKVIVFTAF ADTARYLYEN LADYFAKKGI YSAVVTGSGD NHSNLPVPKE LRKTVKMSDI NTILTLFSPV SKECSKIYPE VDRYIDILIA TDCVSEGQNL QDCDYLINYD IHWNPVRIIQ RFGRIDRIGS KNQCVKLVNF WATKDLDEYI NLQQRVRGRM VLLDVSATGE ENIIETDSVK EMRDLEYRKK QLERLQKEVV DLEDISGGIS ITDLTFNDFK IELMEYMKKN RKLLDEAPNG MYAVAKIDES VKDIIKPGVI FTLRQVKGKQ QSKEQNPLFP YYMVYIADDG EVKLSFLHAK KILDYYKKLC SGQKEVFKEL VEEFNKETND GRKMKHYSDL LETSIENIIG KKQEIGVASL FSKGGTTMPK KIFDGIEDFE LITFLVIKS
|
| |
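The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a hedged illustration, not the method used to produce this record. It builds a codon table in the conventional TCAG order; translation table 11 assigns the same amino acids as the standard code and differs only in the start codons it permits. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Although the record lists the strand as "-", the displayed gene sequence starts with ATG and ends with TAG, so the sketch assumes it is already in coding orientation and applies no reverse complement.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the 10-character block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, and its last nine in-frame bases.
gene_start = "ATGAGGCCAC CTAAGGTTTT GGATAATAAA"
gene_end = "AAGAGCTAG"

print(translate(gene_start))  # compare with the first block of the protein sequence
print(translate(gene_end))    # compare with the last residues of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_percent` would give values to compare against the listed protein sequence and GC content.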