Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0289 |
Symbol | |
ID | 4808507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 359967 |
End bp | 362324 |
Gene Length | 2358 bp |
Protein Length | 785 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105701 |
Product | DEAD_2 |
Protein accession | YP_001036721 |
Protein GI | 125972811 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00121679 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAT ATAAAAAGGA AATAAAGATA TCGGTAAGGA ATCTGGTTGA GTTTGTGTTA AGGACCGGTG ACATTGACAG CTCTTTTACC GGAAGCAGCA GGGCCGTCGA AGGTACAAGG CTTCATAAAA AAATACAGAA AACCCAGGGC AAGGAGTATA GCCCGGAAGT GTTTCTGAAA ACCACTGTTG AGTTTGACGA CTTTTTCCTC ACAGTTGAAG GACGGGCGGA CGGTGTAATA AATGAGGACG GCTGTTTTAT AATTGATGAG ATAAAAACAA CAACGGCACC TTTGGAGTTG ATAGATGAAT TCTACAATCC CCTGCACTGG GCACAGGCCA AATGCTACGC ATATATCCAT GCGTTAAATG AAAATTTGGA GAAAATAAAG ATAAGGCTTA CCTATTGCCA TTTGGAAACA GAAGAGATAA AATACCTGGT CAGTGAATTC GATTTTGCAG AGCTTAGCCG GTTTTTCGAG GAACTTGTTG AAAAATATTA TGTATGGGCA AAGCTTGCCT GTGACTGGCA GGTTAAAAGG GACTGTTCGA TTAAAGTTCT TGAGTTTCCT TTTGAGAAAT ACAGAAAAGG TCAGAGAAAA CTTGCTGTGG CTGTTTACAA GACCGTTACG GAGGGTAAAA AACTTTATGT GAAGGCGCCC ACAGGTATTG GAAAGACCAT TTCAACCCTG TTTCCGGCAG TCAAGGCAAT AGGGGAGGGA CATGCCTCAA AAATTTTTTA CCTTACGGCA AAGACCGTTA CAGGCGGTGT CGCCAAAGAA GCTTTTGCAA AAATGAGGCA AAAGGGACTT TTATTTAAAA CGGTGACTCT CACTGCAAAG GAAAAAATAT GTTTTATGGA AAAAGCCGTA TGCAAACCGG AAAAATGCGA GTATGCCAAG GGGCATTTTG ACAGAGTGAA CGAGGCAATA ATGGATATAC TGACCAATGA GGATGAAATC AAAAGAGAAG TTATAGAAGA GTATGCAAAG GCCCACAGAG TTTGTCCCTT TGAGCTGGCG CTGGATCTTA CCATTTGGGC CGATGCGGTA ATTTGTGATT ACAATTATGT GTTTGATCCG AGGGTGTACC TGAAAAGATT TTTCTCCGAT GCGGGCGGTG ACTATATTTT TCTGGTGGAT GAGGCGCACA ACCTTGTGGA CAGGGCAAGA GAAATGTTTT CGGCGCAGCT TTCGAAAAAG AGCTTTCTTG AGCTGAAAAA GGCGATGAAA GAGGAAAGTC CCAAAATATC GAAAACACTG CAAAAGCTTA ACACATTTAT GCTGGGTATG AAAAAACTTT GCGGCGATAA CGACTACTTT GTAAGCAAGG AGGAGCAAAG TGAAATATAC CTGCTTTTAA GAAGACTTAT CGGTGAATGC GAAGAATATT TGACGGACAG GGCAAAGAAC GGAATTGAAA ATGAGGATTT GCTGCAGCTT TATTTTGATG CCCTTATGTA TGTCAGGATA GCCGAGTTTT ATGATGACAG GTATGTTACC TTTGTGGAAA AATCCGATAA CGATGTTAGA ATAAAGCTTT TTTGTATCGA CCCTTCCCAT CTTTTAAGCG AAGCTTTAAA AAGAGGAAAG GCAGCGGTTT TCTTTTCGGC CACGTTGCTT CCTTTGAGTT ATTTCAAGGA AGTTTTGGGA GGAGGGCCGG ATGATTACAC GATGTGTTTG GATTCACCTT TTGAAGTGAA TAACAGATGC CTCATGATAG CCGACAGAAT ATCCACCCGT TATCAGGACA GGGGCAAAAG CTGCGATGAG GTGGTGCAGT GCATAAAATC CATCGTTTGT GCCAAAAAGG GAAATTACAT TGCTTTTTTT CCGTCCTATC AGTATATGAA CATGATTTAT GAATTGTTTG AAAAGGAATG CGGTGATATT AAGCTTTATG TTCAGTCTTC CTCCATGACG GAAAAGGAAA GGGAGGATTT TCTTGAGCGT TTTAAAGCGG ACCCTCAGGA GACGGTATTG GGTTTTTGCG TGCTGGGAGG GATTTTTTCC GAAGGAATTG ATCTTAGGGA CGACAGGCTG ATAGGGGCGA TTATTGTTGG TGTAGGTCTT CCCCAGATAT GTATTGAAAG AGATATCATA AGGGATTATT ATCAGAATAA AAACCGGCTC GGATTTGAAT ACTCTTACAT GTATCCCGGC ATGAACAAGG TTATGCAGGC GGCGGGAAGA GTTATAAGGT CGGAGAATGA CAAGGGGGTT ATACTTTTAA TTGATGACAG GTTTACAAAC CCAAGTTATC TTGCCCTTTT TCCAAATGAG TGGTTCCCAT ATATCAGAGT TACGGGGAAT AATATATCAG AGCATGTAAA GAAGTTTTGG AGCCGACATG GGGCTTGA
|
Protein sequence | MNEYKKEIKI SVRNLVEFVL RTGDIDSSFT GSSRAVEGTR LHKKIQKTQG KEYSPEVFLK TTVEFDDFFL TVEGRADGVI NEDGCFIIDE IKTTTAPLEL IDEFYNPLHW AQAKCYAYIH ALNENLEKIK IRLTYCHLET EEIKYLVSEF DFAELSRFFE ELVEKYYVWA KLACDWQVKR DCSIKVLEFP FEKYRKGQRK LAVAVYKTVT EGKKLYVKAP TGIGKTISTL FPAVKAIGEG HASKIFYLTA KTVTGGVAKE AFAKMRQKGL LFKTVTLTAK EKICFMEKAV CKPEKCEYAK GHFDRVNEAI MDILTNEDEI KREVIEEYAK AHRVCPFELA LDLTIWADAV ICDYNYVFDP RVYLKRFFSD AGGDYIFLVD EAHNLVDRAR EMFSAQLSKK SFLELKKAMK EESPKISKTL QKLNTFMLGM KKLCGDNDYF VSKEEQSEIY LLLRRLIGEC EEYLTDRAKN GIENEDLLQL YFDALMYVRI AEFYDDRYVT FVEKSDNDVR IKLFCIDPSH LLSEALKRGK AAVFFSATLL PLSYFKEVLG GGPDDYTMCL DSPFEVNNRC LMIADRISTR YQDRGKSCDE VVQCIKSIVC AKKGNYIAFF PSYQYMNMIY ELFEKECGDI KLYVQSSSMT EKEREDFLER FKADPQETVL GFCVLGGIFS EGIDLRDDRL IGAIIVGVGL PQICIERDII RDYYQNKNRL GFEYSYMYPG MNKVMQAAGR VIRSENDKGV ILLIDDRFTN PSYLALFPNE WFPYIRVTGN NISEHVKKFW SRHGA
|
| |