Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2040 |
Symbol | |
ID | 4811010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2428242 |
End bp | 2431682 |
Gene Length | 3441 bp |
Protein Length | 1146 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107447 |
Product | DNA helicase/exodeoxyribonuclease V, subunit B |
Protein accession | YP_001038442 |
Protein GI | 125974532 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3857] ATP-dependent nuclease, subunit B |
TIGRFAM ID | [TIGR02773] ATP-dependent nuclease subunit B |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTGA GATTTATATA TGGCAGGGCG GGCAGTGGGA AAACCCGTTT TTGCCTTGAA GAAATAAAAT CAAGGATTAC TTCAAAAGCA ACACATCCGC TGGTTTTGCT TGTTCCCGAG CAGTTTACCT TTCAGGCTGA AAGAGACCTT ATCAGCGTAC TTGGGACAGG GGGTATCCTG AAGACCGAGG TATTGAGCTT TAGCCGTATT GCTTACAGAA CATTCAATGA AGCGGGTGGT ATTACCTATC CCCATATCCA CTCGGCAGGC AAATGCATGA TTCTTTACAG GATTTTGGAC AAAATGAAAG GTAGCTTTAG AGTGTTTTCC AAAACTGCTG ACCGGCAGGG TTTTGTCAAT ACGTTGTCCA CTCTTATTAC AGAATTCAAG AAATATAATG TTACACCGGA AGACTTGGAA AAGGTGAGTA AAGAACTTGA AGAGGATAAT CCTGTGAAGG AAAAGCTTAT GGAGCTTACT GCGATATATG ACCTGTTTGA AAAGACCATT GCGGAAAGAT ACAGGGATCC GGATGACGAC CTGACTTTGG CGGCAAAAAA GCTTGGCTCC ATTCCCCTTT ACGACGGTGC CGAAATCTGG ATTGACGGTT TTACCGGGTT TACTCCCCAG GAATATCAAA TAATAGGCCA ACTTATGAAA AAAGCTCAAA GGGTTAACAT AAGTTTTTGC ACCGATTGCC TGGACGGCGA CTTAAATGAT ACCGATATCT TTTCATCAAT TAAAACCGCC TACAGAAAAC TTGTAAAGAT GGCAAAGGAA AATGGTATTC CTGTGGAGCC TTCCGTTGTT TTGAACAGCA AGCCTTTGTT CCGCTTCAGC CAAAGCCCGG AGCTTTCCCA TCTTGAACAG TATCTTTACG CATATCCGTA TAAAACATAC AATGAAAAAA CCAAGGATAT ATCCCTCTTT TCTTCAGTCA ATATATTTGC CGAAGTTGAA GCTTGTGCCA GGGACATTGT ACGGCTTTGC CGGGACAGGG GAATGCGCTA CAGGGAGATT GCCGTTGTTA CCGGAAACCT TGACGGCTAT GAAAAGCTTA TCGAAGCTGT TTTTTCAGAA TACGGAATTC CATGCTTTAT TGACAGGAAA GTGGACATAG TTAACCACCC TTTGGTGCGG CTGATTATGT CGATGCTGGA TATTTTCATT GAAAACTGGT CATATGAAGC GGTGTTCCGC TACCTAAAAA CCGGGCTCAC CGGCATTGAC CGAGAGAGCA TCGACCGTTT GGAGAATTAT GTCTTAGCCT GCGGTATCCG GGGCAGCTGC TGGACCGAAA CAGAGGAATG GAAAATGGTT CCCGAGCTGA TTCCGAATGA AAAAAGCCTT GAAGAAGCAA AGGAGCTCCT GGAAGACGTA AATCGTATCA GGGCACAGGT AGTGGCGCCG CTTATGGAAT TCAGGAAGAA AACCAAAGGC AGAAAGAAAG CTTCCGACTT TTGTGCAAGC CTTTATGATT TCCTTTGCAC CCTGGGAATT CCCGAGAAAA TTGAAGATGC CATTGAAAAG TTTAGAGAAA GCGGGAATCT GAATCTTGCC AATGAATACT CCCAGGTTTG GAATGCAGTC ATGGAAGTTT TTGACCACAC AGTGGAGGTT ATGGGGGATG AGACCTTTGG AATTGAGAAG TTTGCCCGTA TACTTGAAAT CGGATTTGGA GAATGCAAAA TAGGATTGAT TCCCGCTTCC CTGGACCAGG TGCTTGTAGG GAGTCTGGAA CGTTCCAGAA GCCATGAAAT AAAAGCTTTG TATATATTGG GAGCCAATGA CGGGGTTTTC CCGCCTGCAG TGATGGAGGA AGGCATTCTT TCCGATCAGG ACAGAGCCGT GCTTAACAAT GCGGGGATTG AACTTGCCAG TGATACAAGA ACTCAGGCTT TTGACGGACA ATACTTGATA TACAGGGCAT TGACCACAGC CGGAAATTAT TTAAGAATCA GCTGGTCCAT TGCGGACCAT GAAGGAAGAA CCTTGCGGCC TTCCCTGGTT GTATTCCGGC TTCGGAAGTT GTTTTTGAAC ATCACGGAAA CGAGTAATAT TCTTCCTTCG GGTTCTTTGG AGGAGGAAAT GGAGCTTTTA TCCGGAAACA GCCCGGCATT TAAGTCCATG GTGTCGGCTT TGCGCCAAAA AGCGGACGGA AAAGAGATAA AGCCTGTCTG GCAGGAAGCG TACCGCTGGT TTGCTGTGCA GGATGAATGG AGAGGGAAAT GTGAAGCACT GCGGGCTGCT TTTCAATATA AAAATCTAGC CCAACCGGTA AGCCGTGAGA AAATTGCGGC TCTTTACGGA GAACCGGCGG TTTCCAGCGT ATCCCGGCTC GAAAAATACA CTGCCTGTCC CTTTGCCTTT TATGTGCAAT ATGGGCTTGG AGCAAAAGAA AGGCAGATAT ATTCTTTGCG CCCGCCGGAC GTTGGAACTT TCATGCATGC CGTCATTGAA AAGTTTTCAA GGATGGTTGC GAAACGGAAT ATTTCATGGA GAGATTTGGA CCGTGACTGG TGTAGTGAAA AGGTTTCGGA AATCGTGGAT GAAATGCTTG AAAAAATGCA AGGGTCGGGA ATTGCAGCTT CCAGAAGATA CACGGCTTTG ACCTTAAGGC TCAAGCGCGT GGTGGCAAGA GCTGTCTGGC TTATTGCGGA ACACATTCGC AGAAGCAGCT TCGAACCGGT GGCATATGAA GTAGGCTTTG GAGAAAACGG AAAGTATCCG CCCATTGTAA TTGAACTTGA TTCAGGTGAA AAAATTCATC TTACAGGAAG GATTGACAGG GTGGATGCGT TAAAAACCGA GGACGGCACC TATTTGAGGA TAGTTGACTA TAAATCGGGC GGCAAGGATT TCAAGCTGTC GGATGTTTTC TATGGGCTTC AGATTCAATT AATCACCTAT TTGGATGCCC TCTGGGAAAG CGGTGAGGCG GATGAGAACA ATCCGGTACT TCCCGGAGGA GTGCTGTATT TTAAGATTGA CGACCCGATT ATCAGAGGAA ACGGCAGAAT GACTGAGGAA GAGATTGAAA AAGCCATAAT GAAACAGCTC AGAATGAAAG GACTTCTTTT GGCAGATGTG AAACTGATAA GGGAAATGGA TAAGGACATT GAAGGAAGTT CCATGATTAT ACCCGCCACT GTTAATAAAG ACGGCAGTCT CGGAAAGAAT ACGTCTGCAG CAACGATGGA GCAGTTTAAG CTGCTTCGAA AATATGTAAG AAAACTTTTG AAGAATTTGT GCGAGGAAAT TATGAAGGGA AATGTATCCA TAAATCCATA CAAAAAGAAG GGAACCACGT CCTGCAAGTA TTGCAGTTTC TTGCCGGTGT GCCAGTTTGA CACCACAATG AAGGAAAACA CTTTCAAATT GCTTTACGAT AAAAAGGATG ATGAGATATG GAGTCTTATG GCGCAGGAGG AAGAGGAATA A
|
Protein sequence | MSLRFIYGRA GSGKTRFCLE EIKSRITSKA THPLVLLVPE QFTFQAERDL ISVLGTGGIL KTEVLSFSRI AYRTFNEAGG ITYPHIHSAG KCMILYRILD KMKGSFRVFS KTADRQGFVN TLSTLITEFK KYNVTPEDLE KVSKELEEDN PVKEKLMELT AIYDLFEKTI AERYRDPDDD LTLAAKKLGS IPLYDGAEIW IDGFTGFTPQ EYQIIGQLMK KAQRVNISFC TDCLDGDLND TDIFSSIKTA YRKLVKMAKE NGIPVEPSVV LNSKPLFRFS QSPELSHLEQ YLYAYPYKTY NEKTKDISLF SSVNIFAEVE ACARDIVRLC RDRGMRYREI AVVTGNLDGY EKLIEAVFSE YGIPCFIDRK VDIVNHPLVR LIMSMLDIFI ENWSYEAVFR YLKTGLTGID RESIDRLENY VLACGIRGSC WTETEEWKMV PELIPNEKSL EEAKELLEDV NRIRAQVVAP LMEFRKKTKG RKKASDFCAS LYDFLCTLGI PEKIEDAIEK FRESGNLNLA NEYSQVWNAV MEVFDHTVEV MGDETFGIEK FARILEIGFG ECKIGLIPAS LDQVLVGSLE RSRSHEIKAL YILGANDGVF PPAVMEEGIL SDQDRAVLNN AGIELASDTR TQAFDGQYLI YRALTTAGNY LRISWSIADH EGRTLRPSLV VFRLRKLFLN ITETSNILPS GSLEEEMELL SGNSPAFKSM VSALRQKADG KEIKPVWQEA YRWFAVQDEW RGKCEALRAA FQYKNLAQPV SREKIAALYG EPAVSSVSRL EKYTACPFAF YVQYGLGAKE RQIYSLRPPD VGTFMHAVIE KFSRMVAKRN ISWRDLDRDW CSEKVSEIVD EMLEKMQGSG IAASRRYTAL TLRLKRVVAR AVWLIAEHIR RSSFEPVAYE VGFGENGKYP PIVIELDSGE KIHLTGRIDR VDALKTEDGT YLRIVDYKSG GKDFKLSDVF YGLQIQLITY LDALWESGEA DENNPVLPGG VLYFKIDDPI IRGNGRMTEE EIEKAIMKQL RMKGLLLADV KLIREMDKDI EGSSMIIPAT VNKDGSLGKN TSAATMEQFK LLRKYVRKLL KNLCEEIMKG NVSINPYKKK GTTSCKYCSF LPVCQFDTTM KENTFKLLYD KKDDEIWSLM AQEEEE
|
| |