Gene Cthe_2040 details


Gene Information

Locus tag: Cthe_2040
Symbol:
ID: 4811010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2428242
End bp: 2431682
Gene Length: 3441 bp
Protein Length: 1146 aa
Translation table: 11
GC content: 45%
IMG OID: 640107447
Product: DNA helicase/exodeoxyribonuclease V, subunit B
Protein accession: YP_001038442
Protein GI: 125974532
COG category: [L] Replication, recombination and repair
COG ID: [COG3857] ATP-dependent nuclease, subunit B
TIGRFAM ID: [TIGR02773] ATP-dependent nuclease subunit B
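
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the reported length only matches under that convention) and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and chosen only for this illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Cthe_2040.
gene_len, protein_len = cds_lengths(2428242, 2431682)
print(gene_len, protein_len)  # 3441 1146
```

Both results agree with the Gene Length (3441 bp) and Protein Length (1146 aa) fields.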


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTGA GATTTATATA TGGCAGGGCG GGCAGTGGGA AAACCCGTTT TTGCCTTGAA 
GAAATAAAAT CAAGGATTAC TTCAAAAGCA ACACATCCGC TGGTTTTGCT TGTTCCCGAG
CAGTTTACCT TTCAGGCTGA AAGAGACCTT ATCAGCGTAC TTGGGACAGG GGGTATCCTG
AAGACCGAGG TATTGAGCTT TAGCCGTATT GCTTACAGAA CATTCAATGA AGCGGGTGGT
ATTACCTATC CCCATATCCA CTCGGCAGGC AAATGCATGA TTCTTTACAG GATTTTGGAC
AAAATGAAAG GTAGCTTTAG AGTGTTTTCC AAAACTGCTG ACCGGCAGGG TTTTGTCAAT
ACGTTGTCCA CTCTTATTAC AGAATTCAAG AAATATAATG TTACACCGGA AGACTTGGAA
AAGGTGAGTA AAGAACTTGA AGAGGATAAT CCTGTGAAGG AAAAGCTTAT GGAGCTTACT
GCGATATATG ACCTGTTTGA AAAGACCATT GCGGAAAGAT ACAGGGATCC GGATGACGAC
CTGACTTTGG CGGCAAAAAA GCTTGGCTCC ATTCCCCTTT ACGACGGTGC CGAAATCTGG
ATTGACGGTT TTACCGGGTT TACTCCCCAG GAATATCAAA TAATAGGCCA ACTTATGAAA
AAAGCTCAAA GGGTTAACAT AAGTTTTTGC ACCGATTGCC TGGACGGCGA CTTAAATGAT
ACCGATATCT TTTCATCAAT TAAAACCGCC TACAGAAAAC TTGTAAAGAT GGCAAAGGAA
AATGGTATTC CTGTGGAGCC TTCCGTTGTT TTGAACAGCA AGCCTTTGTT CCGCTTCAGC
CAAAGCCCGG AGCTTTCCCA TCTTGAACAG TATCTTTACG CATATCCGTA TAAAACATAC
AATGAAAAAA CCAAGGATAT ATCCCTCTTT TCTTCAGTCA ATATATTTGC CGAAGTTGAA
GCTTGTGCCA GGGACATTGT ACGGCTTTGC CGGGACAGGG GAATGCGCTA CAGGGAGATT
GCCGTTGTTA CCGGAAACCT TGACGGCTAT GAAAAGCTTA TCGAAGCTGT TTTTTCAGAA
TACGGAATTC CATGCTTTAT TGACAGGAAA GTGGACATAG TTAACCACCC TTTGGTGCGG
CTGATTATGT CGATGCTGGA TATTTTCATT GAAAACTGGT CATATGAAGC GGTGTTCCGC
TACCTAAAAA CCGGGCTCAC CGGCATTGAC CGAGAGAGCA TCGACCGTTT GGAGAATTAT
GTCTTAGCCT GCGGTATCCG GGGCAGCTGC TGGACCGAAA CAGAGGAATG GAAAATGGTT
CCCGAGCTGA TTCCGAATGA AAAAAGCCTT GAAGAAGCAA AGGAGCTCCT GGAAGACGTA
AATCGTATCA GGGCACAGGT AGTGGCGCCG CTTATGGAAT TCAGGAAGAA AACCAAAGGC
AGAAAGAAAG CTTCCGACTT TTGTGCAAGC CTTTATGATT TCCTTTGCAC CCTGGGAATT
CCCGAGAAAA TTGAAGATGC CATTGAAAAG TTTAGAGAAA GCGGGAATCT GAATCTTGCC
AATGAATACT CCCAGGTTTG GAATGCAGTC ATGGAAGTTT TTGACCACAC AGTGGAGGTT
ATGGGGGATG AGACCTTTGG AATTGAGAAG TTTGCCCGTA TACTTGAAAT CGGATTTGGA
GAATGCAAAA TAGGATTGAT TCCCGCTTCC CTGGACCAGG TGCTTGTAGG GAGTCTGGAA
CGTTCCAGAA GCCATGAAAT AAAAGCTTTG TATATATTGG GAGCCAATGA CGGGGTTTTC
CCGCCTGCAG TGATGGAGGA AGGCATTCTT TCCGATCAGG ACAGAGCCGT GCTTAACAAT
GCGGGGATTG AACTTGCCAG TGATACAAGA ACTCAGGCTT TTGACGGACA ATACTTGATA
TACAGGGCAT TGACCACAGC CGGAAATTAT TTAAGAATCA GCTGGTCCAT TGCGGACCAT
GAAGGAAGAA CCTTGCGGCC TTCCCTGGTT GTATTCCGGC TTCGGAAGTT GTTTTTGAAC
ATCACGGAAA CGAGTAATAT TCTTCCTTCG GGTTCTTTGG AGGAGGAAAT GGAGCTTTTA
TCCGGAAACA GCCCGGCATT TAAGTCCATG GTGTCGGCTT TGCGCCAAAA AGCGGACGGA
AAAGAGATAA AGCCTGTCTG GCAGGAAGCG TACCGCTGGT TTGCTGTGCA GGATGAATGG
AGAGGGAAAT GTGAAGCACT GCGGGCTGCT TTTCAATATA AAAATCTAGC CCAACCGGTA
AGCCGTGAGA AAATTGCGGC TCTTTACGGA GAACCGGCGG TTTCCAGCGT ATCCCGGCTC
GAAAAATACA CTGCCTGTCC CTTTGCCTTT TATGTGCAAT ATGGGCTTGG AGCAAAAGAA
AGGCAGATAT ATTCTTTGCG CCCGCCGGAC GTTGGAACTT TCATGCATGC CGTCATTGAA
AAGTTTTCAA GGATGGTTGC GAAACGGAAT ATTTCATGGA GAGATTTGGA CCGTGACTGG
TGTAGTGAAA AGGTTTCGGA AATCGTGGAT GAAATGCTTG AAAAAATGCA AGGGTCGGGA
ATTGCAGCTT CCAGAAGATA CACGGCTTTG ACCTTAAGGC TCAAGCGCGT GGTGGCAAGA
GCTGTCTGGC TTATTGCGGA ACACATTCGC AGAAGCAGCT TCGAACCGGT GGCATATGAA
GTAGGCTTTG GAGAAAACGG AAAGTATCCG CCCATTGTAA TTGAACTTGA TTCAGGTGAA
AAAATTCATC TTACAGGAAG GATTGACAGG GTGGATGCGT TAAAAACCGA GGACGGCACC
TATTTGAGGA TAGTTGACTA TAAATCGGGC GGCAAGGATT TCAAGCTGTC GGATGTTTTC
TATGGGCTTC AGATTCAATT AATCACCTAT TTGGATGCCC TCTGGGAAAG CGGTGAGGCG
GATGAGAACA ATCCGGTACT TCCCGGAGGA GTGCTGTATT TTAAGATTGA CGACCCGATT
ATCAGAGGAA ACGGCAGAAT GACTGAGGAA GAGATTGAAA AAGCCATAAT GAAACAGCTC
AGAATGAAAG GACTTCTTTT GGCAGATGTG AAACTGATAA GGGAAATGGA TAAGGACATT
GAAGGAAGTT CCATGATTAT ACCCGCCACT GTTAATAAAG ACGGCAGTCT CGGAAAGAAT
ACGTCTGCAG CAACGATGGA GCAGTTTAAG CTGCTTCGAA AATATGTAAG AAAACTTTTG
AAGAATTTGT GCGAGGAAAT TATGAAGGGA AATGTATCCA TAAATCCATA CAAAAAGAAG
GGAACCACGT CCTGCAAGTA TTGCAGTTTC TTGCCGGTGT GCCAGTTTGA CACCACAATG
AAGGAAAACA CTTTCAAATT GCTTTACGAT AAAAAGGATG ATGAGATATG GAGTCTTATG
GCGCAGGAGG AAGAGGAATA A
 
Protein sequence
MSLRFIYGRA GSGKTRFCLE EIKSRITSKA THPLVLLVPE QFTFQAERDL ISVLGTGGIL 
KTEVLSFSRI AYRTFNEAGG ITYPHIHSAG KCMILYRILD KMKGSFRVFS KTADRQGFVN
TLSTLITEFK KYNVTPEDLE KVSKELEEDN PVKEKLMELT AIYDLFEKTI AERYRDPDDD
LTLAAKKLGS IPLYDGAEIW IDGFTGFTPQ EYQIIGQLMK KAQRVNISFC TDCLDGDLND
TDIFSSIKTA YRKLVKMAKE NGIPVEPSVV LNSKPLFRFS QSPELSHLEQ YLYAYPYKTY
NEKTKDISLF SSVNIFAEVE ACARDIVRLC RDRGMRYREI AVVTGNLDGY EKLIEAVFSE
YGIPCFIDRK VDIVNHPLVR LIMSMLDIFI ENWSYEAVFR YLKTGLTGID RESIDRLENY
VLACGIRGSC WTETEEWKMV PELIPNEKSL EEAKELLEDV NRIRAQVVAP LMEFRKKTKG
RKKASDFCAS LYDFLCTLGI PEKIEDAIEK FRESGNLNLA NEYSQVWNAV MEVFDHTVEV
MGDETFGIEK FARILEIGFG ECKIGLIPAS LDQVLVGSLE RSRSHEIKAL YILGANDGVF
PPAVMEEGIL SDQDRAVLNN AGIELASDTR TQAFDGQYLI YRALTTAGNY LRISWSIADH
EGRTLRPSLV VFRLRKLFLN ITETSNILPS GSLEEEMELL SGNSPAFKSM VSALRQKADG
KEIKPVWQEA YRWFAVQDEW RGKCEALRAA FQYKNLAQPV SREKIAALYG EPAVSSVSRL
EKYTACPFAF YVQYGLGAKE RQIYSLRPPD VGTFMHAVIE KFSRMVAKRN ISWRDLDRDW
CSEKVSEIVD EMLEKMQGSG IAASRRYTAL TLRLKRVVAR AVWLIAEHIR RSSFEPVAYE
VGFGENGKYP PIVIELDSGE KIHLTGRIDR VDALKTEDGT YLRIVDYKSG GKDFKLSDVF
YGLQIQLITY LDALWESGEA DENNPVLPGG VLYFKIDDPI IRGNGRMTEE EIEKAIMKQL
RMKGLLLADV KLIREMDKDI EGSSMIIPAT VNKDGSLGKN TSAATMEQFK LLRKYVRKLL
KNLCEEIMKG NVSINPYKKK GTTSCKYCSF LPVCQFDTTM KENTFKLLYD KKDDEIWSLM
AQEEEE
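
The relationship between the gene sequence and the protein sequence listed above can be illustrated with a minimal translation sketch. It is not the database's own pipeline: it assumes the sequence is given 5' to 3' on the coding strand, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Alternative start codons permitted by table 11 are not special-cased;
    this gene starts with ATG, so that does not matter here.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence as listed above.
print(translate("ATGAGCTTGA GATTTATATA TGGCAGGGCG"))  # MSLRFIYGRA
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same kind of check against the complete protein sequence and the GC content field.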