Gene Cthe_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2039 
Symbol 
ID4811009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2424458 
End bp2428213 
Gene Length3756 bp 
Protein Length1251 aa 
Translation table11 
GC content47% 
IMG OID640107446 
ProductDNA helicase/exodeoxyribonuclease V, subunit A 
Protein accessionYP_001038441 
Protein GI125974531 
COG category[L] Replication, recombination and repair 
COG ID[COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) 
TIGRFAM ID[TIGR02785] recombination helicase AddA, Firmicutes type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAGA GGACTAAATG GACGGATGAA CAATGGGAGG CCATAACCGG GAATGAAAAG 
AGTCTTTTGG TTTCAGCGGC GGCAGGAGCG GGAAAAACAG CGGTACTGGT GGAAAGAATC
ATCCGAAAAA TCACCGACGA AGAAAATCCC GTGGACATTG ACAGGCTGCT GGTGGTTACC
TTTACCAATG CTGCCGCCAC CGAAATGAGG GAAAGAATTG CCCAGGCCAT TTCCGAAAAA
TTGGAGGAGA ATCCCGGCTC GGCCAATATT CAAAGACAAC TTACACTGCT GGGAAAAGCA
TGTATTACCA CCATTCACTC TTTTTGCCTT GAGGTGATAC GCAGCAATTT CCAGCAGATT
AACATAGATC CGGGTTTTCG TATTGCCGAT GAAACCGAAA GCCGGCTTAT GAAACTGGAA
GCTTTGGATG AAGTGTTTGA AGAACAGTAT GAGAATGAAA ACGAAGATTT TTTTGAACTT
CTGGAATGCT ACGGTGGCAA CAGGGATGAC AGGGCACTTC AGGATATGGT GCTGAATCTG
TACGATTTTA TCCAGAGCAG TCCGTGGCCG GAAGAATGGC TGGAAAAAAT GACGGAGAGC
ATGAACATTC CCGATGGAAC GGATTTTGGA AAGACTCTTT GGGGCAGTGT GCTTTTAAGC
TCTGTGAAGA TTGAACTTGA AGGCTTGAAA GAAATGATAT CCCGTGCGCT GGAGATATTG
AAGGATGCTT CAGGGCTGGA GAAATACCGG GCTGTATATA TGGAGGACCT CGCCAATGTT
GATGCGCTGC TCAAGCTTTT AAATGAAGAA AGCGAAATGC AGTGGGACAG GATTTTTAAC
GCGCTCCAGG GCTTTGAGTT TGCAACGCTG CCCCGCTGCG GCAGAGAGGT TGACAAGGAT
AAGCAGGAAA TTGTAAAGAA AATCCGGGAT GATGTAAAGC AAAGGATAAG AAAGTTTCGG
GAAAAGGTAA TTACGTCTGT TTCCAATGAA ATTATAAGCG ATTTAAAAGC CTTGTATCCA
AAAATGAAAT GCCTGGCAAA CTTGGTAAAG CAGCTTGCGG AAAAGTATGC GGAAAAGAAA
AACCGCAAAT CTGTAGTGGA TTTTAACGAC CTGGAGCATT TTTGTCTGGA GATTCTTACA
GAAAGAAAGG AAGACGGAAG CATAATACCT TCCAGAACCG CCATTTCATA CAGGGAACGG
TTTGCGGAGA TTTTGGTGGA CGAGTACCAG GACAGCAATC TGGTGCAGGA GACCATTATC
AATATGATAT CCAAAGGGGA TGACGCAAGT CCCGGTGTAT TTATGGTGGG AGATGTTAAG
CAGAGTATTT ACCGTTTCAG GCAGGCAAGG CCCGAGTTGT TTCTTGAAAA GTACAACACC
TATTTGCCGG ATAAGGGCAG TCCCTGCCGC AAAATAATAC TCTCCAGGAA TTTCAGAAGC
CGCAGGGAAG TGATTGACGC TGTCAATTTC CTGTTCAAAC AGATTATGTC AACAGGCGCC
GGGGAACTGG ATTACACTGA TGCCGAGGCT TTGAATTTCG GAGCGGTTTT TGATGAAAAC
GCCAAGGAAG ATATAACGGT CGGTGGAGAA GTTGAGTTCC ATCTGATTCA GACGGAGGAC
GAAGATAAAA ATTTTACGTT TGAAAATGAA GGTGAGGAAG GGCGGCAGGC CGACGAAGGA
GAAGAGGATG AGGAAATGCT GGACAGCATC CAGTGTGAAG CAAGGCTGGT GGGCCGAAGA
ATCCTTGAAC TGATGAAACC GGATGAAAAT GGGCGGTATT TTAGTGTTTT TGACAAGGCA
AAAAACGAGT ACCGCAGGGT TGAATACCGG GACGTTGTAA TACTTCTCAG AACCACAAGG
AACTGGGCGG AGGTTTTTGT TGATGAACTG TCCGTGATGG GAATACCGGT TTTTGCCGAT
ACCGGAACCG GATTTTTTAA AACAGTGGAA GTCCAGGTAA TGCTGTCCCT TTTGCAGATT
ATTGACAATC CTTTGCAGGA CATTCCTTTG CTTTCGGTGC TGCGTTCGCC GATTGTCGGC
TTTACCACCG ATGAGCTTGC GGAATTGAGG CTTGTTGACA AAAAAGCGCT TCTGTTTGAT
GCATTAAAAA AGCTGGCGGA AAGCGGGCAG GGAGAAGCGG CAGGGAAAGC TTCGGCATTT
CTTGAAAACC TTCAGAAATG GAGGGAAATG TCGCTGTACA TGTCCACTGA CCGGTTGTTG
TGGCAGCTTT ACAATGATAC CGGTTATTAC AGCATTGTCG GAGCAATGCC TGCAGGGGAG
CAGCGGCAGG CAAACCTTAG AATATTGTAT GAGCGGGCCC GGCAGTTTGA GGAGACCAGC
TATAAAGGAT TATTCAATTT TATAAACTTT ATAGACAAAT TAAAAAGCAG CAGGGGCGAC
ATGGGAAGCG CAAAAATTTT GAGTGAAAAT GACAATGTGG TGCGCATAAT GAGCATTCAC
AAGAGCAAGG GACTGGAGTT TCCGGTGGTG ATTGTTGCTG GGTGTGGAAA GAAGTTCAAC
CTTCAGGATA TGAACAAGAG CATTCTTCTG CACCATGAAC TTGGTTTCGG CCCGGATGTG
GTGGACCACA AGCTGAGACT GTCATGGCCG TCGGTGGCCA AACAGGCTAT AAGGGAAAAG
ATTAAGGCTG AAACCCTTTC AGAGGAAATG AGGATACTGT ACGTTGCCCT GACCAGAGCA
CGGGAAAAGC TTGTTATAAC CGGTGCTGTG AAGAATGTGC GCAAGGCGGT GGAGAAGTGG
CTGGATAGTG CATCCGTTCA GAAAAGCAGG CTTTCGGCCT ATGATATGCT AAGCGGAGCC
AATTACCTTG ACTGGATTGG ACCTGCGCTT TTGAGGCACA AAAACTGCGG CGGACTTAGG
GACTGTGTGG GAAGCGCCGG TTTCCGGGGA CTGCTCATAG ATGACCCGTC GGTGTGGAGT
GTAAAGATAT GGAATAAAAC CGATGTGCAA AGCAGCGGAG TTTCGGAGGA ACAGGGAGAA
AGTGAGTTTA TAAAGTGGCT GGACAGTTTG GAAAAAGAAG AGCCTTCGGA ATATGCAGAA
GAAACAGCAA GAAGGCTAAG CTGGAGTTAT CCTTACGTTA AAGCTTCCAA AGTGCCTGCA
AAGGTTTCAG TGACGGAGCT GAAAAGGCGT TACAACGAGG TGGTTTCGGA AGATGTAATG
CAATTTCCGG ATTACATGCC GGTTTTGGTG AAAAAGCCAA TGTTTTTGGA GGAAAAGAAA
GGCCTGACTT ATGCCGAGAA GGGCACGATA CTTCACTTTG TCATGCAGCA CTTGGATTAC
GGTAGGGAGG ATATTGAAGC CCAGATTGAA GAAATGGTGG CAAAAGATTT GTTGACACCT
CAACAGGCAC AGAGTGTGGA TGCAGCCAGA ATCCGGCGTT TTCTAAATTC CCCTCTTGGA
AAGAGGATGC TGGCCTCAAA AAGCATAAAC CGTGAGGTGC CGTTTAATAT TGAGATACCG
TGCCATGAGC TGTACAGGGA TATGGAAGAT GAGGCCTGTC ACGGTGAGAC ACTTCTTCTG
CAGGGAGTTG TCGACTGCTA TTTTGAAGAG CCGGACGGTA TTGTGCTGGT GGATTACAAG
ACCGATTATG TGGCTCCGGG GAATGTTGAG ACGATTCGGG AAAGATACAA GGTGCAGATT
CTTTATTATG CCAGGGCGCT GGAGATGCTC ACCGGAAAGA AGGTAAAGGA GAAGTATATA
TATCTTTTCT GGGATGGGAG AATTTTGGGT TTTTGA
 
Protein sequence
MSERTKWTDE QWEAITGNEK SLLVSAAAGA GKTAVLVERI IRKITDEENP VDIDRLLVVT 
FTNAAATEMR ERIAQAISEK LEENPGSANI QRQLTLLGKA CITTIHSFCL EVIRSNFQQI
NIDPGFRIAD ETESRLMKLE ALDEVFEEQY ENENEDFFEL LECYGGNRDD RALQDMVLNL
YDFIQSSPWP EEWLEKMTES MNIPDGTDFG KTLWGSVLLS SVKIELEGLK EMISRALEIL
KDASGLEKYR AVYMEDLANV DALLKLLNEE SEMQWDRIFN ALQGFEFATL PRCGREVDKD
KQEIVKKIRD DVKQRIRKFR EKVITSVSNE IISDLKALYP KMKCLANLVK QLAEKYAEKK
NRKSVVDFND LEHFCLEILT ERKEDGSIIP SRTAISYRER FAEILVDEYQ DSNLVQETII
NMISKGDDAS PGVFMVGDVK QSIYRFRQAR PELFLEKYNT YLPDKGSPCR KIILSRNFRS
RREVIDAVNF LFKQIMSTGA GELDYTDAEA LNFGAVFDEN AKEDITVGGE VEFHLIQTED
EDKNFTFENE GEEGRQADEG EEDEEMLDSI QCEARLVGRR ILELMKPDEN GRYFSVFDKA
KNEYRRVEYR DVVILLRTTR NWAEVFVDEL SVMGIPVFAD TGTGFFKTVE VQVMLSLLQI
IDNPLQDIPL LSVLRSPIVG FTTDELAELR LVDKKALLFD ALKKLAESGQ GEAAGKASAF
LENLQKWREM SLYMSTDRLL WQLYNDTGYY SIVGAMPAGE QRQANLRILY ERARQFEETS
YKGLFNFINF IDKLKSSRGD MGSAKILSEN DNVVRIMSIH KSKGLEFPVV IVAGCGKKFN
LQDMNKSILL HHELGFGPDV VDHKLRLSWP SVAKQAIREK IKAETLSEEM RILYVALTRA
REKLVITGAV KNVRKAVEKW LDSASVQKSR LSAYDMLSGA NYLDWIGPAL LRHKNCGGLR
DCVGSAGFRG LLIDDPSVWS VKIWNKTDVQ SSGVSEEQGE SEFIKWLDSL EKEEPSEYAE
ETARRLSWSY PYVKASKVPA KVSVTELKRR YNEVVSEDVM QFPDYMPVLV KKPMFLEEKK
GLTYAEKGTI LHFVMQHLDY GREDIEAQIE EMVAKDLLTP QQAQSVDAAR IRRFLNSPLG
KRMLASKSIN REVPFNIEIP CHELYRDMED EACHGETLLL QGVVDCYFEE PDGIVLVDYK
TDYVAPGNVE TIRERYKVQI LYYARALEML TGKKVKEKYI YLFWDGRILG F