Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2039 |
Symbol | |
ID | 4811009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2424458 |
End bp | 2428213 |
Gene Length | 3756 bp |
Protein Length | 1251 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640107446 |
Product | DNA helicase/exodeoxyribonuclease V, subunit A |
Protein accession | YP_001038441 |
Protein GI | 125974531 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) |
TIGRFAM ID | [TIGR02785] recombination helicase AddA, Firmicutes type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGA GGACTAAATG GACGGATGAA CAATGGGAGG CCATAACCGG GAATGAAAAG AGTCTTTTGG TTTCAGCGGC GGCAGGAGCG GGAAAAACAG CGGTACTGGT GGAAAGAATC ATCCGAAAAA TCACCGACGA AGAAAATCCC GTGGACATTG ACAGGCTGCT GGTGGTTACC TTTACCAATG CTGCCGCCAC CGAAATGAGG GAAAGAATTG CCCAGGCCAT TTCCGAAAAA TTGGAGGAGA ATCCCGGCTC GGCCAATATT CAAAGACAAC TTACACTGCT GGGAAAAGCA TGTATTACCA CCATTCACTC TTTTTGCCTT GAGGTGATAC GCAGCAATTT CCAGCAGATT AACATAGATC CGGGTTTTCG TATTGCCGAT GAAACCGAAA GCCGGCTTAT GAAACTGGAA GCTTTGGATG AAGTGTTTGA AGAACAGTAT GAGAATGAAA ACGAAGATTT TTTTGAACTT CTGGAATGCT ACGGTGGCAA CAGGGATGAC AGGGCACTTC AGGATATGGT GCTGAATCTG TACGATTTTA TCCAGAGCAG TCCGTGGCCG GAAGAATGGC TGGAAAAAAT GACGGAGAGC ATGAACATTC CCGATGGAAC GGATTTTGGA AAGACTCTTT GGGGCAGTGT GCTTTTAAGC TCTGTGAAGA TTGAACTTGA AGGCTTGAAA GAAATGATAT CCCGTGCGCT GGAGATATTG AAGGATGCTT CAGGGCTGGA GAAATACCGG GCTGTATATA TGGAGGACCT CGCCAATGTT GATGCGCTGC TCAAGCTTTT AAATGAAGAA AGCGAAATGC AGTGGGACAG GATTTTTAAC GCGCTCCAGG GCTTTGAGTT TGCAACGCTG CCCCGCTGCG GCAGAGAGGT TGACAAGGAT AAGCAGGAAA TTGTAAAGAA AATCCGGGAT GATGTAAAGC AAAGGATAAG AAAGTTTCGG GAAAAGGTAA TTACGTCTGT TTCCAATGAA ATTATAAGCG ATTTAAAAGC CTTGTATCCA AAAATGAAAT GCCTGGCAAA CTTGGTAAAG CAGCTTGCGG AAAAGTATGC GGAAAAGAAA AACCGCAAAT CTGTAGTGGA TTTTAACGAC CTGGAGCATT TTTGTCTGGA GATTCTTACA GAAAGAAAGG AAGACGGAAG CATAATACCT TCCAGAACCG CCATTTCATA CAGGGAACGG TTTGCGGAGA TTTTGGTGGA CGAGTACCAG GACAGCAATC TGGTGCAGGA GACCATTATC AATATGATAT CCAAAGGGGA TGACGCAAGT CCCGGTGTAT TTATGGTGGG AGATGTTAAG CAGAGTATTT ACCGTTTCAG GCAGGCAAGG CCCGAGTTGT TTCTTGAAAA GTACAACACC TATTTGCCGG ATAAGGGCAG TCCCTGCCGC AAAATAATAC TCTCCAGGAA TTTCAGAAGC CGCAGGGAAG TGATTGACGC TGTCAATTTC CTGTTCAAAC AGATTATGTC AACAGGCGCC GGGGAACTGG ATTACACTGA TGCCGAGGCT TTGAATTTCG GAGCGGTTTT TGATGAAAAC GCCAAGGAAG ATATAACGGT CGGTGGAGAA GTTGAGTTCC ATCTGATTCA GACGGAGGAC GAAGATAAAA ATTTTACGTT TGAAAATGAA GGTGAGGAAG GGCGGCAGGC CGACGAAGGA GAAGAGGATG AGGAAATGCT GGACAGCATC CAGTGTGAAG CAAGGCTGGT GGGCCGAAGA ATCCTTGAAC TGATGAAACC GGATGAAAAT GGGCGGTATT TTAGTGTTTT TGACAAGGCA AAAAACGAGT ACCGCAGGGT TGAATACCGG GACGTTGTAA TACTTCTCAG AACCACAAGG AACTGGGCGG AGGTTTTTGT TGATGAACTG TCCGTGATGG GAATACCGGT TTTTGCCGAT ACCGGAACCG GATTTTTTAA AACAGTGGAA GTCCAGGTAA TGCTGTCCCT TTTGCAGATT ATTGACAATC CTTTGCAGGA CATTCCTTTG CTTTCGGTGC TGCGTTCGCC GATTGTCGGC TTTACCACCG ATGAGCTTGC GGAATTGAGG CTTGTTGACA AAAAAGCGCT TCTGTTTGAT GCATTAAAAA AGCTGGCGGA AAGCGGGCAG GGAGAAGCGG CAGGGAAAGC TTCGGCATTT CTTGAAAACC TTCAGAAATG GAGGGAAATG TCGCTGTACA TGTCCACTGA CCGGTTGTTG TGGCAGCTTT ACAATGATAC CGGTTATTAC AGCATTGTCG GAGCAATGCC TGCAGGGGAG CAGCGGCAGG CAAACCTTAG AATATTGTAT GAGCGGGCCC GGCAGTTTGA GGAGACCAGC TATAAAGGAT TATTCAATTT TATAAACTTT ATAGACAAAT TAAAAAGCAG CAGGGGCGAC ATGGGAAGCG CAAAAATTTT GAGTGAAAAT GACAATGTGG TGCGCATAAT GAGCATTCAC AAGAGCAAGG GACTGGAGTT TCCGGTGGTG ATTGTTGCTG GGTGTGGAAA GAAGTTCAAC CTTCAGGATA TGAACAAGAG CATTCTTCTG CACCATGAAC TTGGTTTCGG CCCGGATGTG GTGGACCACA AGCTGAGACT GTCATGGCCG TCGGTGGCCA AACAGGCTAT AAGGGAAAAG ATTAAGGCTG AAACCCTTTC AGAGGAAATG AGGATACTGT ACGTTGCCCT GACCAGAGCA CGGGAAAAGC TTGTTATAAC CGGTGCTGTG AAGAATGTGC GCAAGGCGGT GGAGAAGTGG CTGGATAGTG CATCCGTTCA GAAAAGCAGG CTTTCGGCCT ATGATATGCT AAGCGGAGCC AATTACCTTG ACTGGATTGG ACCTGCGCTT TTGAGGCACA AAAACTGCGG CGGACTTAGG GACTGTGTGG GAAGCGCCGG TTTCCGGGGA CTGCTCATAG ATGACCCGTC GGTGTGGAGT GTAAAGATAT GGAATAAAAC CGATGTGCAA AGCAGCGGAG TTTCGGAGGA ACAGGGAGAA AGTGAGTTTA TAAAGTGGCT GGACAGTTTG GAAAAAGAAG AGCCTTCGGA ATATGCAGAA GAAACAGCAA GAAGGCTAAG CTGGAGTTAT CCTTACGTTA AAGCTTCCAA AGTGCCTGCA AAGGTTTCAG TGACGGAGCT GAAAAGGCGT TACAACGAGG TGGTTTCGGA AGATGTAATG CAATTTCCGG ATTACATGCC GGTTTTGGTG AAAAAGCCAA TGTTTTTGGA GGAAAAGAAA GGCCTGACTT ATGCCGAGAA GGGCACGATA CTTCACTTTG TCATGCAGCA CTTGGATTAC GGTAGGGAGG ATATTGAAGC CCAGATTGAA GAAATGGTGG CAAAAGATTT GTTGACACCT CAACAGGCAC AGAGTGTGGA TGCAGCCAGA ATCCGGCGTT TTCTAAATTC CCCTCTTGGA AAGAGGATGC TGGCCTCAAA AAGCATAAAC CGTGAGGTGC CGTTTAATAT TGAGATACCG TGCCATGAGC TGTACAGGGA TATGGAAGAT GAGGCCTGTC ACGGTGAGAC ACTTCTTCTG CAGGGAGTTG TCGACTGCTA TTTTGAAGAG CCGGACGGTA TTGTGCTGGT GGATTACAAG ACCGATTATG TGGCTCCGGG GAATGTTGAG ACGATTCGGG AAAGATACAA GGTGCAGATT CTTTATTATG CCAGGGCGCT GGAGATGCTC ACCGGAAAGA AGGTAAAGGA GAAGTATATA TATCTTTTCT GGGATGGGAG AATTTTGGGT TTTTGA
|
Protein sequence | MSERTKWTDE QWEAITGNEK SLLVSAAAGA GKTAVLVERI IRKITDEENP VDIDRLLVVT FTNAAATEMR ERIAQAISEK LEENPGSANI QRQLTLLGKA CITTIHSFCL EVIRSNFQQI NIDPGFRIAD ETESRLMKLE ALDEVFEEQY ENENEDFFEL LECYGGNRDD RALQDMVLNL YDFIQSSPWP EEWLEKMTES MNIPDGTDFG KTLWGSVLLS SVKIELEGLK EMISRALEIL KDASGLEKYR AVYMEDLANV DALLKLLNEE SEMQWDRIFN ALQGFEFATL PRCGREVDKD KQEIVKKIRD DVKQRIRKFR EKVITSVSNE IISDLKALYP KMKCLANLVK QLAEKYAEKK NRKSVVDFND LEHFCLEILT ERKEDGSIIP SRTAISYRER FAEILVDEYQ DSNLVQETII NMISKGDDAS PGVFMVGDVK QSIYRFRQAR PELFLEKYNT YLPDKGSPCR KIILSRNFRS RREVIDAVNF LFKQIMSTGA GELDYTDAEA LNFGAVFDEN AKEDITVGGE VEFHLIQTED EDKNFTFENE GEEGRQADEG EEDEEMLDSI QCEARLVGRR ILELMKPDEN GRYFSVFDKA KNEYRRVEYR DVVILLRTTR NWAEVFVDEL SVMGIPVFAD TGTGFFKTVE VQVMLSLLQI IDNPLQDIPL LSVLRSPIVG FTTDELAELR LVDKKALLFD ALKKLAESGQ GEAAGKASAF LENLQKWREM SLYMSTDRLL WQLYNDTGYY SIVGAMPAGE QRQANLRILY ERARQFEETS YKGLFNFINF IDKLKSSRGD MGSAKILSEN DNVVRIMSIH KSKGLEFPVV IVAGCGKKFN LQDMNKSILL HHELGFGPDV VDHKLRLSWP SVAKQAIREK IKAETLSEEM RILYVALTRA REKLVITGAV KNVRKAVEKW LDSASVQKSR LSAYDMLSGA NYLDWIGPAL LRHKNCGGLR DCVGSAGFRG LLIDDPSVWS VKIWNKTDVQ SSGVSEEQGE SEFIKWLDSL EKEEPSEYAE ETARRLSWSY PYVKASKVPA KVSVTELKRR YNEVVSEDVM QFPDYMPVLV KKPMFLEEKK GLTYAEKGTI LHFVMQHLDY GREDIEAQIE EMVAKDLLTP QQAQSVDAAR IRRFLNSPLG KRMLASKSIN REVPFNIEIP CHELYRDMED EACHGETLLL QGVVDCYFEE PDGIVLVDYK TDYVAPGNVE TIRERYKVQI LYYARALEML TGKKVKEKYI YLFWDGRILG F
|
| |