Gene Information |
Locus tag | Cthe_0309 |
Symbol | |
ID | 4808527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 386019 |
End bp | 388001 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105720 |
Product | excinuclease ABC subunit B |
Protein accession | YP_001036740 |
Protein GI | 125972830 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0556] Helicase subunit of the DNA excision repair complex |
TIGRFAM ID | [TIGR00631] excinuclease ABC, B subunit |
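The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information table.
start_bp = 386019
end_bp = 388001

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1983 660
```

Both results agree with the Gene Length (1983 bp) and Protein Length (660 aa) fields.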
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00254602 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAAAT TTAAACTGGT GTCTGATTAC AAACCCTGCG GAGATCAACC GGAGGCTATA GATAAACTTG TGGAAGGCAT TAACAGGGGA TACAGGGGTC AGACTCTTTT GGGTGTGACC GGTTCCGGTA AAACATTTAC CATGGCCAAT GTTATTGAAA GGGTTCAGAA ACCCACCCTT GTGATTGCCC ACAACAAGAC TCTTGCGGCA CAGCTTTGCA GTGAGTTTAA GGAGTTTTTT CCCAACAACT GTGTTGAATA TTTTGTAAGC TATTATGACT ATTACCAGCC GGAGGCTTAT ATTCCCGCCA CCGATACCTA TATTGAGAAG GATTCGTCGA TAAATGACGA GATTGACAAG CTGAGGCACT CGGCTACTGC TGCGCTGTTT GAACGAAGGG ACGTTATAAT AGTTGCAAGC GTTTCGTGTA TTTACGGTTT GGGAGACCCG GAAGATTATA CTGATTTGAT GCTTTCGTTG CGTCCCGGCA TGATAAAGGA CAGGGATGAA ATAATCAGGA AGCTGGTGGA CATTCAGTAC GAAAGAAATG AAATTGATTT TAAAAGAGGA AAATTCAGGG TCAGGGGTGA CATACTGGAG ATATTCCCGG CTTCATCTTC TGACAAAGTG ATACGGGTGG AGTTTTTCGG AGAAGAAATT GATAGGATTA CCGAAGTTGA TTCACTGACC GGTGAAATAA CCGGAGTATG CTCTCATGTT GCCATTTTTC CGGCTTCCCA CTATGCCACG ACAAAGGCTA AAATGCAAAG GGCCATAGCT TCAATTGAAC AGGAGCTTGA GGAGCGTGTC AGGGAATTAA AATCCCAGGG AAAACTCCTT GAAGCCCAGA GGCTTGAACA GCGGACAAGG TATGACCTTG AAATGATGCA GGAAATAGGG TTTTGCCAAG GTATAGAGAA TTATTCGAGA CATATAAGCG GAAGAGCTCC GGGCAGTCCG CCTTTTACGC TTATTGATTA CTTCCCGAAG GATTTTTTGC TGATAATTGA CGAGTCCCAT GTGACCATTC CCCAGATTGG AGCCATGTAC AACGGTGACC GGTCGAGAAA GGAATCCCTT GTGGAATACG GTTTCAGGCT CCCTTCGGCT TTTGACAACA GACCGCTTAC GTTTGAGGAG TTTGAGAAAA AAATCAACCA GGTTATATTT GTCAGTGCTA CGCCGGCAAA GTATGAAAGG GAGCATTCGC AGCAGATAGT GGAGCAGATT ATAAGGCCTA CAGGGTTGCT TGATCCTGAG ATTGTTGTGA AACCTGTAAA AGGTCAGATT GACGACCTGA TTGGGGAAAT AAGTGAAAGA GTACAGAAAA ATCAGAGAGT TATGATTACC ACCCTTACCA AAAAGATGGC GGAGGATTTG ACGGATTATT TGAGGGAGCT TGACTTTAAA GTTGAATATT TGCATTCTGA CATTGATACC ATTGAGAGAA TGGAAATAAT ACGAAACTTA AGACTTGGAG TTTTTGACGT CCTGGTTGGC ATAAATTTGC TAAGAGAAGG TCTGGATATT CCGGAAGTGT CCCTTGTGGC CATTCTGGAT GCCGACAAGG AAGGCTTCCT GCGTTCGGAG ACCTCATTGA TTCAGACTAT AGGAAGAGCG GCCAGAAATG TTGAGGGAAA AGTCATAATG TATGCCGATA CCATTACGGA CTCGATGAGA AGAGCCATTG ACGAGACCAA CAGAAGAAGA AAGATACAGT CGGAATACAA TCAGAAGCAC GGTATAACGC CGAAGAGCGT TCAAAAGGGT ATAAGGGATG TAATTGAAAT TACAAAAGTT 
GCTGAAGAAG ATGCAAAGTA CTTTATTCGC GGCGACGAGG ATTCGATGGA TAAGGATGAA GTTTTGGACC TTATTGAAAA ACTCACCAAT GAAATGAAGG CGGCAGCGGC GGAACTTCAA TTTGAACGGG CTGCGGAGCT CAGGGACAAA ATTGCCGAAC TGAAAAAGAA AATAGGAGCT TAA
|
Protein sequence | MHKFKLVSDY KPCGDQPEAI DKLVEGINRG YRGQTLLGVT GSGKTFTMAN VIERVQKPTL VIAHNKTLAA QLCSEFKEFF PNNCVEYFVS YYDYYQPEAY IPATDTYIEK DSSINDEIDK LRHSATAALF ERRDVIIVAS VSCIYGLGDP EDYTDLMLSL RPGMIKDRDE IIRKLVDIQY ERNEIDFKRG KFRVRGDILE IFPASSSDKV IRVEFFGEEI DRITEVDSLT GEITGVCSHV AIFPASHYAT TKAKMQRAIA SIEQELEERV RELKSQGKLL EAQRLEQRTR YDLEMMQEIG FCQGIENYSR HISGRAPGSP PFTLIDYFPK DFLLIIDESH VTIPQIGAMY NGDRSRKESL VEYGFRLPSA FDNRPLTFEE FEKKINQVIF VSATPAKYER EHSQQIVEQI IRPTGLLDPE IVVKPVKGQI DDLIGEISER VQKNQRVMIT TLTKKMAEDL TDYLRELDFK VEYLHSDIDT IERMEIIRNL RLGVFDVLVG INLLREGLDI PEVSLVAILD ADKEGFLRSE TSLIQTIGRA ARNVEGKVIM YADTITDSMR RAIDETNRRR KIQSEYNQKH GITPKSVQKG IRDVIEITKV AEEDAKYFIR GDEDSMDKDE VLDLIEKLTN EMKAAAAELQ FERAAELRDK IAELKKKIGA
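The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is one way to do that in plain Python, under stated assumptions: the amino acid string is the standard-code assignment in NCBI order (bases T, C, A, G), which translation table 11 shares for amino acid assignments; `translate` and `gc_percent` are hypothetical helper names, not part of any database tool. Whitespace is stripped first because the record prints the sequence in blocks of ten.

```python
from itertools import product

# Assumption: amino acid assignments in NCBI order (T, C, A, G at each position).
# Table 11 uses the same assignments as the standard code; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCATAAAT TTAAACTGGT GTCTGATTAC"))  # MHKFKLVSDY
print(translate("ATAGGAGCT TAA"))                      # IGA
```

The two outputs match the first ten residues (MHKFKLVSDY) and the last three residues (IGA) of the protein sequence above. Passing the full gene sequence to `gc_percent` would give the value to compare with the GC content field.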