Gene Cthe_0309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0309 
Symbol 
ID4808527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp386019 
End bp388001 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content44% 
IMG OID640105720 
Productexcinuclease ABC subunit B 
Protein accessionYP_001036740 
Protein GI125972830 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00254602 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAAAT TTAAACTGGT GTCTGATTAC AAACCCTGCG GAGATCAACC GGAGGCTATA 
GATAAACTTG TGGAAGGCAT TAACAGGGGA TACAGGGGTC AGACTCTTTT GGGTGTGACC
GGTTCCGGTA AAACATTTAC CATGGCCAAT GTTATTGAAA GGGTTCAGAA ACCCACCCTT
GTGATTGCCC ACAACAAGAC TCTTGCGGCA CAGCTTTGCA GTGAGTTTAA GGAGTTTTTT
CCCAACAACT GTGTTGAATA TTTTGTAAGC TATTATGACT ATTACCAGCC GGAGGCTTAT
ATTCCCGCCA CCGATACCTA TATTGAGAAG GATTCGTCGA TAAATGACGA GATTGACAAG
CTGAGGCACT CGGCTACTGC TGCGCTGTTT GAACGAAGGG ACGTTATAAT AGTTGCAAGC
GTTTCGTGTA TTTACGGTTT GGGAGACCCG GAAGATTATA CTGATTTGAT GCTTTCGTTG
CGTCCCGGCA TGATAAAGGA CAGGGATGAA ATAATCAGGA AGCTGGTGGA CATTCAGTAC
GAAAGAAATG AAATTGATTT TAAAAGAGGA AAATTCAGGG TCAGGGGTGA CATACTGGAG
ATATTCCCGG CTTCATCTTC TGACAAAGTG ATACGGGTGG AGTTTTTCGG AGAAGAAATT
GATAGGATTA CCGAAGTTGA TTCACTGACC GGTGAAATAA CCGGAGTATG CTCTCATGTT
GCCATTTTTC CGGCTTCCCA CTATGCCACG ACAAAGGCTA AAATGCAAAG GGCCATAGCT
TCAATTGAAC AGGAGCTTGA GGAGCGTGTC AGGGAATTAA AATCCCAGGG AAAACTCCTT
GAAGCCCAGA GGCTTGAACA GCGGACAAGG TATGACCTTG AAATGATGCA GGAAATAGGG
TTTTGCCAAG GTATAGAGAA TTATTCGAGA CATATAAGCG GAAGAGCTCC GGGCAGTCCG
CCTTTTACGC TTATTGATTA CTTCCCGAAG GATTTTTTGC TGATAATTGA CGAGTCCCAT
GTGACCATTC CCCAGATTGG AGCCATGTAC AACGGTGACC GGTCGAGAAA GGAATCCCTT
GTGGAATACG GTTTCAGGCT CCCTTCGGCT TTTGACAACA GACCGCTTAC GTTTGAGGAG
TTTGAGAAAA AAATCAACCA GGTTATATTT GTCAGTGCTA CGCCGGCAAA GTATGAAAGG
GAGCATTCGC AGCAGATAGT GGAGCAGATT ATAAGGCCTA CAGGGTTGCT TGATCCTGAG
ATTGTTGTGA AACCTGTAAA AGGTCAGATT GACGACCTGA TTGGGGAAAT AAGTGAAAGA
GTACAGAAAA ATCAGAGAGT TATGATTACC ACCCTTACCA AAAAGATGGC GGAGGATTTG
ACGGATTATT TGAGGGAGCT TGACTTTAAA GTTGAATATT TGCATTCTGA CATTGATACC
ATTGAGAGAA TGGAAATAAT ACGAAACTTA AGACTTGGAG TTTTTGACGT CCTGGTTGGC
ATAAATTTGC TAAGAGAAGG TCTGGATATT CCGGAAGTGT CCCTTGTGGC CATTCTGGAT
GCCGACAAGG AAGGCTTCCT GCGTTCGGAG ACCTCATTGA TTCAGACTAT AGGAAGAGCG
GCCAGAAATG TTGAGGGAAA AGTCATAATG TATGCCGATA CCATTACGGA CTCGATGAGA
AGAGCCATTG ACGAGACCAA CAGAAGAAGA AAGATACAGT CGGAATACAA TCAGAAGCAC
GGTATAACGC CGAAGAGCGT TCAAAAGGGT ATAAGGGATG TAATTGAAAT TACAAAAGTT
GCTGAAGAAG ATGCAAAGTA CTTTATTCGC GGCGACGAGG ATTCGATGGA TAAGGATGAA
GTTTTGGACC TTATTGAAAA ACTCACCAAT GAAATGAAGG CGGCAGCGGC GGAACTTCAA
TTTGAACGGG CTGCGGAGCT CAGGGACAAA ATTGCCGAAC TGAAAAAGAA AATAGGAGCT
TAA
 
Protein sequence
MHKFKLVSDY KPCGDQPEAI DKLVEGINRG YRGQTLLGVT GSGKTFTMAN VIERVQKPTL 
VIAHNKTLAA QLCSEFKEFF PNNCVEYFVS YYDYYQPEAY IPATDTYIEK DSSINDEIDK
LRHSATAALF ERRDVIIVAS VSCIYGLGDP EDYTDLMLSL RPGMIKDRDE IIRKLVDIQY
ERNEIDFKRG KFRVRGDILE IFPASSSDKV IRVEFFGEEI DRITEVDSLT GEITGVCSHV
AIFPASHYAT TKAKMQRAIA SIEQELEERV RELKSQGKLL EAQRLEQRTR YDLEMMQEIG
FCQGIENYSR HISGRAPGSP PFTLIDYFPK DFLLIIDESH VTIPQIGAMY NGDRSRKESL
VEYGFRLPSA FDNRPLTFEE FEKKINQVIF VSATPAKYER EHSQQIVEQI IRPTGLLDPE
IVVKPVKGQI DDLIGEISER VQKNQRVMIT TLTKKMAEDL TDYLRELDFK VEYLHSDIDT
IERMEIIRNL RLGVFDVLVG INLLREGLDI PEVSLVAILD ADKEGFLRSE TSLIQTIGRA
ARNVEGKVIM YADTITDSMR RAIDETNRRR KIQSEYNQKH GITPKSVQKG IRDVIEITKV
AEEDAKYFIR GDEDSMDKDE VLDLIEKLTN EMKAAAAELQ FERAAELRDK IAELKKKIGA