Gene Cthe_0208 details

Gene Information

Locus tag: Cthe_0208
Symbol:
ID: 4808626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 252608
End bp: 254899
Gene Length: 2292 bp
Protein Length: 763 aa
Translation table: 11
GC content: 42%
IMG OID: 640105621
Product: single-stranded-DNA-specific exonuclease RecJ
Protein accession: YP_001036642
Protein GI: 125972732
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
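
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record; the variable names are illustrative, and it assumes the coordinates are inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Start bp, End bp, Gene Length and Protein Length fields.
start_bp = 252608
end_bp = 254899
reported_gene_length = 2292      # bp
reported_protein_length = 763    # aa

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the reported gene length and protein length agree with the coordinates.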


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.904497
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAGTCA GGGTGAATAT ATTAAACAAG GGAGTTTCCA TTCCGCAGGA AGTGCTGGAT 
GCTTGCGGCG GTGACGAGCT TGTGGCAAGG ATTTTTTACA ACCGGGGATA TAAAAATCCG
GAAACTATAA GGCAGATGCT GAATCCAGAG CTTTATGTGC CGACAAAGCC GGATGAATTT
CCGGATATGC CCAGGGCTGT AGACAGGATT CTTCGTGCTG CTGACAACGA GGAAAAAATA
TGTGTGTATG GGGACTATGA TGTTGACGGT GTTACCAGTA CCGTTACCCT TGTTGAATGC
CTGAATTTTT TTACATCAAA GGTGGTCTAC CATGTTCCGG ACAGGTTTAC GGAAGGGTAC
GGCATGAATG AGGAAATAGT GAGAAAACTT GCTCAGGATG GCGTTTCATT AATAATTACC
TGTGACTGTG GAATTTCGAA TGTCAGGGAA ATAACTCTTG CCAAGGAGCT GGGGATGGAT
GTGGTGCTTA CCGACCATCA TACCGTTCCC GACGAACTTC CTCCTGCCGA TGCCATATTA
AATCCCAAGC TTTTGGAAGA GGGGCACAGG GCAAGGAATA TCTCCGGCTG TGGGATGGTG
TATTTTTTAT GCCTTGCCTT GCTTGAGAAA AAGGGCTTTC CTGACAGGGC GGAAAGATTT
CTGGACATGC TTGCGCTATC CCTTATAGCG GATGTTGTGA GCCTTAACGG AGAGAACAGG
TACCTTCTTC AAAAAGCTCT GCCGGCTTTG TTTAATACCA GAAGAATTGG GCTCAGACAG
CTTCTTGAGG TGGCGGAAAG AAATGGAAAG CTTGAAAATG AGGAAGATGT TGCATTTCAG
ATAGCGCCCA GAATAAATGC GGCGGGAAGG ATGGACACGG CACGGCTTCC CGTGGAGCTT
TTTTTATGTC AGGACTTGGA AAAAGCACGC ATTATGGCTG AGAAAATTGA CTCACTGAAT
ACAGAAAGAA AAAGGGTGCA GCAGTCAATT GTGGATGAAG CCGTGGAGAT GGTAGAAACA
AGGAAAAAGA ACAAGACAAT TCTGGTGCTG TATAAAGAGT ACTGGCATCA TGGTATAATC
GGAATTGCTG CCGGAAGGAT TTGCGAGCTT TATCGAAAGC CGGCAATTCT GTTTTCCTTA
AAAGAAGATG GAATTACGGC AGTAGGTTCT GCCAGGTCCA TTGAGGAGGT AAACATTTAC
GAACTGATTA AGGAATGCAG CGGAAAGCTT TTAAAGTTTG GAGGACACTC CCAGGCTGCC
GGACTTTCAA TAAGAAAAGA TGATATTGAG GAGTTTATAA GCCAGATTGA GATGGAGGCG
GAAAACAGGT ATTTTATAAA AGACATGGTG AATGTCAATG CCGATATGGA ACTTGGCATA
GAAGATATAA ATGAAGAGCT CTACGACAGA ATTCAATCGG CGGGACCTTA CGGAGAAGGG
TTTGAGGCTC CTTGTTTTTG CATAAGAAAT GTTATTGTTT TAAGTGACAG AATGACGGAG
AAAAAACATC ATATAATGGT ATTGGAAGAT CAAAAAGGAA ACAGAATACC TGCAGTCAAG
TGGTTTGGTG AGGATGAATC CTTTGAGGGC AGGTGTTTTG ATGTAACCTG CAGAATAGGC
CGGAACAATT ACAGCAAGGA CGCGGGTATT CAACTGACTT TGGAATATAT GGTTGAAAGT
TTCGGGAAAT TTAAAAAGCT GTTTGAAGGT GAAATAATAG ATGAGAGAAA AACAACTGTT
GAAAACCTCT TAAGGAAATA TCCCAATGCC CAAATATTCT ACGAAGGGCT TCAGACGGCA
TGTCCCGTTG AGAATACCAT TGACAGGTTT TCGGTGAAAA ACTGTAAGGA GCTTGTTTTT
TTGTCCACCC CTGCGAATAC TGAGATTTTC AAAGAGGTTA TTGCCCTTGC CAATCCAGAA
AAGGTGATAA TAAATTTTGC CGTCCTTTCG AACTATACCT TTAAAGGCTT TGTATTAAAC
CTTTTGGGGC TTATAAAGCA CATAATAAAG AGACGGGACG GAAGAGCTTA TATTGATGAG
CTTTCTTTGA AGCTTTGCGT TGAGGAGAAC ATTGTAAAAG CGGGATTAAA ATATCTTGGC
TCTTCTGGAA TGTTAAACTA TACTTTAAGT GACGATGAGC AAAAAGTTTA TTTGTCTGAA
GGAAAAGGTG TGGCTGACAG AAATGCTTTT ATGGCGAAAA AAAACCTTTC CGACGCTTTG
GCGGAAAAGA ATGCCTATCA GCAGTTTATT TTAAAGATGG AGATAGATAA ATTCAGGGAA
TATCTTAAGT AA
 
Protein sequence
MKVRVNILNK GVSIPQEVLD ACGGDELVAR IFYNRGYKNP ETIRQMLNPE LYVPTKPDEF 
PDMPRAVDRI LRAADNEEKI CVYGDYDVDG VTSTVTLVEC LNFFTSKVVY HVPDRFTEGY
GMNEEIVRKL AQDGVSLIIT CDCGISNVRE ITLAKELGMD VVLTDHHTVP DELPPADAIL
NPKLLEEGHR ARNISGCGMV YFLCLALLEK KGFPDRAERF LDMLALSLIA DVVSLNGENR
YLLQKALPAL FNTRRIGLRQ LLEVAERNGK LENEEDVAFQ IAPRINAAGR MDTARLPVEL
FLCQDLEKAR IMAEKIDSLN TERKRVQQSI VDEAVEMVET RKKNKTILVL YKEYWHHGII
GIAAGRICEL YRKPAILFSL KEDGITAVGS ARSIEEVNIY ELIKECSGKL LKFGGHSQAA
GLSIRKDDIE EFISQIEMEA ENRYFIKDMV NVNADMELGI EDINEELYDR IQSAGPYGEG
FEAPCFCIRN VIVLSDRMTE KKHHIMVLED QKGNRIPAVK WFGEDESFEG RCFDVTCRIG
RNNYSKDAGI QLTLEYMVES FGKFKKLFEG EIIDERKTTV ENLLRKYPNA QIFYEGLQTA
CPVENTIDRF SVKNCKELVF LSTPANTEIF KEVIALANPE KVIINFAVLS NYTFKGFVLN
LLGLIKHIIK RRDGRAYIDE LSLKLCVEEN IVKAGLKYLG SSGMLNYTLS DDEQKVYLSE
GKGVADRNAF MAKKNLSDAL AEKNAYQQFI LKMEIDKFRE YLK
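
The gene sequence starts with TTG, while the protein sequence starts with M. This is consistent with translation table 11, where TTG can serve as an initiation codon. The sketch below illustrates this on the first 60 nucleotides of the gene sequence only. It is not part of the original record: the function name `translate_table11_sketch` is hypothetical, the codon table is the standard one (which table 11 shares for elongation), and the start-codon set is an assumption based on the NCBI definition of table 11.

```python
# First 60 nucleotides of the gene sequence listed above (spaces removed).
gene_prefix = "TTGAAAGTCAGGGTGAATATATTAAACAAGGGAGTTTCCATTCCGCAGGAAGTGCTGGAT"

# Standard codon table in TCAG order; table 11 uses the same elongation code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: initiation codons of translation table 11 (bacterial code).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11_sketch(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


print(translate_table11_sketch(gene_prefix))  # MKVRVNILNKGVSIPQEVLD
```

The output matches the first 20 residues of the protein sequence listed above. Applying the same function to the full 2292 bp gene sequence would be the natural next check.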