Gene Information |
Locus tag | Cthe_0208 |
Symbol | |
ID | 4808626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 252608 |
End bp | 254899 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105621 |
Product | single-stranded-DNA-specific exonuclease RecJ |
Protein accession | YP_001036642 |
Protein GI | 125972732 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0608] Single-stranded DNA-specific exonuclease |
TIGRFAM ID | [TIGR00644] single-stranded-DNA-specific exonuclease RecJ |
|
|
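The coordinate and length fields above are related by simple arithmetic, which the short sketch below reproduces. It assumes the Start bp and End bp values are 1-based and inclusive (the listed Gene Length is consistent with that reading) and that the CDS ends in a stop codon. The function names are illustrative and are not part of the database. The GC helper is shown on a short fragment only; it makes no claim about how the listed 42% figure was computed.

```python
# Values copied from the Gene Information table above.
START_BP = 252608
END_BP = 254899


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


def gc_percent(seq):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2292
print(protein_length(length_bp))  # 763
print(gc_percent("TTGAAAGTCA"))   # 30.0 (first ten bases only)
```

The two printed lengths match the Gene Length and Protein Length rows of the table.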
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.904497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAGTCA GGGTGAATAT ATTAAACAAG GGAGTTTCCA TTCCGCAGGA AGTGCTGGAT GCTTGCGGCG GTGACGAGCT TGTGGCAAGG ATTTTTTACA ACCGGGGATA TAAAAATCCG GAAACTATAA GGCAGATGCT GAATCCAGAG CTTTATGTGC CGACAAAGCC GGATGAATTT CCGGATATGC CCAGGGCTGT AGACAGGATT CTTCGTGCTG CTGACAACGA GGAAAAAATA TGTGTGTATG GGGACTATGA TGTTGACGGT GTTACCAGTA CCGTTACCCT TGTTGAATGC CTGAATTTTT TTACATCAAA GGTGGTCTAC CATGTTCCGG ACAGGTTTAC GGAAGGGTAC GGCATGAATG AGGAAATAGT GAGAAAACTT GCTCAGGATG GCGTTTCATT AATAATTACC TGTGACTGTG GAATTTCGAA TGTCAGGGAA ATAACTCTTG CCAAGGAGCT GGGGATGGAT GTGGTGCTTA CCGACCATCA TACCGTTCCC GACGAACTTC CTCCTGCCGA TGCCATATTA AATCCCAAGC TTTTGGAAGA GGGGCACAGG GCAAGGAATA TCTCCGGCTG TGGGATGGTG TATTTTTTAT GCCTTGCCTT GCTTGAGAAA AAGGGCTTTC CTGACAGGGC GGAAAGATTT CTGGACATGC TTGCGCTATC CCTTATAGCG GATGTTGTGA GCCTTAACGG AGAGAACAGG TACCTTCTTC AAAAAGCTCT GCCGGCTTTG TTTAATACCA GAAGAATTGG GCTCAGACAG CTTCTTGAGG TGGCGGAAAG AAATGGAAAG CTTGAAAATG AGGAAGATGT TGCATTTCAG ATAGCGCCCA GAATAAATGC GGCGGGAAGG ATGGACACGG CACGGCTTCC CGTGGAGCTT TTTTTATGTC AGGACTTGGA AAAAGCACGC ATTATGGCTG AGAAAATTGA CTCACTGAAT ACAGAAAGAA AAAGGGTGCA GCAGTCAATT GTGGATGAAG CCGTGGAGAT GGTAGAAACA AGGAAAAAGA ACAAGACAAT TCTGGTGCTG TATAAAGAGT ACTGGCATCA TGGTATAATC GGAATTGCTG CCGGAAGGAT TTGCGAGCTT TATCGAAAGC CGGCAATTCT GTTTTCCTTA AAAGAAGATG GAATTACGGC AGTAGGTTCT GCCAGGTCCA TTGAGGAGGT AAACATTTAC GAACTGATTA AGGAATGCAG CGGAAAGCTT TTAAAGTTTG GAGGACACTC CCAGGCTGCC GGACTTTCAA TAAGAAAAGA TGATATTGAG GAGTTTATAA GCCAGATTGA GATGGAGGCG GAAAACAGGT ATTTTATAAA AGACATGGTG AATGTCAATG CCGATATGGA ACTTGGCATA GAAGATATAA ATGAAGAGCT CTACGACAGA ATTCAATCGG CGGGACCTTA CGGAGAAGGG TTTGAGGCTC CTTGTTTTTG CATAAGAAAT GTTATTGTTT TAAGTGACAG AATGACGGAG AAAAAACATC ATATAATGGT ATTGGAAGAT CAAAAAGGAA ACAGAATACC TGCAGTCAAG TGGTTTGGTG AGGATGAATC CTTTGAGGGC AGGTGTTTTG ATGTAACCTG CAGAATAGGC CGGAACAATT ACAGCAAGGA CGCGGGTATT CAACTGACTT TGGAATATAT GGTTGAAAGT TTCGGGAAAT TTAAAAAGCT GTTTGAAGGT GAAATAATAG ATGAGAGAAA AACAACTGTT GAAAACCTCT TAAGGAAATA TCCCAATGCC CAAATATTCT ACGAAGGGCT TCAGACGGCA TGTCCCGTTG AGAATACCAT TGACAGGTTT TCGGTGAAAA ACTGTAAGGA GCTTGTTTTT TTGTCCACCC CTGCGAATAC TGAGATTTTC AAAGAGGTTA TTGCCCTTGC CAATCCAGAA AAGGTGATAA TAAATTTTGC CGTCCTTTCG AACTATACCT TTAAAGGCTT TGTATTAAAC CTTTTGGGGC TTATAAAGCA CATAATAAAG AGACGGGACG GAAGAGCTTA TATTGATGAG CTTTCTTTGA AGCTTTGCGT TGAGGAGAAC ATTGTAAAAG CGGGATTAAA ATATCTTGGC TCTTCTGGAA TGTTAAACTA TACTTTAAGT GACGATGAGC AAAAAGTTTA TTTGTCTGAA GGAAAAGGTG TGGCTGACAG AAATGCTTTT ATGGCGAAAA AAAACCTTTC CGACGCTTTG GCGGAAAAGA ATGCCTATCA GCAGTTTATT TTAAAGATGG AGATAGATAA ATTCAGGGAA TATCTTAAGT AA
|
Protein sequence | MKVRVNILNK GVSIPQEVLD ACGGDELVAR IFYNRGYKNP ETIRQMLNPE LYVPTKPDEF PDMPRAVDRI LRAADNEEKI CVYGDYDVDG VTSTVTLVEC LNFFTSKVVY HVPDRFTEGY GMNEEIVRKL AQDGVSLIIT CDCGISNVRE ITLAKELGMD VVLTDHHTVP DELPPADAIL NPKLLEEGHR ARNISGCGMV YFLCLALLEK KGFPDRAERF LDMLALSLIA DVVSLNGENR YLLQKALPAL FNTRRIGLRQ LLEVAERNGK LENEEDVAFQ IAPRINAAGR MDTARLPVEL FLCQDLEKAR IMAEKIDSLN TERKRVQQSI VDEAVEMVET RKKNKTILVL YKEYWHHGII GIAAGRICEL YRKPAILFSL KEDGITAVGS ARSIEEVNIY ELIKECSGKL LKFGGHSQAA GLSIRKDDIE EFISQIEMEA ENRYFIKDMV NVNADMELGI EDINEELYDR IQSAGPYGEG FEAPCFCIRN VIVLSDRMTE KKHHIMVLED QKGNRIPAVK WFGEDESFEG RCFDVTCRIG RNNYSKDAGI QLTLEYMVES FGKFKKLFEG EIIDERKTTV ENLLRKYPNA QIFYEGLQTA CPVENTIDRF SVKNCKELVF LSTPANTEIF KEVIALANPE KVIINFAVLS NYTFKGFVLN LLGLIKHIIK RRDGRAYIDE LSLKLCVEEN IVKAGLKYLG SSGMLNYTLS DDEQKVYLSE GKGVADRNAF MAKKNLSDAL AEKNAYQQFI LKMEIDKFRE YLK
|
| |
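The gene sequence begins with TTG while the protein sequence begins with M, which is what one would expect under translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates that relationship on the first 30 bases of the record. It is a minimal illustration, not the pipeline that produced this entry: the function name is hypothetical, the codon assignments are the standard ones (which table 11 shares for elongation), and the start-codon set is the one listed for table 11 in the NCBI genetic code tables as best I recall it.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases cycled as T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS; the first codon is read as M if it is a valid start."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError("first codon is not a table 11 start codon")
    residues = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    if residues[-1] == "*":
        residues.pop()  # a terminal stop codon contributes no residue
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGAAAGTCA GGGTGAATAT ATTAAACAAG"))  # MKVRVNILNK
```

The output matches the first block of the protein sequence row. Passing the full gene sequence to the same function should yield a string comparable to the listed protein once the spaces in the protein row are removed.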