Gene Information |
Locus tag | Cthe_1786 |
Symbol | |
ID | 4810031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2106433 |
End bp | 2107797 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107200 |
Product | DNA repair protein RadA |
Protein accession | YP_001038200 |
Protein GI | 125974290 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
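The coordinate and length rows above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. The function names `gene_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length(2106433, 2107797)
print(length)                           # 1365, matching the Gene Length row
print(expected_protein_length(length))  # 454, matching the Protein Length row
```

The check does not depend on the strand, because the gene's length is the same whichever strand it is annotated on.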
Plasmid Coverage Information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000301687 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCAAAAC AAAAATCGTT TTATGTGTGT CAGGAATGCG GCTATGAAAG TATTGGCTGG ATGGGTAAAT GTCCTTCCTG CAATCAGTGG AACACCTTTG TGGAGGAGAT TCAGGAACCA AAAAGTAAAA GCAGGAGCGG AGCTGTTTCC ACCAATGTGA AGCCGGTTAA TATAAATGAC ATAGAAGCGG ATACTGAAGA ACGTTATCTT ACGGGAATAA AGGAAATGGA CCGGGTACTC GGAGGTGGAA TTGTAAAAGG TTCCCTTATT CTTGTGGGTG GAGATCCGGG TATAGGCAAA TCCACTCTTT TGCTGCAGAT ATGCGACAAA ATAAAGACAA ATGCCAAAAT TTTATATGTT TCCGGCGAGG AATCTATTAA GCAAATAAAG CTGCGAGCCG ATAGATTGAA TGTAAGGAAT CCCAATCTGC TGATGTTGTC TGAAACAAAC TTTAAAGTCA TACAGGCTCT GAGCGAAACC GAACGGCCGG ATCTTATTGT AATCGATTCA ATACAAACGA TGTTTAATGA TGAACTTCCC TCAGCTCCGG GAAGTGTAAG CCAGGTAAGG GAAATTACCT CCGGACTCAT GAGAATTGCA AAAACGCTGA ACATAGCGAT AATAATTGTG GGCCATGTAA CCAAAGAGGG AGCTATAGCA GGACCCAGGG TTCTTGAGCA CATGGTGGAC ACTGTGCTGT ACTTTGAGGG CGAAAGACAT TTAAGTTACA GGATTTTGAG AGCGGTAAAG AACCGCTTTG GCTCCACAAA CGAAATAGGC ATATTCGAGA TGCGGGATGT GGGGCTTGTG GAAGTGGAAA ATCCGTCATC AATGCTTTTG TCGGAAAGAA CGGAAAGCGT TCCGGGTTCT GTGGCGGTAG CAACTTTGGA AGGCACAAGG CCCATGCTGA TAGAAATTCA GGCACTGGTC TGTCCCACAA GTTTCGGGAT GCCCAGGAGA ATGGCAACCG GACTGGATTA CAACAGGATT ACTTTGCTTA TGGCGGTTTT GGAAAAGAGA GTGGGCATGC AGCTTCACAA TTATGACGCA TATGTAAACG TAGTGGGAGG ACTTAAGATT GATGAACCTG CATGCGACCT TGGTGTGGTG ACTGCCATAG CTTCGAGCTT TAGAAACATA CCGGTGGATA TGGATACCGT TTTAATAGGA GAAGTCGGCC TGACCGGGGA GGTTAGGGCT GTAAGCCAGA TAGACAAAAG AATCAGAGAG GCCGTGAGAA TAGGTTTTAA GAATTGTGTT GTTCCTGCTG GAAATATGAA GGTTATAAAG CAGATGAAAG ATATAAATAA TATAAATGTA AAGTTCGTGG AGAATGTACA GGAGGCATTG AATATAATTC TATAG
Protein sequence | MPKQKSFYVC QECGYESIGW MGKCPSCNQW NTFVEEIQEP KSKSRSGAVS TNVKPVNIND IEADTEERYL TGIKEMDRVL GGGIVKGSLI LVGGDPGIGK STLLLQICDK IKTNAKILYV SGEESIKQIK LRADRLNVRN PNLLMLSETN FKVIQALSET ERPDLIVIDS IQTMFNDELP SAPGSVSQVR EITSGLMRIA KTLNIAIIIV GHVTKEGAIA GPRVLEHMVD TVLYFEGERH LSYRILRAVK NRFGSTNEIG IFEMRDVGLV EVENPSSMLL SERTESVPGS VAVATLEGTR PMLIEIQALV CPTSFGMPRR MATGLDYNRI TLLMAVLEKR VGMQLHNYDA YVNVVGGLKI DEPACDLGVV TAIASSFRNI PVDMDTVLIG EVGLTGEVRA VSQIDKRIRE AVRIGFKNCV VPAGNMKVIK QMKDINNINV KFVENVQEAL NIIL
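The sequences in this record are printed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation, using only the Python standard library. The helper names `clean_sequence` and `gc_percent` are hypothetical. Run on the full gene sequence, the result can be compared with the GC content row of the record. The demonstration below uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group residues and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGCCAAAAC AAAAATCGTT"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 30.0
```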