Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0518 |
Symbol | |
ID | 4808267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 632700 |
End bp | 635675 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640105933 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001036948 |
Protein GI | 125973038 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTGA AATTTGATAA AAACCTGGAA TACCAACAGC AGGCCATAGC TTCCGTTGTG GATTTGTTTA GAGGCCAAAC GCCTATGCAT ACTAATTTTA CCGTATCAGC TTATAATGGG CAAATAGGCC TGTTTGATAC AGAAAACGGT ATAGGCAACA GGCTTGAACT GGATGAAGAA GAGATACTTA AGAATTTGCA AGAAGTTCAG CTTAGGAATG GCCTGCCACA GACAAAGTTT CTAAAAGCCG GAGAATACGA CTTTGATATA GAGATGGAAA CCGGTACTGG TAAAACTTAT GTTTACTTAA GGACCATCTT TGAACTCCAT AAAAACTATG GGTTTTCCAA GTTTATTATT GTTGTTCCCA GTATTGCCAT TAAAGAAGGT GTGTATAAGA CCTTACAGAT TACAGAGGAA CATTTTAAGG AATTATATGA TAATACAATT TACCACTACT TCATATATGA TAGTAGCAAA CTGGAACAGG TAAGAAGTTT TGCGGTGAGC GATAATATTG AGATTATGGT TATAAACATT GACGCTTTTA GGAAGAGTTT TACAGACCCT ACTAAGGAAA ATAAGGCCAA TATTATTCAT AGGACAAATG ATAGATTAAA CGGTATGAAA CCTATTGAGC TTATTCAAGA GACGAGACCC ATTGTTATTA TAGATGAGCC ACAATCGGTA GACACGACTC CTAAAGCCAA AGAAGCTATT AAATCTCTAA ATCCTTTATG TATATTACGT TACTCAGCAA CCCATGTGGA AAGGCACAAT CTGGTTTATA AGCTGGATGC AGTAGATAGT TATAATTTGG GTTTGGTTAA GCAGATAGAA GTTGCTGGGT TTACAACTAA AGACTACCAC AATAAAGCTT ACTTGAAGCT TTTATCTGTA GACAATAAAA AATCTCCTAT AACAGCCAAA ATAGAGATGG ATGTCAAAGA TAGAAAAGGT GTGGTAAAAC GGAAAGCGGT AACAGTCAAA CGTGGAGATG ACCTTTATGA AAAATCCGGA GGGCGTGATG TCTATGAAGG ATATATTGTC AGCGAGATTT ATTGCGAAGA AGGGAACGAA TACGTTGCCT TTACAAGCAA GCCGGATATT CTACGTATAG GTAAAGCTAT AGGAGATGTG GACGACCTGG CCATAAAAGA ACAGATGATT AGAAAGACTA TAGAGGAGCA TTTAGACAAA GAACTGGTAT TAAATAAGCT GGGAATAAAA GTGCTGAGCC TGTTCTTTAT AGACCGAGTG GCAAATTATC GCTACTATGA TGAAAATGGG AATCCTCAAA AGGGTATCTA TGCAAAGCTC TTTGAAAAAC ATTATAAAGA CCTCATAAGA CTGCCTAAGT ATAATACGCT GTTTAAAGAT ATTGATTTAG ATACCGCTGC TGAAGAAGTC CACAATGGCT ATTTTTCAGC GGATAAGAAA GGTGTCTTAA AAGATACCAG CGGTTCTACG CAAGATGATG AGGATACTTA CAACTTGATT ATGAAAGACA AGGAGCGCCT TTTATCTTTT GATACTAAAC TTAGATTTAT CTTCTCCCAC TCGGCTTTGC GTGAAGGCTG GGATAACCCC AATGTGTTCC AGATATGTAC CCTAAATGAA ACACAGAGCG AGGTTAAAAA GCGTCAAGAG ATTGGTCGTG GCCTGCGTCT TTGTGTTAAT CAGGAAGGAG AACGCCAGTA TGGTTCTTTT ATAAACACAT TGACGGTTAT CGCTAATGAA AGCTATGAGG AATTTGCAGC TAAACTTCAG AAAGAATACG AAACTGAAAG CGGCATAAGA TTTGGTATTA TTGAAAGTCA TTTGTTTGCC AACATCCCGG TAAAACAAGT GGATGGAAGT GTAAAATATT TAGGTCAGGA AGCTTCCGAA ACCATTTTCA AAGCATTTTT GAATAATGGT TATATTAATG AATCAGGTGA AGTGCAGTAC AAGCTGAAAA ATGATATAAA AGACAACAAG CTAAATGTAC CGGAAGAATA TGAACATGTA AGAGCGGAGA TTACCGCTCT TGCCAGAAAA GTGTGCAGCG GACTTAATAT CAGGAACAAT AGTGATAAAA AAACTATTAA GCTTAATAAG CAGGTATACC TTGACCCTGA ATTTAAAGAG CTTTGGAACA GGATAAAATA CAAGACTACC TATTCTGTGG ACTTTGACAG TGAAAAGCTT ATTGAGGAAT GTTGCAAAGA GATGCAGAGA AGCCTGTTTG TAAGTTCACC AAAGCTAATA TATACAAAAG CAGGGCTTGA TATTAGTGCG GGTGGAATTG AGGCAAAAGA ATCAGACAGA TATGCAGTTG TTTTGGATAA CCAGAAGGAA ACTCTGCCTG ATATTATTGC TTATCTTCAG AACGAAACCA ATTTAACAAG AAAAACTATT GTTGAGATAC TGATAAGAAG TAAAACACTC CACCTTTTTA AGAAGAACCC ACAAAAATAT ATGGAGCAGG TATCACAAAT TATAACCGCT AAAATGAGGA ATATGATAGT TGACGGCATT AAATACACAA AGATAGGTGA CGACGAATAT TATGCACAAG AACTCTTTGA AAGTGAGGAG CTTATAGGCT ATTTATCCAA AAACATGATG GCAAGTAAGA AGTCAGTATA TGAATATGTT GTATATGATA GTGCTACAGA AGAGAGATTT GCCAGAAGTT TCGAAAACAA TAGCAAGGTT AAGTTGTATG CAAAACTTCC CGGCTGGTTT ACTATACCCA CTCCATTAGG CAGCTATAAT CCTGACTGGG CTGTACTAAT TGATGTAGAT GGCAAAGATA AACTGTACTT TGTACTGGAA ACCAAAGCTG ATACTATGTT TGATGCCCTA AGACCAACAG AAAGAGCTAA AATTGAATGC GGGAAGAAAC ATTTTGAGGC TTTAGGTACT GAAGTAGGAT TTGAAGATAT AGATAGTTTT GAAGGATTTA TAGAGGAGAA AGTAGTAGTA AAGTAG
|
Protein sequence | MKLKFDKNLE YQQQAIASVV DLFRGQTPMH TNFTVSAYNG QIGLFDTENG IGNRLELDEE EILKNLQEVQ LRNGLPQTKF LKAGEYDFDI EMETGTGKTY VYLRTIFELH KNYGFSKFII VVPSIAIKEG VYKTLQITEE HFKELYDNTI YHYFIYDSSK LEQVRSFAVS DNIEIMVINI DAFRKSFTDP TKENKANIIH RTNDRLNGMK PIELIQETRP IVIIDEPQSV DTTPKAKEAI KSLNPLCILR YSATHVERHN LVYKLDAVDS YNLGLVKQIE VAGFTTKDYH NKAYLKLLSV DNKKSPITAK IEMDVKDRKG VVKRKAVTVK RGDDLYEKSG GRDVYEGYIV SEIYCEEGNE YVAFTSKPDI LRIGKAIGDV DDLAIKEQMI RKTIEEHLDK ELVLNKLGIK VLSLFFIDRV ANYRYYDENG NPQKGIYAKL FEKHYKDLIR LPKYNTLFKD IDLDTAAEEV HNGYFSADKK GVLKDTSGST QDDEDTYNLI MKDKERLLSF DTKLRFIFSH SALREGWDNP NVFQICTLNE TQSEVKKRQE IGRGLRLCVN QEGERQYGSF INTLTVIANE SYEEFAAKLQ KEYETESGIR FGIIESHLFA NIPVKQVDGS VKYLGQEASE TIFKAFLNNG YINESGEVQY KLKNDIKDNK LNVPEEYEHV RAEITALARK VCSGLNIRNN SDKKTIKLNK QVYLDPEFKE LWNRIKYKTT YSVDFDSEKL IEECCKEMQR SLFVSSPKLI YTKAGLDISA GGIEAKESDR YAVVLDNQKE TLPDIIAYLQ NETNLTRKTI VEILIRSKTL HLFKKNPQKY MEQVSQIITA KMRNMIVDGI KYTKIGDDEY YAQELFESEE LIGYLSKNMM ASKKSVYEYV VYDSATEERF ARSFENNSKV KLYAKLPGWF TIPTPLGSYN PDWAVLIDVD GKDKLYFVLE TKADTMFDAL RPTERAKIEC GKKHFEALGT EVGFEDIDSF EGFIEEKVVV K
|
| |