Gene Cthe_0518 details

Gene Information

Locus tag: Cthe_0518
Symbol:
ID: 4808267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 632700
End bp: 635675
Gene Length: 2976 bp
Protein Length: 991 aa
Translation table: 11
GC content: 37%
IMG OID: 640105933
Product: type III restriction enzyme, res subunit
Protein accession: YP_001036948
Protein GI: 125973038
COG category: [V] Defense mechanisms
COG ID: [COG3587] Restriction endonuclease
TIGRFAM ID:
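
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS that includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(632700, 635675)
print(length)                  # 2976
print(protein_length(length))  # 991
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.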


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTGA AATTTGATAA AAACCTGGAA TACCAACAGC AGGCCATAGC TTCCGTTGTG 
GATTTGTTTA GAGGCCAAAC GCCTATGCAT ACTAATTTTA CCGTATCAGC TTATAATGGG
CAAATAGGCC TGTTTGATAC AGAAAACGGT ATAGGCAACA GGCTTGAACT GGATGAAGAA
GAGATACTTA AGAATTTGCA AGAAGTTCAG CTTAGGAATG GCCTGCCACA GACAAAGTTT
CTAAAAGCCG GAGAATACGA CTTTGATATA GAGATGGAAA CCGGTACTGG TAAAACTTAT
GTTTACTTAA GGACCATCTT TGAACTCCAT AAAAACTATG GGTTTTCCAA GTTTATTATT
GTTGTTCCCA GTATTGCCAT TAAAGAAGGT GTGTATAAGA CCTTACAGAT TACAGAGGAA
CATTTTAAGG AATTATATGA TAATACAATT TACCACTACT TCATATATGA TAGTAGCAAA
CTGGAACAGG TAAGAAGTTT TGCGGTGAGC GATAATATTG AGATTATGGT TATAAACATT
GACGCTTTTA GGAAGAGTTT TACAGACCCT ACTAAGGAAA ATAAGGCCAA TATTATTCAT
AGGACAAATG ATAGATTAAA CGGTATGAAA CCTATTGAGC TTATTCAAGA GACGAGACCC
ATTGTTATTA TAGATGAGCC ACAATCGGTA GACACGACTC CTAAAGCCAA AGAAGCTATT
AAATCTCTAA ATCCTTTATG TATATTACGT TACTCAGCAA CCCATGTGGA AAGGCACAAT
CTGGTTTATA AGCTGGATGC AGTAGATAGT TATAATTTGG GTTTGGTTAA GCAGATAGAA
GTTGCTGGGT TTACAACTAA AGACTACCAC AATAAAGCTT ACTTGAAGCT TTTATCTGTA
GACAATAAAA AATCTCCTAT AACAGCCAAA ATAGAGATGG ATGTCAAAGA TAGAAAAGGT
GTGGTAAAAC GGAAAGCGGT AACAGTCAAA CGTGGAGATG ACCTTTATGA AAAATCCGGA
GGGCGTGATG TCTATGAAGG ATATATTGTC AGCGAGATTT ATTGCGAAGA AGGGAACGAA
TACGTTGCCT TTACAAGCAA GCCGGATATT CTACGTATAG GTAAAGCTAT AGGAGATGTG
GACGACCTGG CCATAAAAGA ACAGATGATT AGAAAGACTA TAGAGGAGCA TTTAGACAAA
GAACTGGTAT TAAATAAGCT GGGAATAAAA GTGCTGAGCC TGTTCTTTAT AGACCGAGTG
GCAAATTATC GCTACTATGA TGAAAATGGG AATCCTCAAA AGGGTATCTA TGCAAAGCTC
TTTGAAAAAC ATTATAAAGA CCTCATAAGA CTGCCTAAGT ATAATACGCT GTTTAAAGAT
ATTGATTTAG ATACCGCTGC TGAAGAAGTC CACAATGGCT ATTTTTCAGC GGATAAGAAA
GGTGTCTTAA AAGATACCAG CGGTTCTACG CAAGATGATG AGGATACTTA CAACTTGATT
ATGAAAGACA AGGAGCGCCT TTTATCTTTT GATACTAAAC TTAGATTTAT CTTCTCCCAC
TCGGCTTTGC GTGAAGGCTG GGATAACCCC AATGTGTTCC AGATATGTAC CCTAAATGAA
ACACAGAGCG AGGTTAAAAA GCGTCAAGAG ATTGGTCGTG GCCTGCGTCT TTGTGTTAAT
CAGGAAGGAG AACGCCAGTA TGGTTCTTTT ATAAACACAT TGACGGTTAT CGCTAATGAA
AGCTATGAGG AATTTGCAGC TAAACTTCAG AAAGAATACG AAACTGAAAG CGGCATAAGA
TTTGGTATTA TTGAAAGTCA TTTGTTTGCC AACATCCCGG TAAAACAAGT GGATGGAAGT
GTAAAATATT TAGGTCAGGA AGCTTCCGAA ACCATTTTCA AAGCATTTTT GAATAATGGT
TATATTAATG AATCAGGTGA AGTGCAGTAC AAGCTGAAAA ATGATATAAA AGACAACAAG
CTAAATGTAC CGGAAGAATA TGAACATGTA AGAGCGGAGA TTACCGCTCT TGCCAGAAAA
GTGTGCAGCG GACTTAATAT CAGGAACAAT AGTGATAAAA AAACTATTAA GCTTAATAAG
CAGGTATACC TTGACCCTGA ATTTAAAGAG CTTTGGAACA GGATAAAATA CAAGACTACC
TATTCTGTGG ACTTTGACAG TGAAAAGCTT ATTGAGGAAT GTTGCAAAGA GATGCAGAGA
AGCCTGTTTG TAAGTTCACC AAAGCTAATA TATACAAAAG CAGGGCTTGA TATTAGTGCG
GGTGGAATTG AGGCAAAAGA ATCAGACAGA TATGCAGTTG TTTTGGATAA CCAGAAGGAA
ACTCTGCCTG ATATTATTGC TTATCTTCAG AACGAAACCA ATTTAACAAG AAAAACTATT
GTTGAGATAC TGATAAGAAG TAAAACACTC CACCTTTTTA AGAAGAACCC ACAAAAATAT
ATGGAGCAGG TATCACAAAT TATAACCGCT AAAATGAGGA ATATGATAGT TGACGGCATT
AAATACACAA AGATAGGTGA CGACGAATAT TATGCACAAG AACTCTTTGA AAGTGAGGAG
CTTATAGGCT ATTTATCCAA AAACATGATG GCAAGTAAGA AGTCAGTATA TGAATATGTT
GTATATGATA GTGCTACAGA AGAGAGATTT GCCAGAAGTT TCGAAAACAA TAGCAAGGTT
AAGTTGTATG CAAAACTTCC CGGCTGGTTT ACTATACCCA CTCCATTAGG CAGCTATAAT
CCTGACTGGG CTGTACTAAT TGATGTAGAT GGCAAAGATA AACTGTACTT TGTACTGGAA
ACCAAAGCTG ATACTATGTT TGATGCCCTA AGACCAACAG AAAGAGCTAA AATTGAATGC
GGGAAGAAAC ATTTTGAGGC TTTAGGTACT GAAGTAGGAT TTGAAGATAT AGATAGTTTT
GAAGGATTTA TAGAGGAGAA AGTAGTAGTA AAGTAG
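
The gene sequence is printed in blocks of 10 bases, so it has to be stripped of whitespace before any length or composition check. The following is a minimal sketch of that cleanup plus a plain G+C percentage. How the database derived its 37% figure is not stated in the record, so the formula here is an assumption, and the helper names are hypothetical.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGAAGCTGA AATTTGATAA AAACCTGGAA TACCAACAGC AGGCCATAGC TTCCGTTGTG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # G+C percentage of this first line only
```

Passing the full block-formatted sequence to `clean_sequence` should give a string whose length can be compared with the Gene Length field.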
 
Protein sequence
MKLKFDKNLE YQQQAIASVV DLFRGQTPMH TNFTVSAYNG QIGLFDTENG IGNRLELDEE 
EILKNLQEVQ LRNGLPQTKF LKAGEYDFDI EMETGTGKTY VYLRTIFELH KNYGFSKFII
VVPSIAIKEG VYKTLQITEE HFKELYDNTI YHYFIYDSSK LEQVRSFAVS DNIEIMVINI
DAFRKSFTDP TKENKANIIH RTNDRLNGMK PIELIQETRP IVIIDEPQSV DTTPKAKEAI
KSLNPLCILR YSATHVERHN LVYKLDAVDS YNLGLVKQIE VAGFTTKDYH NKAYLKLLSV
DNKKSPITAK IEMDVKDRKG VVKRKAVTVK RGDDLYEKSG GRDVYEGYIV SEIYCEEGNE
YVAFTSKPDI LRIGKAIGDV DDLAIKEQMI RKTIEEHLDK ELVLNKLGIK VLSLFFIDRV
ANYRYYDENG NPQKGIYAKL FEKHYKDLIR LPKYNTLFKD IDLDTAAEEV HNGYFSADKK
GVLKDTSGST QDDEDTYNLI MKDKERLLSF DTKLRFIFSH SALREGWDNP NVFQICTLNE
TQSEVKKRQE IGRGLRLCVN QEGERQYGSF INTLTVIANE SYEEFAAKLQ KEYETESGIR
FGIIESHLFA NIPVKQVDGS VKYLGQEASE TIFKAFLNNG YINESGEVQY KLKNDIKDNK
LNVPEEYEHV RAEITALARK VCSGLNIRNN SDKKTIKLNK QVYLDPEFKE LWNRIKYKTT
YSVDFDSEKL IEECCKEMQR SLFVSSPKLI YTKAGLDISA GGIEAKESDR YAVVLDNQKE
TLPDIIAYLQ NETNLTRKTI VEILIRSKTL HLFKKNPQKY MEQVSQIITA KMRNMIVDGI
KYTKIGDDEY YAQELFESEE LIGYLSKNMM ASKKSVYEYV VYDSATEERF ARSFENNSKV
KLYAKLPGWF TIPTPLGSYN PDWAVLIDVD GKDKLYFVLE TKADTMFDAL RPTERAKIEC
GKKHFEALGT EVGFEDIDSF EGFIEEKVVV K
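
The protein sequence can be checked against the gene sequence by translating it. The sketch below is not the database's own method. It builds the codon table from the amino-acid string NCBI publishes for translation table 11, whose elongation codons match the standard code. Table 11 also permits alternative start codons, which are ignored here because this gene starts with ATG. The `translate` helper is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, to_stop=True):
    """Translate a nucleotide sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
print(translate("ATGAAGCTGA AATTTGATAA AAACCTGGAA"))         # MKLKFDKNLE
print(translate("GAAGGATTTA TAGAGGAGAA AGTAGTAGTA AAGTAG"))  # EGFIEEKVVVK
```

Both fragments reproduce the corresponding start and end of the listed protein sequence.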