Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1144 |
Symbol | |
ID | 4810812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1360911 |
End bp | 1362332 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 640106566 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001037569 |
Protein GI | 125973659 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAATA ATGTGTTGGT ATGGAAGCAA CCAAATATCT CTTGGATTCA TCCTATTAAT ATAGATAATA GTATTAATGC CAATAACTTT AATTTTGACT ATCTAGATAC ATTAAATAAG TTAAGAAGTA GCAATTTAAA AATTTTAGAA TTAAGAGATA TAGCAGATAA AATATCAGAT GGACCGTTTG GCAGTCAATT GAAGGTAGAA GAATATAAGG AACAAGGATT TCCAGTATAT AGAGTTAAAA ATATTATTGA TACTCAAATT TTGGATGATG ATATTGTATA TATTGATGCT AAAAAGCAAC AACAATTAAA GAGAAGTGAA GTATTACCTG GGGATGTATT AATAACTAAA GCAGGCAGAA TAGGTTCTGC TGCTGTTGTA CCAAGTAAAT TTGGAAATGG GAACATAACT TCACATTTAG TGTTAGTTAG ATTAAAAAAA ACAATCAATA ACTATTATTT GGTTGCTTAT TTAGAATGTA AGTATGGTAA AGTTATTACA GGTCGAGAGA GTTATAAGTC AACAAGACCT GAATTGACAA AAAATGAAAT AGGAAATGTT ATAATCCCCA TCCCATCTCC TGAAATTCAA AAATACATAG GAGATAAGGT TAGAAAAGCA GAAGAGTTGA GAGAAGAAGC GAAAAGGTTG AAGAAAGAGG CTGAAACATT TCTTTATGAA ATGATTCAAC TTAAACCATT AAATGATTTT GATAAAGATA TGTTTTCATT TGTCAATAGT AATTATATTG ATTCTGAAAG ATTAGATTCA GAGTATTATA AAACAAAATA TATTACATTA GAGAAACTCT TAAAAAGTAA AAAAGTTACT TCTTTTAAGG ATATTATAAT CGAAAGTAAG TATGGAGCAT CTGTACCAGC AGATTACACA ATGGTTGGTA TACCTTTTAT TAGAGGAAAT AATTTAACTG ATAATGAAAT TAATATTGAT GATATTGTAT ATTTAAATAA AAAATTAAAA GATGAAGTTA AAGACCATCA TGTAAATACT GGAGATATTT TGATAACAAG AAGTGGAACT GTTGGTATTA GTGCAGTTGT TGATGAAAAA TGCGATGGGT TCTCATTTGG TTCATTTATG ATAAAACTAC GTATTGATAT GAGAATATGG AACCCTTATT ATATAGCAGC ATTCTTAAAT TCATTTTGGG GAAAATGGCA AATTGAAAGG TTACAAAATG GTGCTGTTCA GCAAAATATT AATTTACAAG AAATTGGTAG AATTATAATA CCTATTATTT CAAAAGAAAA TCAAGATAAA ATTGAAGAAT TAATCAAAAA TTATATTAAT AAAAAAAGAC AATCAAAACA ACTAATTCAA GAAGCAAAAC AGGACGTAGA AGACCTTATA GAAGGCAACT TTGATATGTC AAAAGTAAAA GCAAATAGTT AA
|
Protein sequence | MINNVLVWKQ PNISWIHPIN IDNSINANNF NFDYLDTLNK LRSSNLKILE LRDIADKISD GPFGSQLKVE EYKEQGFPVY RVKNIIDTQI LDDDIVYIDA KKQQQLKRSE VLPGDVLITK AGRIGSAAVV PSKFGNGNIT SHLVLVRLKK TINNYYLVAY LECKYGKVIT GRESYKSTRP ELTKNEIGNV IIPIPSPEIQ KYIGDKVRKA EELREEAKRL KKEAETFLYE MIQLKPLNDF DKDMFSFVNS NYIDSERLDS EYYKTKYITL EKLLKSKKVT SFKDIIIESK YGASVPADYT MVGIPFIRGN NLTDNEINID DIVYLNKKLK DEVKDHHVNT GDILITRSGT VGISAVVDEK CDGFSFGSFM IKLRIDMRIW NPYYIAAFLN SFWGKWQIER LQNGAVQQNI NLQEIGRIII PIISKENQDK IEELIKNYIN KKRQSKQLIQ EAKQDVEDLI EGNFDMSKVK ANS
|
| |