Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1287 |
Symbol | |
ID | 4600595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1228909 |
End bp | 1229622 |
Gene Length | 714 bp |
Protein Length | 237 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639774063 |
Product | CRISPR-associated Csm3 family protein |
Protein accession | YP_920688 |
Protein GI | 119720193 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR02581] CRISPR-associated RAMP protein, SSO1426 family |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.609283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGTACA ACGACCTCGA CAGGCTCGAC CTCTTCACCA GGGTGACGGG GGTTCTCGAG AACCTCACGC CGCTGAGGGT GGGCGCTGGC AGGGAGGCTC AGCTCGGCTC CCCGGTCGAC TTGGAGCCGT TGAGGGTTAG GCTCGGGGAT AGGAGCGTCC CCTACATACC CGGTAGTAGC TTGAAGGGTG TGTTCAGGAG CCTGGCGGAG GCTATAGCGA GGGCTGAGGG GCACGTTATC CACGACCCGT GGGACTTCGA GGCGGCCGAG CAGGAGGCGC GGGACGGGAA GTACTGCCTG ATATGCGGCA TATTCGGTAG CACGAGGCTG GCGAGCCACG TGAGGATCTA CGACGCCTAC CCGAAGGGGA CTCCCACGCT GTTCATGAAG ACGGGCGTCG GGATTAACAG GGACTTCCGG GGGGCTCACC CGAACATCCT CTACACCGAG CGCCAGGTGG AGCCGGGGCA CAGGTGGAGC TTCATGATGG ACATCGTGAA CATAAGGGTG TACCCGGAGC CCGGGGACGA GAGGGGCAGG ATCCTGAGGA GGGTTCTCGA CATGCTCGCC GAGGGGATGG TCCAGGTCGG CGCCAGGAAG ACTGTGGGCT ACGGGCTCCT CAGGCTAGTG GAGGGCGAGT ACGTCGTGTA CGGGGTTAGG GACGGCAGGC TCCAGCGCCT AGAGTCCGGG AAGATAGGGG GTGGTGCGGT TTGA
|
Protein sequence | MQYNDLDRLD LFTRVTGVLE NLTPLRVGAG REAQLGSPVD LEPLRVRLGD RSVPYIPGSS LKGVFRSLAE AIARAEGHVI HDPWDFEAAE QEARDGKYCL ICGIFGSTRL ASHVRIYDAY PKGTPTLFMK TGVGINRDFR GAHPNILYTE RQVEPGHRWS FMMDIVNIRV YPEPGDERGR ILRRVLDMLA EGMVQVGARK TVGYGLLRLV EGEYVVYGVR DGRLQRLESG KIGGGAV
|
| |