Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1288 |
Symbol | |
ID | 4600596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1229619 |
End bp | 1230542 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774064 |
Product | CRISPR-associated Csm3 family protein |
Protein accession | YP_920689 |
Protein GI | 119720194 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR02581] CRISPR-associated RAMP protein, SSO1426 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0685165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCCAGG TCCCGTGGTC CTCGCACAGG GTCCTCCTCA GGGAGGCAGT CTTCAGGGGC TACCTCGTCG CCGAGTCTCC GCTGAGGGTG GGCGCTGGCA GGGAGGCGCC GCTGGGCTCC CCGGTGGACC TCTCGGTCCT CAGGGTTAGG CTTGGCGGTA AAAGCGTCCC CTACATACCC GGCAGTAGCC TGAAGGGTGT GTTCAGGAGC TTCTCCCAGT CCCTCGCAGT GGCGAAGGGG CTAAGCGTGT GCAGCGGGCT GAGCGGGGAG ACGTGCATGG ACTACGAGGA CCCGTCGCTG GGCGGCGAGA AGTTGCTGAG CTACTTGCAG GGGCTGATGA GGGAGGGGCA CAGCCTGGAA GCCGTCAGGC TGTTCCACGA GAAGGCCTGC CTGATGTGCA AGGTCTTCGG CGCCCCCTCG TTCTCCGGGC ACGTCGAGTT CAGCGACGCC TACCCCGTCG ACGAGAAGGG GGGCGTCGTC GACGTCTCCA CCGGGGTGAG GACGGGCATA GCGATAAACA GGAGGACGGG CGCCGTCTAC GAGAGGGCCC TCTACCAGGT CGAGTACGTC GAGCCGGGCG CGAGGTTCAG GTTCGAGGCT AGGACTACGA ACCTCCCCAA CTACGCGCTC GGGCTCCTCT CCGCCGTCAT CAGGATGATG AACGAGGGCT GGGTCAGGGT GGGCGGCTTC AAGACGAGGG GCTTCGGAGA GGTGCGCGTG GAGGGCCTGG AGTTCGCGGC TAGGGGCGCG ACTGTCCGCG GCTCGGTGCT GGTGAAGCTC GACGACTACG ACTCGGACGT CGACCTCTCG GGCCTCGCCG AGGCCAGGGA CGGGTGGCTA AGGGCCTCCG GGGACTCCGC CTGGAAGGCG CTGGCTAAGC TCGAGGAGGT GTGGTCCAAT GCAAGCTTCG GGAAGCGCGG TTAG
|
Protein sequence | MSQVPWSSHR VLLREAVFRG YLVAESPLRV GAGREAPLGS PVDLSVLRVR LGGKSVPYIP GSSLKGVFRS FSQSLAVAKG LSVCSGLSGE TCMDYEDPSL GGEKLLSYLQ GLMREGHSLE AVRLFHEKAC LMCKVFGAPS FSGHVEFSDA YPVDEKGGVV DVSTGVRTGI AINRRTGAVY ERALYQVEYV EPGARFRFEA RTTNLPNYAL GLLSAVIRMM NEGWVRVGGF KTRGFGEVRV EGLEFAARGA TVRGSVLVKL DDYDSDVDLS GLAEARDGWL RASGDSAWKA LAKLEEVWSN ASFGKRG
|
| |