Gene Information |
Locus tag | Tpen_1362 |
Symbol | |
ID | 4602199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1318443 |
End bp | 1319549 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639774137 |
Product | CRISPR-associated RAMP Csm5 family protein |
Protein accession | YP_920762 |
Protein GI | 119720267 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01899] CRISPR-associated RAMP protein, Csm5 family |
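
The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the convention that reproduces the listed gene length) and that the unspliced CDS ends in a single stop codon. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1318443
END_BP = 1319549


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1107 368
```

Both results agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields. The strand field does not affect this arithmetic, since a minus-strand feature spans the same coordinates.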
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0748722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGGAGGC TCACCCTCAA GGTAGAGGTC GTGACCCCCG TCCACGTCTG GGACGGGTAC GAGAGGGTGT ACGGCTTGGA CGTTGTCGCG GTGGACGGGC AGGCGTGCGT AGCCGACTTC GAGAGGCTGA GGCCCGACGC GCTGAGAGGA GAGCCGCCGG CCGGGCTCGC TGAGTTCGCC GAGAGGGTTG CGCGCTGGGT CAGGGAGGGG GCGTTGCCGT GCTCCAGGCG GGTCGGCATG AGGGCTCAGC CGAGGGTGGG GGACGCCGTG AGGCTGCTAC CGCCGACCCT CGTTCCCGCG TCGTCGCTCA AGGGCTACCT GAGGACCGCG CTCTTGTTCC GGTTGCTGAG GAGGGTGGCG GAGGAGCGCG GATCCGAGGA GGCGGCCAAG CTCCTGAGGG GCACCGTGAA CACTAGGGGC GACCCGAGGC GCGCCGGGGA GGCCTTGGAG AACGCTTTGC TCAGGGTGCC CAGGCTGAGG AGGCAGGGAG GGTACGCCGA CGCCATGCAG TCGGTCGTCG TGTCGGAGCC GAGGGCCTCG GGGTGCGCGA GCTCGCTGAG TAAGCTCAGC GTGGTGGAGG CCTCCGGGAA GGCTGTCGGA GAGCTCGTAG TGGAGGTCCT GGAGGGGGGC TCGTTGGAGT ACGAGGTACT GTTCCCGGAG CCGCCGCCCG TCAACCTCGC GAGCGCCGGC GGGCTCGAAG GCGAGCTCAA GGCTATCGCG GGGCTCCTCG CCTCCCCGAG CCCCGAGGAG CTTGCGCGGG CCCTCGAGGA GTTCGGCTCA GCGCTCCTGG AGCACGAGGT GGAGAGGCTG AGGGGCGCGC GGGAAAGCCT CGCCAAGGCG GGCTACGACG TCGCCACGTA CCTCGGCAGG CTGGAGGGGC TTAGGGGTGG CGGGTGCGCC GTCGCCAGGC TGGGCTTCGC CACCGGCCAC GCAAGCAAGA CGCTCGCGCT CCTGGTTAAA AGGCTCGACC CTGAGCTCTA CAAGGAGGTG GCCGACGCGA TGTCCAGGAG GCTCGGCAGG ACGTGGGACG AGCTCACGTT CAAGCTGGTA GACGTGGGCG GCGCCTACCT GGGCGCGGGG TGGGTCAAGG TATGCGTCCA GGCCTGA
Protein sequence | MRRLTLKVEV VTPVHVWDGY ERVYGLDVVA VDGQACVADF ERLRPDALRG EPPAGLAEFA ERVARWVREG ALPCSRRVGM RAQPRVGDAV RLLPPTLVPA SSLKGYLRTA LLFRLLRRVA EERGSEEAAK LLRGTVNTRG DPRRAGEALE NALLRVPRLR RQGGYADAMQ SVVVSEPRAS GCASSLSKLS VVEASGKAVG ELVVEVLEGG SLEYEVLFPE PPPVNLASAG GLEGELKAIA GLLASPSPEE LARALEEFGS ALLEHEVERL RGARESLAKA GYDVATYLGR LEGLRGGGCA VARLGFATGH ASKTLALLVK RLDPELYKEV ADAMSRRLGR TWDELTFKLV DVGGAYLGAG WVKVCVQA
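
The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly to compare against the protein sequence. The following is a rough sketch rather than the database's own method: `translate` is a hypothetical helper, and the codon assignments are the standard ones, which translation table 11 shares (table 11 differs from the standard table only in its permitted start codons).

```python
from itertools import product

# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate coding-strand DNA, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# In-frame fragments taken from the start and end of the gene sequence.
print(translate("ATGAGGAGGC TCACCCTCAA G"))  # MRRLTLK
print(translate("GTATGCGTCC AGGCCTGA"))      # VCVQA
```

The two fragments reproduce the first seven residues (MRRLTLK) and last five residues (VCVQA) of the listed protein sequence. Passing the full gene sequence to the same function should give the complete protein.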