Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1160 |
Symbol | |
ID | 5103508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1124124 |
End bp | 1124888 |
Gene Length | 765 bp |
Protein Length | 254 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640507052 |
Product | CRISPR-associated Csx7 family protein |
Protein accession | YP_001191245 |
Protein GI | 146303929 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR02581] CRISPR-associated RAMP protein, SSO1426 family |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.791227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0434649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGTGA ATGAAGAGCG ACCCTGCTAT GACCTTGATA GGATCAAGGT AGTCACTGAG GTCACGGGCT ACCTCACTAA CCTTACCCCA CTCAGGATAG GAAGCGGTAA GGGATCGGCA AGCTTCAGGG ACACGACGGA CAACCCAATC CTAACCAGGG GAGACATGCC GTACATTCCA GGGAGTAGCC TCAAGGGAGC TCTTAGGTCA TGGCTTGAGG CCAACGTTGA GGGCCTCTAC GGACGACTGG GGCAAAAGTA CACGAAGGTT TACCCTATTA CAGCTAGGGA CAACGAGGTC AAGAGTTGCG TCAAGTCAAG TGACACACAG GGTAATCAGG TAGAGGAGTA CTGTATACCC TGTATCATCT TTGGGCATAA GGACCTTTCC GCGAGGATGA ACATCATGGA CGCCACAGTG GAGGGAAGTT TCAAGGTTGA GTCCTACACA GGAGTCTCCA TAAACAGGGT ATTCGGAGGT CAGTCGCCTG GGCATCTCTT TACCTTCGAT TACGTGTCCC CTGGTGCAAA GTTCAGATTT AGGTCGATCA TCTACAACGT AAACGTTGAG AACGAGACTG AGGAATGGAG GGAACAGGTG AGACGTGGTG TTGTCTTCCT CCTGAGGTCA CTCGAGGAGG GGCTCTTCAT TGGCGCCAGG AAAACCACGG GAGCAGGCCT TGTTAAGCTT CAGGATATCA GGGTTAGGAC CATGGTTCCT GGTCAACCTT GGAGAGAAGT GAAGTGGGCA GAGGTGAAGC TGTAA
|
Protein sequence | MFVNEERPCY DLDRIKVVTE VTGYLTNLTP LRIGSGKGSA SFRDTTDNPI LTRGDMPYIP GSSLKGALRS WLEANVEGLY GRLGQKYTKV YPITARDNEV KSCVKSSDTQ GNQVEEYCIP CIIFGHKDLS ARMNIMDATV EGSFKVESYT GVSINRVFGG QSPGHLFTFD YVSPGAKFRF RSIIYNVNVE NETEEWREQV RRGVVFLLRS LEEGLFIGAR KTTGAGLVKL QDIRVRTMVP GQPWREVKWA EVKL
|
| |