Gene Information |
Locus tag | Msed_1171 |
Symbol | |
ID | 5104467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1136653 |
End bp | 1137603 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507063 |
Product | CRISPR-associated HD domain-containing protein |
Protein accession | YP_001191256 |
Protein GI | 146303940 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01596] CRISPR-associated endonuclease Cas3-HD |
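The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive on the replicon (the record does not state this explicitly); the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1136653
end_bp = 1137603
strand = "-"

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop.
assert gene_length % 3 == 0
protein_length = gene_length // 3 - 1

# On the minus strand, the 5' end of the gene is the larger coordinate.
five_prime_bp = end_bp if strand == "-" else start_bp

print(gene_length, protein_length, five_prime_bp)  # 951 316 1137603
```

Under that assumption the result agrees with the listed Gene Length (951 bp) and Protein Length (316 aa).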
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.159368 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATGTT CACCGGACGA TCTATGTTCT CATCCCAACA AGCGCCTCGC GGACCACCTT AATGAGGTAG GAACCACTGC ATCTTCGCTC ATCTCAGGGA GTCGTCCGGA GCTCTCCGAC CCAGCCTATA TTGCTGGGGC ATACCACGAC GTTGGTAAGT ACACGACCTT CTTTCAGAAA CATCTTCGTG GAGGTAAGGA TCCGCGATCC AGTCACGCTG AGATATCTTC CCTCATCGCA TATTACGTCG CCAAGAGGGC ACTATCTTCC CTAGAACTTC CTATCCTCGT CTCACTAGCG GTGAGATCTC ATCACGGCCA TCTCAAGGGG ATTGAGACTG TCAAGAGATG GGCCATGAAC AAGTTGGATG ATCCAGGTGT CCTTGGACCC CAATTTCAGG ATCTATACAA GAGGATGGAC AGGATCCTAG AGAATATGTC AAATATTAGG TATCACTCGC ATGTCGAGGG ATTCCTGGAG CAAGATCTGA ACTCCACCCT CGAGGAGTAC GCTTCCGATC TGCTTCACCT CGAGGAGGAG CTCAAGGCAG ATCAGTCAAA GGCCTGGGAG AGATATCTGG GAGGCCTGCT ACTGTTCTCA TGTCTCATAG ACTCAGACAA GCATAGCGCC AGTGACACAT CCTTTCTTCC CCAGAATCCC CCTTCACCCT CACTGATTAC CGAGTTCATA TCCTCGATCA AGAGTAACGA TAGGATGGCT GGGATCAGGG AGAAGCTCAA CTCCCTTGTC TCGTCTACTC CGGTTAGCCG GACGGTAACC TTGATTGCCC CGAGCGGATC GGGGAAGACG CTGTCCGGCG TTCTAACTGC ACTCAAGATG GGGAAGTCGA GGTTGATCTA TGCTCTTCCC TACATTAGTA TTGTTGAGCA GGTTCACGAC GTGCTCTCAA GGACGAGGAT GGACCCGCTC AAGTTCTATC ATCTATACTA G
|
Protein sequence | MECSPDDLCS HPNKRLADHL NEVGTTASSL ISGSRPELSD PAYIAGAYHD VGKYTTFFQK HLRGGKDPRS SHAEISSLIA YYVAKRALSS LELPILVSLA VRSHHGHLKG IETVKRWAMN KLDDPGVLGP QFQDLYKRMD RILENMSNIR YHSHVEGFLE QDLNSTLEEY ASDLLHLEEE LKADQSKAWE RYLGGLLLFS CLIDSDKHSA SDTSFLPQNP PSPSLITEFI SSIKSNDRMA GIREKLNSLV SSTPVSRTVT LIAPSGSGKT LSGVLTALKM GKSRLIYALP YISIVEQVHD VLSRTRMDPL KFYHLY
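The sequences above are printed in space-separated blocks of ten, and the gene sequence is given in coding orientation (it starts with ATG and ends with TAG). The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate the nucleotides. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, not part of any database tooling. The codon table is the standard genetic code, which to my knowledge has the same amino acid assignments as translation table 11.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record.
first_30 = "ATGGAATGTT CACCGGACGA TCTATGTTCT"
print(translate(first_30))  # MECSPDDLCS
```

The first ten residues produced this way match the start of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.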