Gene Information |
Locus tag | Msed_1147 |
Symbol | |
ID | 5103495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1093126 |
End bp | 1094034 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507039 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001191232 |
Protein GI | 146303916 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
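The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1093126
end_bp = 1094034

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 909 0 302
```

Under these assumptions the result agrees with the listed gene length of 909 bp and protein length of 302 aa.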
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.253731 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCAC TCGTCATCTC GGATTACGGG AGTTACGTTA CCGTGAAGAG GGGAATGTTC CTAGTCTCGC GAAAGGTTAA CGACAAGGAG GAGAGGAGGG AGGTATCCCC GAGCGAAGTT GATGAGATTC TGTTCTGCTC CACGTCTTTG GTCTCAACCC ACGTGTTAAG GGTGGCCTTG TCAAGGGGAA TAACGGTTGC CTTCCTGGAC TCAAGGGGGC AGATCTGGGG CCTCCTCCTC CCCTCAGTGG TTACGGAGAC CGTGAGGACA AAGAAGGCCC AGTATGAGGC AGTTGCCTCT GGACTGGATT ACGGGAAGGA GATCATAAGG GCGAAGATAA ACAACCAGGT GGTCCATCTC AAGTATTGGG CAAGGAGAGG GGTAAAGACG GATTACCGTG AGCTTGAGGG AAAGGATGAG GCCACTGCTG CAAGGATTTA CTGGCAGAAC CTGTCTCAGG TTGTCCCTGG CTTTCGCGGA AGGGACGTTG AGGGAGGGGA TGGATTCAAC TCAGCGTTGA ACTACTCCTA CGCTATCCTG TACTCTCGGG TAATGAGGGC CCTAGTCCTA GCGGGTCTCG ATCCCTACCT GGGATTTGTA CACAAGGACA GGCCAGGTAA TGAGAGTTTG GTCTACGACT TCTCGGAGAT GTTTAAGCCC TACGTGGACC TGGTACTGGC TAAGGCTTTC AAGGATGGTC TAGAGGTGAA GTTGAAGGGA GGCCTCATGG ACAAGGAAAG CAGGGGAGCA GTTGCTAAAC TCGTGGTAAA GGGCCTAGAG GAGAAGGTTA AGGAGGAACT TGACCACAAC CCCAAGAGCT TGAACCAGGC GATACGGGCT CACGCCTTGA AGTTTGCTTC TGCGTTGAGG GAAAAGAGGG AGTATAGGGG GTTCAGGATG GTGGTTTGA
|
Protein sequence | MNSLVISDYG SYVTVKRGMF LVSRKVNDKE ERREVSPSEV DEILFCSTSL VSTHVLRVAL SRGITVAFLD SRGQIWGLLL PSVVTETVRT KKAQYEAVAS GLDYGKEIIR AKINNQVVHL KYWARRGVKT DYRELEGKDE ATAARIYWQN LSQVVPGFRG RDVEGGDGFN SALNYSYAIL YSRVMRALVL AGLDPYLGFV HKDRPGNESL VYDFSEMFKP YVDLVLAKAF KDGLEVKLKG GLMDKESRGA VAKLVVKGLE EKVKEELDHN PKSLNQAIRA HALKFASALR EKREYRGFRM VV
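The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes that the sequences are printed in space-separated blocks of ten characters, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The function name `translate` and the variables are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna: str) -> str:
    """Translate a DNA string, ignoring the whitespace between blocks."""
    dna = "".join(grouped_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three blocks and final nine bases of the gene sequence above.
first_block = "ATGAACTCAC TCGTCATCTC GGATTACGGG"
last_block = "GTGGTTTGA"

print(translate(first_block))  # MNSLVISDYG
print(translate(last_block))   # VV*
```

The first ten residues match the start of the listed protein sequence, and the final codon TGA is a stop codon (shown as `*`), which is why the protein ends in `VV`.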