Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1449 |
Symbol | |
ID | 8383728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1419487 |
End bp | 1420479 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644972512 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003130358 |
Protein GI | 257052525 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA ACTACCACAT CTTCTCGGAC GGACGGGTCG AGCGCCACAA CGACACGGTA CGGCTCGTCA CCGAGGACGA CGAGAAGAAG TACCTCCCGA TCGAGAACGC CGAGGCGCTG TACCTCCACG GGCAGATCGA CTTCAATACG CGAGTGATCT CGTTTCTCGA CGATCACGGT GTCGCGATGC ACGTCTTCGG CTGGAACGAC TACTATTCGG GCTCGATCAT GCCTGAGCGT GGCCAGACGT CGGGTCAAAC GGTCGTCGAA CAGGTCCGGG CCTACGACGA CGAAGCCCAT CGTGGCAACA TCGCCCGTGA GATCGTGGCG GGAAGCATCC ACAACATGCG GGCGAACGTC ACCTATTACG ACAACCGGGA CTACGACCTC AGCGCGACAC TCGAGTCACT CGATCGTCGA CGTGACGAGA TCAAGTCCGT GGCGTCAGTC GAGGAAGCAA TGGGCGTCGA AGCGAGTGCG CGACGTGCCT ACTACGCGAT CTTCGATCAG ATTCTTCCCG ACGCCTTCGT CTTCGGGGGT CGAAAGTACA ATCCGCCAAA CAACAAGGTG AACAGCCTGA TCTCGTTCGG GAACAGCCTG GTCTACGCGA ATATCGTCTC GGCAATCCGG GCCACGGCAC TCGATCCGAC GATCAGTTAT CTCCACGAGC CTGGCGAGCG ACGGTACTCA CTGGCGCTGG ATCTTGCCGA TCTCTTCAAA CCCGTCCTGA CTGATCGCGT CGTCTTCCGA CTCGTGAATC GAGGGCAGTT GTCCGACGAT GATTTCGATT CGGAGATGAA CGCGTGTCTG CTCACCGAGA GTGGCCGGGA GACGTTCTCG AAGGAGTTCG AGCAGACGCT CGATCGGACG ATCGAACACC CAAATCTCAA CCGGAAGGTC AGTTATCAGT ATCTCCTTCG GGTCGAGGCG TACAAACTCA AGAAGCACTT GTTGACTGGC GAGTCCTACG AGTCCTTCGA GCGGTGGTGG TAA
|
Protein sequence | MNDNYHIFSD GRVERHNDTV RLVTEDDEKK YLPIENAEAL YLHGQIDFNT RVISFLDDHG VAMHVFGWND YYSGSIMPER GQTSGQTVVE QVRAYDDEAH RGNIAREIVA GSIHNMRANV TYYDNRDYDL SATLESLDRR RDEIKSVASV EEAMGVEASA RRAYYAIFDQ ILPDAFVFGG RKYNPPNNKV NSLISFGNSL VYANIVSAIR ATALDPTISY LHEPGERRYS LALDLADLFK PVLTDRVVFR LVNRGQLSDD DFDSEMNACL LTESGRETFS KEFEQTLDRT IEHPNLNRKV SYQYLLRVEA YKLKKHLLTG ESYESFERWW
|
| |