Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_2838 |
Symbol | |
ID | 8412389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 2720110 |
End bp | 2721102 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645021183 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003178650 |
Protein GI | 257388877 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.150798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACA ACTACCACGT CTTTTCCGAC GGACGCATCG AACGCCACGA CGACACGGTA CGGGTCATCA CCGACGACGG CGAGAAAAAA TACCTCCCGG TCGAGAACGC CGAGGCGATC TTCCTCCACG GTCAGATCGA GTACAACACC CGCTTCGTCT CCTTTCTCAA TCAGGAAGGC GTCGCCGTAC ACGTCTTCGG CTGGCACGAT CACTACGCCG GGTCGATCAT GCCCAAGCGG GGCCAAACGT CCGGACAGAC ACTCGTCGAC CAGGTCCGGG CCTACGACGA TCCGGCCCAC CGGCTCGAAC TGGCTCAGGC GTTCGTCGAC GGCAGCATCC ACAACATGCG TGCGAACGTC ACGTACTACG ACGGCCGAGG ACACGACTTC GAGGACGTGC TGGCAGAGCT GACCGAAGCC CGGTCGTCAC TCGACAGGAT GGAGACGATC GACGAGACGA TGGGCGTCGA AGCACGCGCC CGAAAGGCGT ACTACTCGAC CTTCGACGAG ATCCTGCCCG ACGAGTTCGT CTTCGGCGGC CGCCAGTACG ATCCGCCGAA CAACGAAGTC AACAGCCTCA TCTCTTTCGG CAATTCGCTC GTCTACGCCA ACGTCGTCTC GGCCATCCGA GCGACGGCAC TCGATCCCAC GGTCAGCTTC CTCCACGAGC CCGGCGAGCG TCGGTACTCG CTGGCCCTGG ACATCGCCGA CCTGTTCAAA CCGTTGCTCG CGGATCGAGT CATCTTCAGA CTCGTCAACC GCGGCCAGCT GACCAGCGAC GATTTCGAGG CCGAGATGAA CGCCTGCCTG CTGAACGAGC ACGGCCGGAA GACCTACTCG AAGGCCTACG AAGAGACGCT CGACGAGACG ATCGAGCACC CGGATCTGGG AAAGAAGGTG AGCTATCAGT ATCTCCTCCG AGTCGAGGTG TACAAGCTCA AAAAACATCT CCTGACCGGC GAGGAGTACG TCCCGTTCCA ACGGTGGTGG TGA
|
Protein sequence | MNDNYHVFSD GRIERHDDTV RVITDDGEKK YLPVENAEAI FLHGQIEYNT RFVSFLNQEG VAVHVFGWHD HYAGSIMPKR GQTSGQTLVD QVRAYDDPAH RLELAQAFVD GSIHNMRANV TYYDGRGHDF EDVLAELTEA RSSLDRMETI DETMGVEARA RKAYYSTFDE ILPDEFVFGG RQYDPPNNEV NSLISFGNSL VYANVVSAIR ATALDPTVSF LHEPGERRYS LALDIADLFK PLLADRVIFR LVNRGQLTSD DFEAEMNACL LNEHGRKTYS KAYEETLDET IEHPDLGKKV SYQYLLRVEV YKLKKHLLTG EEYVPFQRWW
|
| |