Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1453 |
Symbol | |
ID | 8383732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1424450 |
End bp | 1425529 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644972516 |
Product | CRISPR-associated protein, Csh2 family |
Protein accession | YP_003130362 |
Protein GI | 257052529 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3649] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR01595] CRISPR-associated protein, CT1132 family [TIGR02590] CRISPR-associated protein, Csh2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAC CCACACAGAC CGTCGAGAAC CGTTCCGAGA TCGTGTTCCT GTACGATGCC GTCGACGCGA ACCCGAACGG CAATCCGCTG AGCGGATCGA ACAGGCCGCG GATCGATCCC CAGACCCAGC AGGCGATCGT CACGGACGTT CGGCTGAAGC GGTACCTCCG CGACCAGCTG GACGACGACG GCCACGGCGT CTACATCCGG AACGTTCAGG AAGAGGGCAC GCAGTACACA CGCGCCGAAC TCCTCGAAGA CAGGCTGAAG GCCGTCGATC CCGACGATTA CGACCTCGAT GACGACGAGG CAGCGGCGCA GTTCCGGGAC GACGTCTTCG GGGAATACCT CGAAGAGAGT GCCGACGTAC GCTACTTCGG CGCGACGATG TCTGTCGATA CTGACAATGC GTACGCGAAA CATCTCCCGG ACCACTTCAC TGGCCCAGTT CAGTTCTCGC CAGGCAAGTC GATCCACGCT GTCAACGAAA ACGAGGAATA TGACAGTCTC ACCAGCGTGA TCGCCACCCA GGAGGGGAAA GAACAGGGTG GGTTCGACCT CGACGACCAC CGCATCCAGT ACGGGCTCAT TCGGTTCCAC GGACTCGTCG ACGAACACGG GGCCGCCGAC ACGAATCTCA CGCGAGCGGA CGTAGAGCGT CTCGATACAC TCTGTTGGCG GGCGATCAAG AACCAGACCA TCAGTCGGAG CAAAGTCGGC CAGGAGCCAC GTCTCTACTG TCGCGTCGAA TACGGCGAGG AAAGCTACCA TCTCGGCGGC CTGGACAAGG ATCTCACCCT TGACGACGAG GCCTCGAAGG ACCACGACGA ACTCCGAAAC ATCCGCGATC TTACGCTGGA GATCGATGAT TTCGTGGATC GGATCTCGAA CGCCAGCGAC CAGATCGAAC GCATCCGGGT GGTCGCGAGC GACGTCCTGG AACTCTCTCA CGGGACCGAC AGCGGTGGGC CGGACCTCCT GTACGACGCG CTTCGAACGG CAATCGGACC GGACCGAGTC GATGTCGTCG ATGTCTACGA CGAGTATCCT GAAACGCTGC CACAGAGCAC CGGCGAGTGA
|
Protein sequence | MSEPTQTVEN RSEIVFLYDA VDANPNGNPL SGSNRPRIDP QTQQAIVTDV RLKRYLRDQL DDDGHGVYIR NVQEEGTQYT RAELLEDRLK AVDPDDYDLD DDEAAAQFRD DVFGEYLEES ADVRYFGATM SVDTDNAYAK HLPDHFTGPV QFSPGKSIHA VNENEEYDSL TSVIATQEGK EQGGFDLDDH RIQYGLIRFH GLVDEHGAAD TNLTRADVER LDTLCWRAIK NQTISRSKVG QEPRLYCRVE YGEESYHLGG LDKDLTLDDE ASKDHDELRN IRDLTLEIDD FVDRISNASD QIERIRVVAS DVLELSHGTD SGGPDLLYDA LRTAIGPDRV DVVDVYDEYP ETLPQSTGE
|
| |