Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3327 |
Symbol | |
ID | 7402183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | - |
Start bp | 76517 |
End bp | 77512 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643709879 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002567445 |
Protein GI | 222481209 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAAC CCAATCATCA CGTATTCACC GACGGCGAAC TGTCGCGAAG TGAAGACACA CTCAGAATCG ACACCCTCGA TGGGGAGGTG GAACACCTAC CGGTCGAAAG TATCGACACG CTGTATCTGC ACGGGCAGAT CGATTTCAAT ACCCGAGCAC TCGGCCTGCT CAACGATCAC GGAGTTCCTG CGCACGTGTT CGGATGGAAG GACTATTACA AAGGGTCATA TCTCCCGAAA CGGAGCCATC TCTCGGGAAA CACCGTCGTC GAGCAGGTAC GCGCGTATGA TGATCCGGAT CGTCGACTCG GGATTGCCAC GTTGATGATC GAAGCGAGTA TCCATAATAT GCGGGCAAAT CTTGTCTATT ATAATGCCCG CGACTGTTCG TTCGACTCAG AAATCGATCG GCTCGAATCA CTCAAGACGA AAGCAAGCAC AGCCGAAAGC ATTGACGGAC TTCGTGGGAC CGAAGCAACC GCACGCAAAA CCTATTATTC GTGTTTCAGT GAGATTCTGC GTGACCCGTT CGCCCTAGAT CGGCGTGAAT ACAATCCACC GACCAACGAG ACGAACGCGC TCATTTCGTT CCTGAACGCG ATGGTTTATA CGGCCTGTGT CTCTGCGATC CGCAAGACAG CACTTGATCC AACAGTCGGG TTCATGCATG AACCGGGTGA TCGCCGATTT ACGCTCTCGC TCGATATCGC GGATATTTAT AAACCGATAC TCGCGGATCG TGTGCTGTTT CGGCTCGTAA ACCGTCGACA GATCAGCCCT GATGAGTTCG AATCGGATCT CGATGGCTGT CTGCTTACTG AAGACGGTCG ACTGACAGTG CTTGCCGAGT ATGAGGAAAC GCTTGATAAA ACAGTCGAGC ATCCCCGACT AAAGCGAAAC GTGAGCTACA AAACACTCGT GCAAACGGAT GTGTACAGTT TGAAGAAGCA CATACTGACC GGCGAGCCGT ACCGGCCGAC CGAACGGTGG TGGTAA
|
Protein sequence | MTKPNHHVFT DGELSRSEDT LRIDTLDGEV EHLPVESIDT LYLHGQIDFN TRALGLLNDH GVPAHVFGWK DYYKGSYLPK RSHLSGNTVV EQVRAYDDPD RRLGIATLMI EASIHNMRAN LVYYNARDCS FDSEIDRLES LKTKASTAES IDGLRGTEAT ARKTYYSCFS EILRDPFALD RREYNPPTNE TNALISFLNA MVYTACVSAI RKTALDPTVG FMHEPGDRRF TLSLDIADIY KPILADRVLF RLVNRRQISP DEFESDLDGC LLTEDGRLTV LAEYEETLDK TVEHPRLKRN VSYKTLVQTD VYSLKKHILT GEPYRPTERW W
|
| |