Gene Hlac_3327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3327 
Symbol 
ID7402183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp76517 
End bp77512 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content53% 
IMG OID643709879 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002567445 
Protein GI222481209 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAAC CCAATCATCA CGTATTCACC GACGGCGAAC TGTCGCGAAG TGAAGACACA 
CTCAGAATCG ACACCCTCGA TGGGGAGGTG GAACACCTAC CGGTCGAAAG TATCGACACG
CTGTATCTGC ACGGGCAGAT CGATTTCAAT ACCCGAGCAC TCGGCCTGCT CAACGATCAC
GGAGTTCCTG CGCACGTGTT CGGATGGAAG GACTATTACA AAGGGTCATA TCTCCCGAAA
CGGAGCCATC TCTCGGGAAA CACCGTCGTC GAGCAGGTAC GCGCGTATGA TGATCCGGAT
CGTCGACTCG GGATTGCCAC GTTGATGATC GAAGCGAGTA TCCATAATAT GCGGGCAAAT
CTTGTCTATT ATAATGCCCG CGACTGTTCG TTCGACTCAG AAATCGATCG GCTCGAATCA
CTCAAGACGA AAGCAAGCAC AGCCGAAAGC ATTGACGGAC TTCGTGGGAC CGAAGCAACC
GCACGCAAAA CCTATTATTC GTGTTTCAGT GAGATTCTGC GTGACCCGTT CGCCCTAGAT
CGGCGTGAAT ACAATCCACC GACCAACGAG ACGAACGCGC TCATTTCGTT CCTGAACGCG
ATGGTTTATA CGGCCTGTGT CTCTGCGATC CGCAAGACAG CACTTGATCC AACAGTCGGG
TTCATGCATG AACCGGGTGA TCGCCGATTT ACGCTCTCGC TCGATATCGC GGATATTTAT
AAACCGATAC TCGCGGATCG TGTGCTGTTT CGGCTCGTAA ACCGTCGACA GATCAGCCCT
GATGAGTTCG AATCGGATCT CGATGGCTGT CTGCTTACTG AAGACGGTCG ACTGACAGTG
CTTGCCGAGT ATGAGGAAAC GCTTGATAAA ACAGTCGAGC ATCCCCGACT AAAGCGAAAC
GTGAGCTACA AAACACTCGT GCAAACGGAT GTGTACAGTT TGAAGAAGCA CATACTGACC
GGCGAGCCGT ACCGGCCGAC CGAACGGTGG TGGTAA
 
Protein sequence
MTKPNHHVFT DGELSRSEDT LRIDTLDGEV EHLPVESIDT LYLHGQIDFN TRALGLLNDH 
GVPAHVFGWK DYYKGSYLPK RSHLSGNTVV EQVRAYDDPD RRLGIATLMI EASIHNMRAN
LVYYNARDCS FDSEIDRLES LKTKASTAES IDGLRGTEAT ARKTYYSCFS EILRDPFALD
RREYNPPTNE TNALISFLNA MVYTACVSAI RKTALDPTVG FMHEPGDRRF TLSLDIADIY
KPILADRVLF RLVNRRQISP DEFESDLDGC LLTEDGRLTV LAEYEETLDK TVEHPRLKRN
VSYKTLVQTD VYSLKKHILT GEPYRPTERW W