Gene Hlac_3578 details

Gene Information

Locus tag: Hlac_3578
Symbol:
ID: 7402493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 332259
End bp: 333254
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 56%
IMG OID: 643710116
Product: CRISPR-associated protein Cas1
Protein accession: YP_002567682
Protein GI: 222481446
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
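
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions agree with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(332259, 333254)
print(length, protein_length_aa(length))  # prints: 996 331
```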


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCCA CCGAAGGGAT GTTCGACGAG TCAGTGGTCT ACGTCACCAA GCAAGGAAGC 
CAGGTTGGCA CCGAGGGCGG TCGAATCACC GTCTGGGATG TCGACGGCGA CGAGGGTGAG
TTAGCCTCGT TCCCGACCGA GAAGCTCGAT ACGATCAACG TCTTCGGCGG GGTGAACTTC
TCGACACCGT TCGTCGCCGA GGCCAACCGT CACGGGATCA TTCTGAACTA CTTCACCCAG
AATGGAAAGT ATCGGGGGAG CTTCGTACCT GAAAAGAACA CCATCGCGGA GGTCCGGCGA
GCCCAGTATG ACCTCGACGA GACTGCGGAG ATCGACATCG CGGCAGATAT GATCGCCGCC
AAGATCCGAA ACGCTCGGAC GCTGCTCTCG CGGAAGGGCG TCCACGGGAC GGAGCTGCTC
AAGGATCTCG GTGTGCGGGC GACGACAGTA GCTACGAAGG ACGGCCTCCG TGGTGTTGAA
GGAGAAGCCG CCGAGCGCTA CTTCAACCGT CTCGATGAGA CACTCACCGA TGGCTGGACC
TTCGAGAAGC GGACCAAGCG ACCGCCAGAG GACCACATTA ACTCATTGCT ATCACTGACC
TACGTGTTTA TGAAAAACGA AGTGCTGAGC GCGTTACGGC AGTACAATCT TGATCCATTC
TTGGGTGTGC TACATGCGGA TCGGCATGGC CGACCCTCGC TGGCACTCGA TCTCCAGGAG
GAGTTCAGAC CGATCTTCTG TGATGCGTTC GTGACACGGT TGGTTAATCG CGGTGTCATC
ACCCACGATG AGTTCACTCA GGACAATCAT TTGGCCGACG ATGCATTTCA GACCTACTGC
TCAAAGTTCG ACGAGTTCAT GCAAGAGGAG TTCACCCATC CGTACTTTGA GTACACTGTG
ACTCGGCGTA AGGCAGTGCG ACAGCAGGCG ATTCTCTTAC GGAAAGCAAT CACTGGCGAG
TTGGATGAAT ATCATGCGCT AACTTTTTCA AAATGA
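
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any statistic such as GC content is computed from it. The following is a minimal sketch of that step. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed percentage refers to that line and not to the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten bases).
first_line = "ATGAAAGCCA CCGAAGGGAT GTTCGACGAG TCAGTGGTCT ACGTCACCAA GCAAGGAAGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 996-base sequence through the same two functions would give a value to compare with the GC content field in the record.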
 
Protein sequence
MKATEGMFDE SVVYVTKQGS QVGTEGGRIT VWDVDGDEGE LASFPTEKLD TINVFGGVNF 
STPFVAEANR HGIILNYFTQ NGKYRGSFVP EKNTIAEVRR AQYDLDETAE IDIAADMIAA
KIRNARTLLS RKGVHGTELL KDLGVRATTV ATKDGLRGVE GEAAERYFNR LDETLTDGWT
FEKRTKRPPE DHINSLLSLT YVFMKNEVLS ALRQYNLDPF LGVLHADRHG RPSLALDLQE
EFRPIFCDAF VTRLVNRGVI THDEFTQDNH LADDAFQTYC SKFDEFMQEE FTHPYFEYTV
TRRKAVRQQA ILLRKAITGE LDEYHALTFS K
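
The protein sequence can be checked against the gene sequence by translating it with the codon table named in the record (translation table 11). The sketch below builds that table from the standard NCBI amino-acid string for table 11 and stops at the first stop codon. It is an illustration only: it does not handle alternative start codons or ambiguous bases, and `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# NCBI translation table 11, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as displayed in the record.
print(translate("ATGAAAGCCA CCGAAGGGAT GTTCGACGAG"))  # prints: MKATEGMFDE
```

The output matches the first ten residues of the protein sequence listed above.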