Gene Hlac_3332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3332 
Symbol 
ID7402188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp82881 
End bp85034 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content55% 
IMG OID643709884 
ProductCRISPR-associated protein, Csh1 family 
Protein accessionYP_002567450 
Protein GI222481214 
COG category 
COG ID 
TIGRFAM ID[TIGR02591] CRISPR-associated protein, Csh1 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCC CGACACGGGA AGACTTCGCA GCAGCCATCG ATGACTACTG GCATGGTCGA 
CCACCGACCA GCCTCGAAGA CGTGATGTCA CTGTACGGTG TACTTGCAGT TGCCGAATCC
GGAGGCTCGC TGTACGGTAC GCCAAGTACG CTTGACCCCT TTATCGACGA CGGGCGTTTG
GTCGTTATTG ATCTCGATTT GACTGGCGAC GAGCCAACAT ACGAACAAGG AGGCGCAGGT
ATCAGCGTCG ATACCCTCCG CACCGATGAT ATAGCAAAGC TCGCGTACTC ACACAAGTCC
TCCGGCCGGG GAGCGAAGTA CAGTCTCACG CAGATCGGAT CGAAAAACGG AAACGACGCC
TCGGGTGTTG CTGGGACGAT CCTGAGGCGT GTTCGTTCAT GGACTAATCA AGATAGCGTC
CAGAGTATTA CTGGTGACAA CGGACACCCT GATGCGTGGG TTATCGAAAC GTTAGACAAG
GTATTCACGA AAGACGGCGA GACACTCGAA CGAATTCAAT CAGATATAGA GTCACTACTA
CCCACAGACG ATTCAGTCCC GACTGTACTG ACTGTCCGCT GTCGGATCAA CGACACAGAT
CTCAGCACTA CTGACAATGA TGACACAGAC TGGTTTTGGC CCGCCGATCT CGATGTACTC
AACCGAGCAA TGCGACGCTA CGCGACGGCC AACGCCACTG ACAAGAATAT TAGTTCTGGC
CCTGCTTCCG AAGGCGAGGC AGTCGGTGTT GTGACCGATC AGGTGGGTCG TGTAGTCGGG
ACACCTGAAA GCCCGCTGGA AGTTTTTTCA GTCAAACATC CGGATGCTCA GCCCGGCCTC
CGACGTGACC AATCGTGGCG AAACTATCCG GTGAGTGCCG ACGTTGCGAT GCTGTTCAGT
AAAGGTCAGG ACCTTATCGA GCGCTGTGTG CTTCGCCGAG GCGGGGTTGA GACGTACACA
CTCCCGTACT TCGCTGGCGA ACTCACCGAA TTAAAAGCTG AAACACTATA TCGGGCGATC
CATTCACTCG ATTCAGAAAG CGAGTACACG GACAGTTCAC ATGCCCCGAT GGCGAGAGTT
ACCTACGAGA TCCAAGAAAA CGACGATCCA GCAGTTCGGG CACTGAAAGA CGAAATCCGA
TTCTATGTCG TCTCATTACC AATCAGTGAC GACAAAAACA TCATCGCCGA AGAACCCTCG
GCGGGTATCT ATTGGGTCTC CGAACTCGCA AAGTCGCTTG TCGACACGGT CCAAGGGCCG
ACACTCTCGC CCGAATCGGG AGGATTCAGA ACCTACGATA ACTGGGACCT CCTTGATTTT
CCCGACGAGA CGGGTCAGGC TCGAAACGTC GCGTTCGGCA AAATCGTCGG CCACACATTC
ACCGACGCCG TGTTTGCCTA CCGTGACGAG GAGGGGGACG ACTTCCGACG AATCGTCGAC
CACCGACTCA TCGCCGGAGT ACCACTTGAC GCCTCGATGC TCTTTTCAGA GTACATGCAG
CGCTGGGGTG ACGAGTTCGA TGGCGACGAC CCGGTTCCAC AACAGGTCAT CGCCCAACAG
CTCGTGCATC TCGAATCGCT GTCGCGTGCC GGACTCCTAA CAGGACTCGA CGCGGCGGTC
GAACCAACGG ACACAGAGAC TATGACAGCA ATCGAAGACA CAGACACTGA CCTTTCGAAC
CTCGCGGCGA TCCGTAAGTA CCGACTCGAA TCGTTCCTCG ACCGACCCAT GTTCACCGAG
AACGCCGAGC GAAAGGCAAC CGCCCTCGCT GGCGTTCTCG TCGGACAAAT AAGTTGGCAT
CAAGCAGACG AGCGCGATCT CGGTCGACCG CTCGACTCGA AAACGCGTGG CGATCAGCTC
ACGAAAAACG GGCTCGAACA AGCGGTGAAA ACAGCCTTAG AAAAGGCAAA AGTGTACGCA
CACGACTCAA AACAGTACGC TGACCGAGAT ATTCTGTTCC CTGAAACAGT CGACCTGCTT
CTTGAAACGA CGGAGCAGAT GCCGACCAAA TGGGAGATCG ACAAACGTGA TCTCCAGTTT
GCCTACGTGC TCGGACACGC CCACGGTCGA CGGTCGATGC CGAACGCGTT CGATCTGTAC
AAAAAAGACG ACGCAAGCGA GGAGAGCGAA GCAGCCGAAT CGCCCGCAAA CTGA
 
Protein sequence
MSGPTREDFA AAIDDYWHGR PPTSLEDVMS LYGVLAVAES GGSLYGTPST LDPFIDDGRL 
VVIDLDLTGD EPTYEQGGAG ISVDTLRTDD IAKLAYSHKS SGRGAKYSLT QIGSKNGNDA
SGVAGTILRR VRSWTNQDSV QSITGDNGHP DAWVIETLDK VFTKDGETLE RIQSDIESLL
PTDDSVPTVL TVRCRINDTD LSTTDNDDTD WFWPADLDVL NRAMRRYATA NATDKNISSG
PASEGEAVGV VTDQVGRVVG TPESPLEVFS VKHPDAQPGL RRDQSWRNYP VSADVAMLFS
KGQDLIERCV LRRGGVETYT LPYFAGELTE LKAETLYRAI HSLDSESEYT DSSHAPMARV
TYEIQENDDP AVRALKDEIR FYVVSLPISD DKNIIAEEPS AGIYWVSELA KSLVDTVQGP
TLSPESGGFR TYDNWDLLDF PDETGQARNV AFGKIVGHTF TDAVFAYRDE EGDDFRRIVD
HRLIAGVPLD ASMLFSEYMQ RWGDEFDGDD PVPQQVIAQQ LVHLESLSRA GLLTGLDAAV
EPTDTETMTA IEDTDTDLSN LAAIRKYRLE SFLDRPMFTE NAERKATALA GVLVGQISWH
QADERDLGRP LDSKTRGDQL TKNGLEQAVK TALEKAKVYA HDSKQYADRD ILFPETVDLL
LETTEQMPTK WEIDKRDLQF AYVLGHAHGR RSMPNAFDLY KKDDASEESE AAESPAN