Gene Hlac_3576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3576 
Symbol 
ID7402491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp329495 
End bp331648 
Gene Length2154 bp 
Protein Length717 aa 
Translation table11 
GC content50% 
IMG OID643710114 
ProductCRISPR-associated helicase, Cyano-type 
Protein accessionYP_002567680 
Protein GI222481444 
COG category[R] General function prediction only 
COG ID[COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster 
TIGRFAM ID[TIGR03158] CRISPR-associated helicase, Cyano-type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAGC ACGATCCAGT TTCGTTTCAG CTTGCAGGGC TTGGTCTGCG AATGTACCCC 
GGAGAGTACC CTGTGGATGA GGTTATCCCT CATAAACATC AGTGGGCACT GCATGATGCA
CTCACAGACC GTCTTCCAGG ACTGTTTGTA GACGATGCGC CGACAGGAGC CGGTAAAACG
CTGGCATGGC TCGCTCCCGT CGTCTCCGAG GGACTCCAGA CAGTAGCAGT ATACCCGACG
AACGCGCTTA TTGAAGATCA GGTCCGGAAT ATCGACGAGA AACTCGAAGA TGTTGAGGGA
GGAAATAATG TCCGTCTCCT TGCTGTGACT AGTGAAACGC TCCAGAACGA ACACGCCGAG
CAGTTCCCAT CGGCTAACAC AAACGGTGAA CGGCTTCGAG ACCTCCTCAA AGTAGCATTC
GACAGCCGTG AGACGGTTAT CCTTCTCACG AACCCAGACA TATTTGTCCT GCTTCGTCGA
GAAATCTATC GGGAGAGAAT CGATGCGATC TCTCGGTTTG AGGTGGCTGT TGTTGATGAG
TTCCATCGAG CAACACGAAA AGAGCGAAAC ACGATGCTCT TTCTGCTCGA TGAGATGTAC
GAGACTAATG AAGCGATCTG TCGACTTAAT CATCTCGTAT TTCTCAGTGC AACTCCTGAG
AAAGAACTCG AACGCCAGTT TGAGGAGGCA ATGGGTGTGC CGTATTATCG AGTCAGTGAA
GTAAACTGGC GAGAGACGCC ACTACCCGAA GTTCCTGATA CTGAGGAGTC TGTGATTGCA
TTCGCTCCTA ATGAACTCCC ATCGAACTAT CGGGCAGTCC TGCCACCAGT TGATCTGACA
ATCACACCAG CACCGACCTT TGAGGCTGCA ACAGCGATCC TTGACTCGGA AGACGTGTTG
CATGATCGAT TGCGGGACGG CAGGACAGTG ATGATGCTTG ATGGGGTTCA CGAGGTTGAT
CAAGTATATA ATTCGTTGGC TCAATCAGAC CTCAATGGGG CCGTTCGAAT CGATGGGTTT
CACCGGGAAA ACGTGCGTGA GAAACTCGAT ACGTTCGAGA CGTTAGTGAG TAATTCTGCG
GTTGAGGTCG GTGTCGACTT CGATACAGAG CAAATCATCT TCTCAGGTCA CGATGCGGCG
AGTTTCTTCC AACGCCTTGG TCGACTCAGA ACTCGCCCGA ACCGGTCTGC AGCGTATGCA
TATGTCCCAC CGTATCTCCT TGACGCGCTC AACTCGATGG CCAACAAATA TACTGGCCAA
TGGATTGACA GAGCAACATT TGAAGAGTAT GTTAGTTCGG GATACATTGA CGCGTCGACA
CCCGGATCAT TCGACTGGCG ATACTCGGCT GTCGAAGCCT ACGATCACGT CGAAAAACGA
GCAAAGAGTG CACCATCAGA TGATCAGCCA GCAATACGGG AGACAGGCTG GAAACGCATT
GAACGTCACT TCTTCCACAA CCACGAGAAG GAACTTACTG AGGCTGATCT GAAACGATTA
TATGGTGTTG CAGGTACGCC ACTTTTGGAT TCGCTCCAAA CTTATCGCGG TGACAGCATA
CAGACAGTAG TTTATAACAA CCAGTCACAG ACGCTTCAGA CATACAGTAT CCCACATCTG
CTCCGTCATG GGGATGTCTC GTTCCATTCG AAAGACGAGT TTCTTTCTAT CCTTCCAGAG
CAGTTGCATA ACCAGGTATC ACGGCTGGAA CCATACAGTA GTGGCTACTG TCTCTATCAG
GGAGGGTATG AAGACGAAAC AGATACTACG GACGAAGAGC AGTTGACTGG ACGGTTGGTT
ACCTACAAAG CAACTGGTGA ACTGTATTCG TTATTAAGTG ATGTCTCACG GGATATTCGA
ACACCGAAAG TCTGCACTGG ACTCGAAATC GAGACAGAGC CGAGCGTCAG CGGTCTCGAT
TTACTAAAGG GGGGCCTCTC GGATACTGAA GTTCTTTGTT ATCCGTTGGA GGGCCACGTC
TCCCAGATAC AAAACCAGTA TTCGCTCGGT CCGTTTGGAT TTATTTATCC ACTGTATTAC
ACGGAAGGAG ATGCAGCAGT AGCATTCAGC CACGATGCCC TGTATTTACA CTGTCGCGTC
CAAGATCGGA TCGAGGCCGA ATCAAATACG ATTGATGAGG TGCTTGATTT TTGA
 
Protein sequence
MSEHDPVSFQ LAGLGLRMYP GEYPVDEVIP HKHQWALHDA LTDRLPGLFV DDAPTGAGKT 
LAWLAPVVSE GLQTVAVYPT NALIEDQVRN IDEKLEDVEG GNNVRLLAVT SETLQNEHAE
QFPSANTNGE RLRDLLKVAF DSRETVILLT NPDIFVLLRR EIYRERIDAI SRFEVAVVDE
FHRATRKERN TMLFLLDEMY ETNEAICRLN HLVFLSATPE KELERQFEEA MGVPYYRVSE
VNWRETPLPE VPDTEESVIA FAPNELPSNY RAVLPPVDLT ITPAPTFEAA TAILDSEDVL
HDRLRDGRTV MMLDGVHEVD QVYNSLAQSD LNGAVRIDGF HRENVREKLD TFETLVSNSA
VEVGVDFDTE QIIFSGHDAA SFFQRLGRLR TRPNRSAAYA YVPPYLLDAL NSMANKYTGQ
WIDRATFEEY VSSGYIDAST PGSFDWRYSA VEAYDHVEKR AKSAPSDDQP AIRETGWKRI
ERHFFHNHEK ELTEADLKRL YGVAGTPLLD SLQTYRGDSI QTVVYNNQSQ TLQTYSIPHL
LRHGDVSFHS KDEFLSILPE QLHNQVSRLE PYSSGYCLYQ GGYEDETDTT DEEQLTGRLV
TYKATGELYS LLSDVSRDIR TPKVCTGLEI ETEPSVSGLD LLKGGLSDTE VLCYPLEGHV
SQIQNQYSLG PFGFIYPLYY TEGDAAVAFS HDALYLHCRV QDRIEAESNT IDEVLDF