Gene Hlac_1535 details


Gene Information

Locus tag: Hlac_1535
Symbol:
ID: 7401465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1555292
End bp: 1557268
Gene Length: 1977 bp
Protein Length: 658 aa
Translation table: 11
GC content: 69%
IMG OID: 643708601
Product: type III restriction protein res subunit
Protein accession: YP_002566193
Protein GI: 222479956
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
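
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and not part of the record.

```python
# Values copied from the record above.
start_bp = 1555292
end_bp = 1557268

# Assumption: coordinates are inclusive, so both end positions count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1977 bp) and Protein Length (658 aa) fields.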


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG ACTCCGGCGA TCCCGCCGCC GGCGATGCTT CCGATTCTGA CACCGGCGAC 
GCCCCCGATT CCGCTGCTGG CAACGACTCC AGAGCGGACC CCGATTCTCA GGATAACGAC
GCCGCCGAAG AGCTCTCGCT CGACCGCTTC CACGAGGCGT TAGAGGCCGA GGAACGACCC
GTCGCGACCG CGAGCGAGGT CGCGAGACGG CTCGGTACCA CGCAGGCGGC CGCGCGCGAC
GCCCTCGCTG CGCTCGTCGA CCGCGGCGAC GTGGACCGGC TTGACGTCGA GAGCGATCCC
ATCGTCTTCT ACCCGACCGA CTGGGGCCGG CTGGCGACCC GCGAGCGCGT CGTCGCGTTT
CCGAGCCGCC GAGAGATCGT GGTCGACCGC CCGACGCAGT TCACCCGGGC GCGGCTCTCG
CAGTTCGCGC ACCTCGTCGA CACCACCGGC ACCGAGCCCG GCACGCGCGG GTACCTTTAT
CGGATCCGCC AGGAGGACGT GTGGGCCGCG CCGTTCGAGG ACGCCGACGG CCTGATCGCG
AGCCTCCGCT CGGTGCTTCC TCGCCGGTTC GACCACCTCG AAGACTGGAT CCGCGACCAG
TGGCGCCGGG CGCACCGCTT CCGGTTATAT ACCCACGAGG ACGGCTACGT CGTCCTGCAG
GCCGCCTCCG AGAGCCTGAT GGGCAACGTC GCGGACCAGC ACCTCGACGA CGATCACCTC
CGGGCACCCA TCTCCGAGAC GGAGGCGTGG GTCAACGAGG ACGCCGTCGC CGCGGTGAAA
CGCGCCCTCT ACGACGCCGG CTACCCGGTC GAGGACGACC GCGACCTCGA CGTGGGCGAC
CCGGTCGATA TCGACCTGAC AACCGATCTC CGGTCGTATC AGGAGACGTG GGTCGAGACC
TTCCTCGACG CGCGCTCCGG AGTGTACGTC GGCCCGCCGG GGTCCGGCAA GACCGTCGCC
GCCATCGCGA CCATCGCGGC GATCGGCGGC GAGACGCTGA TTCTCGTCCC CTCCCGCGAA
CTCGCCGGCC AGTGGCGCGA GGAACTGCTC GAACACTCCA CGGTCGACCC GGCCGACATC
GGACTGTACC ACGGCGGCCA AAAGGAGATC CGACCGGTCA CGATCGCGAC CTACCAGATC
GCCGGGATGG ACCGCCACAG GGCCTTGTTC GACTCCCGGA AGTGGGGGTT GATCTGCTTC
GACGAGGCTC ATCATATCAC CGCCCCCATA TTTTCACGGT CTGCAGAGCT GCAAGCAAAA
CACCGCCTTG GCCTCTCGGC CACGCCGGTC CGTGAGACCG GCAGCGAAGA GGAGATATAC
ACCCTGATCG GTCGGCCGAT CGGTGCCGAC TGGGACGAGC TGTTCGAGGC CGGCTTCGTT
CAGGAGCCGG AGGTCGAGAT TCGGTACGTC CCGTGGCGCG ACGAGATGGC CCGCAACGAG
TACGCCAGCG CCGACGGGCG GGAGCGACGA CGCCTCGCCG CAGAGAACCC CGCGAAGATC
GAGGAGATCC GGTACCTGCT CGCCGCTCAC CGCGACAAGA AGGCGCTCGT GTTCATCGAA
TACCTCGATC AGGGCGAGGC GATCGCCGAT GCGCTCGGCG TCCCGTTTAT AAGCGGCGAG
ACGCCCCACC ACGAGCGGGC GGAGCTGTTC CGGCGATTCC GCGAGGAGGG CACGGAGGGC
GGAGAACGCG AGGGAATCGG TGCAGACGGA GACGACGTCG ACACCCTCGT CGTCTCCCGT
GTCGGTGACG AGGGAATCGA TCTCCCGAAC GCCGAACTCG CGATCGTCGC CAGCGGGCTC
GGCGGCTCGC GCCGGCAGGG CTCTCAACGG GCGGGCCGCA CCATGCGACC GACCGGCTCC
GCGCTCGTGT ACGTCCTCGC GACCCGCGGA TCGAGCGAGG AGGAGTTCGC CCAGCGACAG
ATGCGCCACC TCGCGCGCAA GGGGATCCGG GTTCGGGAGA CGAACGTCGC GGAGTGA
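
The gene sequence is displayed in groups of 10 bases, 60 per line, so the spacing has to be removed before the sequence can be analysed. The following is a minimal sketch of that clean-up together with a GC-fraction calculation of the kind behind the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as sample input, so the printed percentage applies to that line rather than to the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, exactly as displayed in the record.
first_line = "ATGACCGACG ACTCCGGCGA TCCCGCCGCC GGCGATGCTT CCGATTCTGA CACCGGCGAC"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on the first line
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

To check the whole gene, pass the full multi-line sequence text to `clean_sequence` in place of `first_line`.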
 
Protein sequence
MTDDSGDPAA GDASDSDTGD APDSAAGNDS RADPDSQDND AAEELSLDRF HEALEAEERP 
VATASEVARR LGTTQAAARD ALAALVDRGD VDRLDVESDP IVFYPTDWGR LATRERVVAF
PSRREIVVDR PTQFTRARLS QFAHLVDTTG TEPGTRGYLY RIRQEDVWAA PFEDADGLIA
SLRSVLPRRF DHLEDWIRDQ WRRAHRFRLY THEDGYVVLQ AASESLMGNV ADQHLDDDHL
RAPISETEAW VNEDAVAAVK RALYDAGYPV EDDRDLDVGD PVDIDLTTDL RSYQETWVET
FLDARSGVYV GPPGSGKTVA AIATIAAIGG ETLILVPSRE LAGQWREELL EHSTVDPADI
GLYHGGQKEI RPVTIATYQI AGMDRHRALF DSRKWGLICF DEAHHITAPI FSRSAELQAK
HRLGLSATPV RETGSEEEIY TLIGRPIGAD WDELFEAGFV QEPEVEIRYV PWRDEMARNE
YASADGRERR RLAAENPAKI EEIRYLLAAH RDKKALVFIE YLDQGEAIAD ALGVPFISGE
TPHHERAELF RRFREEGTEG GEREGIGADG DDVDTLVVSR VGDEGIDLPN AELAIVASGL
GGSRRQGSQR AGRTMRPTGS ALVYVLATRG SSEEEFAQRQ MRHLARKGIR VRETNVAE
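
The protein sequence above can be related to the gene sequence by translating it codon by codon. The sketch below builds the codon-to-amino-acid mapping used by translation table 11 (its amino acid assignments are the same as the standard code) and translates the first 30 bases and the last displayed line of the gene. The function name `translate` is hypothetical, and the sketch does not handle the alternative start codons that table 11 permits; it is enough here because the gene begins with ATG. The last line can be translated on its own because the 1920 bases before it are a multiple of three.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest,
# matching the layout of the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T
    raises a KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final displayed line of the gene sequence.
n_terminus = translate("ATGACCGACG ACTCCGGCGA TCCCGCCGCC")
c_terminus = translate(
    "ATGCGCCACC TCGCGCGCAA GGGGATCCGG GTTCGGGAGA CGAACGTCGC GGAGTGA"
)
print(n_terminus)  # MTDDSGDPAA
print(c_terminus)  # MRHLARKGIRVRETNVAE
```

Both results match the start and end of the protein sequence listed in the record.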