Gene Hlac_3007 details

Gene Information

Locus tag: Hlac_3007
Symbol:
ID: 7398984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 264921
End bp: 266285
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 63%
IMG OID: 643706817
Product: restriction endonuclease
Protein accession: YP_002564439
Protein GI: 222475918
COG category: [V] Defense mechanisms
COG ID: [COG1715] Restriction endonuclease
TIGRFAM ID:
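
The coordinates and lengths in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in exactly one stop codon; neither is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 264921
end_bp = 266285


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1365 454
```

Under those assumptions the result agrees with the reported gene length (1365 bp) and protein length (454 aa).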


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGTAC TGGACGATCT CTCGGGGTTC GAGTTCGAGG ATGTGATCGA GGACGTGTTC 
CGTAACCTCG GCTACGAGAA CGTCCGCCAG GCCGACCGCA CGGCTGACGA GGGTCGCGAT
GTCCTTATGG AGGAGGTCGT CGACGGAACG CGGCGTGCGA TCATCGTCGA GTGTAAGCAC
ACGGGGACGG TCGGACGCCC CGTCGTCCAG AAGCTCCACT CCGCCATAGC GACCTTCGAC
TTCGACGGCC CCAAACGCGG GATGGTCGTC ACGACCGGCC GGTTTACGAA CCCTGCTCAG
GAGTACGCAA ACCGCCTCCA GCAAAACGAC GACCCACACG CAATCGAACT GCTCGATGGC
GAGGACCTCC GGGAGATCGC CGACGAGATC GGCCTCGACC TCTACAACGG CCGCATCGAG
ATTCTCTGCG ACGAGACGCT ACGTCCCTAC GATCCGGCCG CCGACGTCGA CGCGGCCGTC
GAGGTGGCAT TTCGCGACAT CGAGAACATC GAGAGCGCCG ACCTCCCGGA ACCACATTCG
GCGGTGACGT TCCGCCCAGT GGTCGCGGTC ACCGCGGACA CGAACGCCGT CTTCGAGACG
TCGGTGGGTG TCATCCACCG GATCAACGAC CGGACGCGGT TCGTCGTCCA CGCCGAACGC
GGGCAGCCGC AGGTCGTCGA CGAAGACGTC GGGACGCTGG TCACCGAGAA CCTCCATGCG
ACGGTCGATC TCGACGCCGA GCAGTTCGGA GCAGTGTTCG ACGACGTCGA GGAGAACCGG
TTCGGCCAGA CGCAGACCGA GTACAAGGAG TGGGCCGTCG AGCGGCTCCA GCAGCACCAC
ACGACGACGG TGACCTACAC CGGCGACAAC AACGTCACAT ACAACAAGAC CTGCGAGCCG
AACCGCTCGG ACATCTCCGT CCAGACGATC GAGCCGGTGT ATCTCCCCGA GGTTCGGCAC
ACCACTGACC TTCAGGAGTA CACCTATCCT TACGAGTACT ACGCAGCGGG TCCGTCCAGA
GTGACCGCCG AGGACGGGAT CCATCGGTGC GTCCACTGTG ACACGAGCGG CGTCGACGAG
CCGTACACGT ATTGTCCGAA CTGCGGGGCA ATCTCCTGCG ACAGTCATAG CAAGACCGAA
CGGCTTGAGC AGGAGCCGGT GTGTACGGGG TGTGCGGTCA CCGAGCGATT TGCGTTGAAG
ACGAAGTACT TCTACGACGA ACAGAACCTC AAGGCGTTCC GCGAGGAGTA CGCGGCGATG
CCCCTCCACG AGAAAGCGAT GGAGAACAGG CTACTGGCCG GAGGGAGTGT GGTCGTGGCG
CTTCTGGCGG TTATTGTCCT GCTCGCGGGT GGCGGCATCA TCTAA
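
The gene sequence is printed in space-separated groups of ten bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage for comparison with the reported 63%. The helper names are hypothetical, and only the first printed line is used as a sample, so the printed value describes that sample rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated groups."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, used only as a short sample.
# Paste all lines of the sequence to check the full-length GC content.
sample = "ATGGCTGTAC TGGACGATCT CTCGGGGTTC GAGTTCGAGG ATGTGATCGA GGACGTGTTC"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))
```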
 
Protein sequence
MAVLDDLSGF EFEDVIEDVF RNLGYENVRQ ADRTADEGRD VLMEEVVDGT RRAIIVECKH 
TGTVGRPVVQ KLHSAIATFD FDGPKRGMVV TTGRFTNPAQ EYANRLQQND DPHAIELLDG
EDLREIADEI GLDLYNGRIE ILCDETLRPY DPAADVDAAV EVAFRDIENI ESADLPEPHS
AVTFRPVVAV TADTNAVFET SVGVIHRIND RTRFVVHAER GQPQVVDEDV GTLVTENLHA
TVDLDAEQFG AVFDDVEENR FGQTQTEYKE WAVERLQQHH TTTVTYTGDN NVTYNKTCEP
NRSDISVQTI EPVYLPEVRH TTDLQEYTYP YEYYAAGPSR VTAEDGIHRC VHCDTSGVDE
PYTYCPNCGA ISCDSHSKTE RLEQEPVCTG CAVTERFALK TKYFYDEQNL KAFREEYAAM
PLHEKAMENR LLAGGSVVVA LLAVIVLLAG GGII
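
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal pure-Python version, not the tool used to generate this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it stops at the first stop codon. `CODON_TABLE` and `translate` are illustrative names, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1 up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGCTGTAC TGGACGATCT CTCGGGGTTC"))  # MAVLDDLSGF
```

Passing the full gene sequence to `translate` should yield a sequence that can be compared against the 454-residue protein listed above.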