Gene Hlac_3258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3258 
Symbol 
ID7398855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp518175 
End bp519539 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID643707048 
Productrestriction endonuclease 
Protein accessionYP_002564670 
Protein GI222476149 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTCC TGGACGATCT CTCCGGCTTC GAATTCGAGG ATGTGATCGA GGACGTCTTC 
CGGAATCTCG GATACGAAAA CGTGCGGCAA GCGACGAAAA CGGCTGATGA GGGACGGGAT
ATCTTGATGG AGGAAGTCGT CGACGGAATC CGCCGGGCTG TCGTCGTCGA GTGCAAGCAC
ACCGATTCAG TTGGTCGACC GGTCGTCCAG AAACTCCATT CCGCGATTGC GACGTTCGAG
TTCGACGGTC CGAAACGCGG TATGGTCGTG ACGACTGGTC GGTTCACCGG GCCAGCACAG
GAGTACGCCG ACAGGCTGAA ACAGAATGAC GATCCGTATC AGATCGAACT CACCGATGGC
CAGGACCTCC GGCGGATCGC CGACGAGATC GGCCTCGATC TCTATAATGG TCGCATCGAG
ATTCTCTGTG ACGAGACGCT CCGGCCATAC GATCCGGCAG CGTCCATCGA AGTACCAATC
AAAGAGGCCT TCCGTGATAT CGAAAACATC GACAGCACAG AAATTCCGGA GCCGCACACA
CAGGTCACGT TCAATCCAAT TGTCTCGGTC ACTGCTGACA CGAATGCGGT ATTCGAGACT
TCTGTAGGGG TTATCCACCG AATTAATGAG CGAACCAACT TTGTTGTCCA CGCCGGTCGC
GGTCATCCGT CGGTAGCCGA TGATGCCGTT GCAACCTTGG TCGCGGAGAA TCGTCACACG
ACGGTCGACC TCGATATTGA GCGGTTTCAG GAGGTCTTCG ACGAGGTCAG TGATCACCGG
TTCGGCCAGA CCGAAACCGA ATACAAAGAG TGGGCAGTCG ACCGGCTCCA GGAGTTCCAC
ACGACGACGG TGACTTACAC CGGTGGCAAC AACGTGACCT ACAACAAGAC CTGTGAGCCG
AACCTTTCGG ATATCTCAGT GCAGTCTATT GACCCAGTGT TCCTCCCGGA GGTTCGACAG
ACGACGCATC TGAAGGATTA CAGCCATCCC TACGAGTACT ACGCCGCAGG CCCCTCTCGG
GTTACGATTG AGGACGGTAT CCATCGCTGC GTCCACTGTG AGACACACGG CGTCGACGAA
ACCTACACCT ATTGCGCGAA CTGTGGCGCT GTTGCGTGTG GGAGTCACAT CGAAACTGAA
CGGCTGACCG GTGAGCCTGT CTGTACGGGT TGTGCGGTCA CCGACCGGTT CGCGCTCAAG
ACGAAGTACT TCTACGACGA GGAGAATCTC GAATCCTTCC GTACTGAATA CGCTGAAATG
CCGTTTTACA AGCAGGCTAT GGAGAACACG CTGCTGGCCG GAGGAAGTGT GATCATGACG
CTCTTGATTG TCGTTGGTCT GCTCATCATC GGCGGCATCA TTTAA
 
Protein sequence
MAVLDDLSGF EFEDVIEDVF RNLGYENVRQ ATKTADEGRD ILMEEVVDGI RRAVVVECKH 
TDSVGRPVVQ KLHSAIATFE FDGPKRGMVV TTGRFTGPAQ EYADRLKQND DPYQIELTDG
QDLRRIADEI GLDLYNGRIE ILCDETLRPY DPAASIEVPI KEAFRDIENI DSTEIPEPHT
QVTFNPIVSV TADTNAVFET SVGVIHRINE RTNFVVHAGR GHPSVADDAV ATLVAENRHT
TVDLDIERFQ EVFDEVSDHR FGQTETEYKE WAVDRLQEFH TTTVTYTGGN NVTYNKTCEP
NLSDISVQSI DPVFLPEVRQ TTHLKDYSHP YEYYAAGPSR VTIEDGIHRC VHCETHGVDE
TYTYCANCGA VACGSHIETE RLTGEPVCTG CAVTDRFALK TKYFYDEENL ESFRTEYAEM
PFYKQAMENT LLAGGSVIMT LLIVVGLLII GGII