Gene Hlac_2898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2898 
Symbol 
ID7399132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp155068 
End bp156432 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content62% 
IMG OID643706716 
Productrestriction endonuclease 
Protein accessionYP_002564342 
Protein GI222475821 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTAC TGGACGATCT CTCAGGGTTC GAGTTCGAGG ACGTGATGGA GGACGTCTTC 
CGCAACCTCG GCTTCGAGAA CGTCCGCCAG GCAGAGCGGA CGGCCGACGA GGGCCGCGAC
GTCATTATGG AGGAGGTCGT CGACGGCACC CGGCGCGCGA TCATCGTGGA GTGCAAGCAC
ACAGGGACGG TCGGGCGCCC GGTCGTCCAG AAGCTCCACT CGGCGATCGC GACGTTCGAC
TTCGACGGTC CGAAACGCGG GATGGTCGTC ACGACCGGCC GGTTCACGAA CCCTGCTCAA
GAGTACGCTA ATCGACTCCA GCAGAACGAC GACCCCCATC CAATCGAGTT GCTTGATGGC
AAGGACCTCC GGGAGATCGC TGACGAGATC GGTCTCGACC TCTACAACGG GCGCATCGAG
ATTCTCTGCG ACGATACCCT ACGCCCGTAC GACCCGGCCG CCGACGTCGA CGCGGCCGTC
GCGGAGGCGT TCCGCGACAT CGAGAACACC GAGAGCGCCG ATCTCCCGGT GGCGCACTCG
CAGGTGACGT TCCGTCCGGT GGTTGCGGTC ACCGCGGACA CGAACGCCGT CTTCGAGACG
TCGGTGGGCG TCATCCACCG GATCAACGAC CGCACACAGT TCGTTGTCCA CGCTGAACGC
GGCCATCCGA AGGTGGCCGA CGACGATGTC GCGACGCTGG TCACCGAGAA CTTCCACGCG
ACGGTTCCCC TCGATACTGG GCAGTTTACC GAGGTATTCG ACGACGTCGA GGAACGACGG
TTCGGCCAGA CCCAAACGGA GTACAAGGAG TGGGCTGTTG ACCGTCTTCA GGACTACCAC
ACGACGACGG TGACCTACAC TGGCGACAAC AACGTCACCT ACAACAAGAC GTGTGAGCCG
AATCTCTCGG ACATCTCTGT GCAATCGATC GAGCCGGTGT ATCTCCCCGA CATTCGGCAC
ACGACCGACC TCCAGGAGTA CACCTACCCC TACGAGTACT ACGCGGCAGG TCCGTCACGA
GTGACCGCCG AAGACGGCAT TCACCGCTGC GTCCATTGTG ACACGAGCGG CATCGATGAG
ACGTACACCT ACTGTTCGAA CTGCGGGGCC ATCGCCTGCT CCAGTCACAC CAAAACGGAG
CGGCTGGAAG GTGAGCCGGT CTGTACGGGC TGTGCAGTCA CCGACCGGTT CGCGCTGAAG
ACGAAGTACT TCTACGACGA GGAGAATCTC GAGGCGTTCA GCGAGGAGTA CGCCGATATG
CCGCTTCACG AAAAGGCGAT GGAGAACAAG TGGCTGGCCG GGGGAAGCGT TGTTGCGACG
GTGCTGCTTG TCGTCGGACT ACTCGTCATC GGCGGCATCA TCTGA
 
Protein sequence
MAVLDDLSGF EFEDVMEDVF RNLGFENVRQ AERTADEGRD VIMEEVVDGT RRAIIVECKH 
TGTVGRPVVQ KLHSAIATFD FDGPKRGMVV TTGRFTNPAQ EYANRLQQND DPHPIELLDG
KDLREIADEI GLDLYNGRIE ILCDDTLRPY DPAADVDAAV AEAFRDIENT ESADLPVAHS
QVTFRPVVAV TADTNAVFET SVGVIHRIND RTQFVVHAER GHPKVADDDV ATLVTENFHA
TVPLDTGQFT EVFDDVEERR FGQTQTEYKE WAVDRLQDYH TTTVTYTGDN NVTYNKTCEP
NLSDISVQSI EPVYLPDIRH TTDLQEYTYP YEYYAAGPSR VTAEDGIHRC VHCDTSGIDE
TYTYCSNCGA IACSSHTKTE RLEGEPVCTG CAVTDRFALK TKYFYDEENL EAFSEEYADM
PLHEKAMENK WLAGGSVVAT VLLVVGLLVI GGII