Gene Hlac_3127 details

Gene Information

Locus tag: Hlac_3127
Symbol:
ID: 7399258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 371056
End bp: 372420
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 63%
IMG OID: 643706929
Product: restriction endonuclease
Protein accession: YP_002564551
Protein GI: 222476030
COG category: [V] Defense mechanisms
COG ID: [COG1715] Restriction endonuclease
TIGRFAM ID:
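
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 371056
END_BP = 372420
GENE_LENGTH_BP = 1365
PROTEIN_LENGTH_AA = 454


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons check the record's fields against each other.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the listed gene length follows from the start and end positions, and the listed protein length follows from the gene length minus the stop codon.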


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGTAC TGGACGATCT CTCAGGGTTC GAGTTCGAGG ACGTGATAGA GGACGTCTTC 
CGCAATCTCG GCTACGAGAA CGTCCGCCAG GCCGACCGCA CGGCTGACGA GGGTCGCGAC
GTCATCATGG AGGAGGTCGT CGACGGCACG CGGCGTGCGA TCATCGTCGA GTGCAAGCAC
ACGGGGACGG TCGGGCGCCC AGTCGTCCAG AAGCTCCACT CGGCGACCGC GACGTTCGAC
TTCGACGGCC CCAAACGCGG AATGGTCGTC ACGACCGGCC GGTTTACGAA CCCTGCTCGG
GAGTACGCCG ACCGCCTCCA ACAAAACGAT GATCCCTATC CTATCGAGCT GCTCGACGGC
GAGGACCTCC GGGAGATCGC CGACGAGATC GGCCTCGACC TCTACAACGG TCGCATCGAG
ATTCTCTGCG ACGAGACACT CCGCCCCTAT GACCCGGCCG CCGACGTCGA CGCACCCGTC
GAAGAGGCGT TCCACGACAT CGTCAACATC GAGGCCGCTG AACTCCCAGA ACCACATTCG
GCGGTGACGT TCCGCCCGGT GGTCGCGGTC ACCGCCGACA CGAACGCTGT CTTCGAGACG
TCGGTGGGCG TCATCCACCG GATCAACGAC CGGACTCGAT TCGTTGCCCA CGCCGAACGC
GGGCAGCCGC AGGTCGTCAA CGAGGACGTC ACGACACTGG TCACCGAGAA CCTCCACGCG
ACGGTCGAAC TCGATACCGA GCAGTTCGCG GAGGTGTTCG ACGACGTCGA GGAGAACCGG
TTCGGCCAGA CGCAGACCGA GTACAAAGAG TGGGCCGTCG AGCGGCTCCA GCAGCACCAC
ACGACGACGG TCACCTACAC CGGCGACAAC AACGTCACCT ACAACAAGAC CTGCGAGCCG
AACCGCTCGG ATATCTCGGT CCAGTCGATC GAACCGGTGT ACCTTCCTGA GGTTCGACAG
ACGACGGAAC TCCAGGAGTA CACCTACCCC TACGAGTACT ACGCGGCAGG GCCGTCCAGA
GTAACAGACG AGGACGGCAT CCATCGGTGC GTCCACTGTG ACACGAGCGG CGTCGATGAG
ACGTACACCT ACTGTCCGAA CTGCGGGGCC ATCGCCTGCA ACAGCCACAC CAAAACGGAA
CGGTTGGAAG GCGAGCCGGT TTGTACTGGT TGTGCGGTCA CCGAACGGTT CGCGCTGAAG
ATGAAGTACT TCTACGACGA GGACAACCTC GAGGCGTTCC GCGAGGAATA CGCCGCAATG
CCACTCCACG AGAAGGCGAT GGAGAACAAG TTACTCGCTG GAGGGAGCGT GGTCGCGACG
CTGCTGCTCG TCGTCGGCCT GCTCGTCATC GGCGGCATCA TTTAG
 
Protein sequence
MAVLDDLSGF EFEDVIEDVF RNLGYENVRQ ADRTADEGRD VIMEEVVDGT RRAIIVECKH 
TGTVGRPVVQ KLHSATATFD FDGPKRGMVV TTGRFTNPAR EYADRLQQND DPYPIELLDG
EDLREIADEI GLDLYNGRIE ILCDETLRPY DPAADVDAPV EEAFHDIVNI EAAELPEPHS
AVTFRPVVAV TADTNAVFET SVGVIHRIND RTRFVAHAER GQPQVVNEDV TTLVTENLHA
TVELDTEQFA EVFDDVEENR FGQTQTEYKE WAVERLQQHH TTTVTYTGDN NVTYNKTCEP
NRSDISVQSI EPVYLPEVRQ TTELQEYTYP YEYYAAGPSR VTDEDGIHRC VHCDTSGVDE
TYTYCPNCGA IACNSHTKTE RLEGEPVCTG CAVTERFALK MKYFYDEDNL EAFREEYAAM
PLHEKAMENK LLAGGSVVAT LLLVVGLLVI GGII