Gene Hlac_3219 details

Gene Information

Locus tag: Hlac_3219
Symbol:
ID: 7399345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 461000
End bp: 464896
Gene Length: 3897 bp
Protein Length: 1298 aa
Translation table: 11
GC content: 52%
IMG OID: 643707016
Product: hypothetical protein
Protein accession: YP_002564638
Protein GI: 222476117
COG category:
COG ID:
TIGRFAM ID:
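
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are inclusive, 1-based positions and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 461000
END_BP = 464896
GENE_LENGTH_BP = 3897
PROTEIN_LENGTH_AA = 1298


def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 3897
print(protein_length(GENE_LENGTH_BP))   # 1298
```

Both results agree with the Gene Length and Protein Length fields listed in the record.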


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid Coverage information
Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCCTC AATCAGCCAC CTCAGCCACC GAATTTGTCG AGCAGGCATT CGAGACGACC 
GTCGACGATC ACGCACGGGC GGATCTTCTA CAGATCATCG ACGACGCCGT GTCGACGCTC
CGTAAGAAGA TGAAGAAGGA CGAGACGGTA GAGCGGATGC TACGGAGCGG CGGTGGCGGC
TCTTACGTCT TGAAAAGTAG CATGACCCGT GACGGGTTAC AACCGGAACC GTTCACTCAA
AGTGCCGTCA TCGAGCCGCT GTTGGACAAA CTCGGCTACG ACTACGACAC GGAGGCGGGT
GGCCTCTCAG GTGGCCGGAC GGAGGTGGCC GACTACACGA TCTCGCTACG CGACTACGAC
GATATCGACT CTACCCGTCT ATTGATCGAG GCCGAGCCGA TCAACAAAGA CCTCGATTCT
CGAAAGCACG GTGCGGGACA GGTTCGCAGC TGGCTCAGTC AGCGAGAGTT TGAGTCCGAC
TTTGGCTTCG CCACGGACGG ACTTCGCTGG ATGTTCATTC GGTACGACCC GGATTCCTAC
ACCCACAACG TCATTGAGGA GGTCGATCTA CAGCCAGTGT TTGTCGCTCT ATTTGAGAAC
CAAGTTGGCC CGCGAGAGCC GCTCGAGGAG GCTGTTTTCG ATGCCGATCT TGAACGAGTT
GACACGCTGC TACGGACCTT CGAGTTTGCT AACTTCCGGT CGATTGCAGG TGAGGCCCGG
CAGGTCATCA AGCAGAAACA GGAGGAGATC ACCGACGAGT TTTACGACGA CTATATCCGG
TACGTCTTCG GGATTGTCAA CGAGTCCGAG GAGACGCCCA GATCGCTCAC CCACGACGGC
GTGATCGCAC CGGAGGGAGC AACCGCCGAC GACACGCGGC TGTTTTCGGT CGAGATGATG
AACCGGCTTA TCTTCATCAA ATTCCTTGAG GACAAGGGAA TTGTCCGTCC GGACCTCCTA
CAGAGCATCC TTGACACGTA TGAGGACGGC CTTTACACGG ATTCTCTCTA CCAGCAGTTC
ATTCAGCCGT TGTTCTACGA CGTACTCAAC AAACGCCCGG ACAAACGCTC GCCACAGATA
CAGGATATCG AGCTGTTCGC TGATATTCCG TACCTGAATG GTGGGTTGTT CCGGCCGTCG
ATACAACACG ACGGCAGTGA CGACCGTGAG CAGTTCAAGG AGGCCGATTT TGACGTTCGG
AATAGTATTC TCAGGTCGAT TCTTGAATTA CTTGAAAGCT ACAGCTTCTC CACGGATGGA
TCAGTGACCG ACCTCGATCC GAGCGTGCTG GGTAACGTCT TCGAGAAGAC GATCAACTAC
ATTACAGCAG ACAACGCCGA CCAGAACAAG GAACTCGGAG CCTACTACAC GCCGAAGGAG
ATTACCCGGT TCAGTGCAGA GCGAACGGTT CGACCGGCTC TATTCGACCG ACTCAAACAG
GTGGTAATTG AGGAACGCGG ATGGCCTGAA GCCGAGTTGG AGAACTACGA TACCGTCTAC
GAGTTGATCG AATCGCTCCC GGCGTCGATG GACCTGATTA CCACATTACT CGGTGAGGTC
GACAACTTTC GGGTGGTCGA CCCCGCCTGT GGGAGTGGGC ACTTTCTAAC CTCAGTGTTG
GAGGAGATCG TCGGCGTTCG ACGAGCGTTG TGGGCACATA CCGATTCGTA CCCGCACGAA
CAGGCACTCA AGAAGACGAC TGTCCAGCAC AACATATACG GCGTCGACAT CGTCGGCCCG
GCCGTCGAGA TCGCCAAGCT CCGGTGTTGG CTGTCGGTGA TCGCTGAACT GCAACAAGAG
GATCTAGAGT CGATGGATCA AGAGGAGTTG GCCCTACCTA ACATCGCGTT CAATCTCCGA
CAGGGCAACA GCCTGATCGG GTACACGGGG TTCCCAGAGA CGACCGAAGA CGGCGACGGT
TACACGCTCG ATAGCTTCAA CGAGGATACG GTCAGGACAC GCTACGAGAA CATCATCGAC
GAGATCACAG CCTACGAAGA GGCGATTGAG AGCGAGCAGG CCGAACAACA CCGAAAAGAG
GCCAATCGTC TACTCGAAAA TGCAAGAGAT GAGCTGGTCG ATGATGTGAA AGACGAATTC
GTGGCGGCAG GTGTCGACGA TATCACGCCC GAAAAGGTCG AAACATTTGA TCCATTCCAC
TGGGTGCTTG AGTACGCAGA AGTCTACTCT GATGGTGGTT TCGACGTGAT TGTAGGAAAT
CCGCCGTGGG ATAGATTATC ACCACGTCGA GATGACTACT TCTCACGGTT TGATTCGGCG
TTCAGAACGC TGATGCCGGA TGAAAAACAG GAACGACAAG AAGAATTACT GACTGATCCA
GAGATCGCGG AAGGATGGGA GGAATACAAA CGAGAAACTG AAATCTTTGC TACTTACTTT
AAAAATAGTG ACTCCTATGA ACTCCAGCAA CCGAAGGTAG CAGGAAGAAC GGCAGCAACT
GAGAGCGATC TTTCTGCACT GTTTTTGGAG CGTGTTTTTC AGATAGCAAG AGACGACGGG
TATCTCGCAC AAATTCTTCC AGGTGCGATT TTTAATGGCC TGTCGACCAA GGATCTCCGA
CTACACCTTC TTGATGAAAC GAGCATCGAT TCGCTAGTCA CGTTTGAGAA TAATGGAATA
TTCTCTGATA TTGATAATAG ATACAATTTT GGAGTGGTGA CTTTTGAGAA TCAAGGGGAA
ACAACCGATG TAAGAGGGAT CTTCAAACAG ACTGATGTAG ATATACTTCA GAATTTTGAG
GATCAAGCCC TGTCGCTCTC CCGGCGTGTT CTACGCAATT ATTCCCCAGA AGCCGCGATA
TTCCCATATC TTCAGTCCCA GCAAGAAGTT GACGTTCTCG ACACAATCCT GCAACACCCA
CCAATTTCAG AGGAAATTGG GTCATCGTGG TACGTAGAAC CTTACAGAGA GTTGGATCGA
GGTAACGATG TCGACCGGTT TGTAGAGGAT GAGGAGGAAG GCGATTATCC TGTTCTCGGA
GGAAGTAATA TATTCCAATT TGCATATAGT GACGCCTATT TCGGTGTTGA GTCACCGAAA
TTCTGGAGTG TAGATGAAGA CAAAGATCCT GAACTAAGCG CGAAGAAACG AATACGTGGA
AAGAATTTGC GGAAACTGAA ACGTGCGGTG TATGATGCCT TCGACGGCAC TGGTTCGCAA
GTTGGGTTTG TAAACGACCT GCTTGAAAAA CGACGAAGCA AAGAACTCTC TGACGAGGAC
GTTCTTCTTG ACTGTACAGA GTACCGTATC GTATACCGAG ATATTGCGAG GTCGACGGAC
GAACGAACCA TGATTTCGAC TGTCATTCCG AAAGGTGTCG TCTGTCACGA CAAAGCCCCA
CAACTACGTC CTTACAGTAT TGAACCGAGC GAGAAGGACC TCTCTGAAGA CACGCTACAC
AGTGCCTACA AACGCATTTA TAGCGATGAA GAACTGTTTG TCGCCACTGG ATTACTAAAC
AGTCTTCCGT TCGACTTTCT GATGAGGACT AAAATAGATT CGACTGTTGT ATTCTATAAA
TTGAAGGAGT CACAAGCACC CCGACTCACC AAAGGTGATG AATGGTTTGA GTACATTTGG
CGACGATCTG CTCGGCTCAA CTGCTATGGA GACGAATTTG CAGAAATGCG AGATCGACTG
GACGGGATCG AGCCCGTTGT CGACGTTACC GAGCGTCGAC GGGTACAGGC TGAACTCGAT
GCAGCGGCCT TCCACGCCTA TGGCCTCAAT CACGAGCAGA CGGCCTTCGT ACTGGGTGAC
TTCCACCGAG TACAGAGCCC CCGGCTCATG GATGAAGACT ACTTCCAGCT GGTGCTTGAG
AAGTACGAGC AGTTGGCCGA GGTAGAGGTC GAACAGGTCC AAGAGTCGAC GCAGTAA
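
The gene sequence is printed in blocks of ten bases, so it has to be joined before any computation. As a minimal sketch, the helper below strips the whitespace and computes the G+C fraction of a sequence. The function names are illustrative, and how the record rounds its GC content figure is not stated, so the full-sequence result is not asserted here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Toy input, not taken from the record: 6 of the 8 bases are G or C.
print(f"{gc_content('ATGC GGCC'):.0%}")  # 75%
```

Passing the full gene sequence text to `gc_content` gives a value that can be compared against the GC content field.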
 
Protein sequence
MPPQSATSAT EFVEQAFETT VDDHARADLL QIIDDAVSTL RKKMKKDETV ERMLRSGGGG 
SYVLKSSMTR DGLQPEPFTQ SAVIEPLLDK LGYDYDTEAG GLSGGRTEVA DYTISLRDYD
DIDSTRLLIE AEPINKDLDS RKHGAGQVRS WLSQREFESD FGFATDGLRW MFIRYDPDSY
THNVIEEVDL QPVFVALFEN QVGPREPLEE AVFDADLERV DTLLRTFEFA NFRSIAGEAR
QVIKQKQEEI TDEFYDDYIR YVFGIVNESE ETPRSLTHDG VIAPEGATAD DTRLFSVEMM
NRLIFIKFLE DKGIVRPDLL QSILDTYEDG LYTDSLYQQF IQPLFYDVLN KRPDKRSPQI
QDIELFADIP YLNGGLFRPS IQHDGSDDRE QFKEADFDVR NSILRSILEL LESYSFSTDG
SVTDLDPSVL GNVFEKTINY ITADNADQNK ELGAYYTPKE ITRFSAERTV RPALFDRLKQ
VVIEERGWPE AELENYDTVY ELIESLPASM DLITTLLGEV DNFRVVDPAC GSGHFLTSVL
EEIVGVRRAL WAHTDSYPHE QALKKTTVQH NIYGVDIVGP AVEIAKLRCW LSVIAELQQE
DLESMDQEEL ALPNIAFNLR QGNSLIGYTG FPETTEDGDG YTLDSFNEDT VRTRYENIID
EITAYEEAIE SEQAEQHRKE ANRLLENARD ELVDDVKDEF VAAGVDDITP EKVETFDPFH
WVLEYAEVYS DGGFDVIVGN PPWDRLSPRR DDYFSRFDSA FRTLMPDEKQ ERQEELLTDP
EIAEGWEEYK RETEIFATYF KNSDSYELQQ PKVAGRTAAT ESDLSALFLE RVFQIARDDG
YLAQILPGAI FNGLSTKDLR LHLLDETSID SLVTFENNGI FSDIDNRYNF GVVTFENQGE
TTDVRGIFKQ TDVDILQNFE DQALSLSRRV LRNYSPEAAI FPYLQSQQEV DVLDTILQHP
PISEEIGSSW YVEPYRELDR GNDVDRFVED EEEGDYPVLG GSNIFQFAYS DAYFGVESPK
FWSVDEDKDP ELSAKKRIRG KNLRKLKRAV YDAFDGTGSQ VGFVNDLLEK RRSKELSDED
VLLDCTEYRI VYRDIARSTD ERTMISTVIP KGVVCHDKAP QLRPYSIEPS EKDLSEDTLH
SAYKRIYSDE ELFVATGLLN SLPFDFLMRT KIDSTVVFYK LKESQAPRLT KGDEWFEYIW
RRSARLNCYG DEFAEMRDRL DGIEPVVDVT ERRRVQAELD AAAFHAYGLN HEQTAFVLGD
FHRVQSPRLM DEDYFQLVLE KYEQLAEVEV EQVQESTQ
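
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the codon assignments of the standard genetic code, which translation table 11 shares (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    dna = "".join(text.split()).upper()  # drop the ten-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
print(translate("ATGCCCCCTC AATCAGCCAC CTCAGCCACC"))  # MPPQSATSAT
```

The output matches the first ten residues of the protein sequence. The final codons of the gene, CAG TAA, likewise give Q followed by a stop, matching the last residue shown.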