Gene Hlac_3237 details

Gene Information

Locus tag: Hlac_3237
Symbol:
ID: 7398839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012028
Strand:
Start bp: 499225
End bp: 502701
Gene Length: 3477 bp
Protein Length: 1158 aa
Translation table: 11
GC content: 57%
IMG OID: 643707032
Product: hypothetical protein
Protein accession: YP_002564654
Protein GI: 222476133
COG category: [S] Function unknown
COG ID: [COG4717] Uncharacterized conserved protein
TIGRFAM ID:
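
The coordinate and length fields above are consistent with one another. A minimal sketch of the arithmetic follows. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon. Both are assumptions about this record's conventions; neither is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 499225
end_bp = 502701

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3477
print(protein_length)  # 1158
```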


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAGGG ACCCGATCTG CTTCGAGGAA ATCCAGATCA TCCAAGCTCC TGGGTTCGAA 
ACTGGCGGCT TCTCTGTCGA CGACTTGTGC TCCGGTATCA ATGTCGTTCA TGGACCGAAC
GCGGCAGGAA AGACGACGCT CGCTGAGTCG CTTGAATGGC TTTGCTGGCC CGAGATCGCC
GACGAGCGTG CATCCCTCGT GGGGCAGCTC TCACTGAACG GTGAAGACTG GCGAGTCGAA
GTCGACAATG GCCGCACGAG TTACCAACGT GACGGTCAAG AGGCGAACGG TCCGAGCCTT
CCTCCAGCCG ATCAACGCGA CCGTTACCGT CTCTCCCTCC ACGACCTACT CCAGCGAGAC
AACAACAACG AATCTTTCGT CGAGATAATC GAGCGAGAAT CAGCTGGTGG CTACGATCTT
TCTACAGCAT ACGACGAGCT CGGGTATTCG GATTCCCCGA GTCGGGCAAA CAGGAATGTC
GTCCAGAAAG CCAAGGGAGC GATTCAGGAG TTACGTGAGG CGCGGACCGA CGTTTCCGAA
CTACGTCAAG AGCAGAACGA ACTCTCACGG TTACGTGGTG AACTGGAGGC AGCGCGTCAA
GCACAAGAGC GATCCGAACT GCTCGAGCAG GCGATCGACT ACGCACAGGC GAGAAACCAG
CTGGAGCAGG CTGAATCGAG GCTCGACGAG TTCCCCGATA TCTTAGACCA GGTCGACGGA
GACGAAATCG AGCGTGTTCG ATCCCTCGAA GATGACATCG ACGAGTGGAC CGAGAAAAAA
GACGAGGCCG GGGAAACGGA AACGGACGCT CAAGAACGGC TTGATGAGGC CGACCTTCCT
GAAGAGGGGC TCCCGACAGG CCGTATCGAC CATCTGAAAG AGCTTCGCGA TGACCTTGAT
TCTGCGGAGG ACAGGAAGCG CGATCTCCAG GGAGACTTGG CTGACGCCCA ACGGCAGCGA
GAAACTGCCC GAGATGACAT TCCCCTAGAT GTCGACACCG GGGACCTCGT GGACTTGGAA
CCCGTCACGT GGAAGAACGT CTCGAAGTTC GCACGAGAGG CCGAGGAACT CCAGTCCGAG
CGTGAAACCC GGGAGGCCGT CCAGCGACTA CTGGAGGACG GTGAGCACCC TGAGTCCGAT
TTACCCACGC TCCAACGCGC GAGTCAGTCG CTGGAGGAGT GGCTGGCTGC GTCCGTTTCT
ACGGAGTCGA ACGACGGTTC AGAGGCATTC CGAATCGCTG TGTTCTCGGC CGTTTCTCTC
GCCTCGACGG GCATCGCGCT GGGTCTACTG GTGCATCCAC TGCTATTCTC CATCCTGCTC
GTCGCTGCTG GTATCTTCTG GTATGGGCTG CGCGCTCGCT CACAGTCCAA AGACGGAGGT
AATTCACGGG ACTCGCATCG CAAGTCGTTC GAGAAAACCG GTCTGAACCC GCCGGCGAGG
TGGACCGAAG ACGAGGTTCG GTCTCGTCTG ATCGGACTGT ACGACGCAAT CGCAGCGCAC
AAACTGGCCG AGCGACGGTC CGAATGGCGC GATAGCCTAG CGACGGATTC AGACACGCTC
GAACAGAAAG AGCAGGATCT GGAGGAAACG CGTGCCAAAC TCCAAGACCA GTTGGGCGCC
GCGCCAGATG CGTCTGATGT CGAACTCGCC GTGATCTCCA AGCGAGTGCT CGACTGGCAG
GAAGCCCACG ACGAAATTGA GGGGATTCAA GAGAGCATCG AAACCGTCGA CGACCAAATC
AAGACCGCCC GCGAAGAACT CCAAGCGAAA CTCGATCCCT ACGGCTACGA CGATGTCGAG
AGGTCTGGTG AGGCGACCGA AGCAATCCGA AACCTCGAGA ACCGCGAACA GCAGCGCGAA
GCCGCACAGC GGGACCTCGA CCAAGCTACC GAGACGATTC AGGAAGCGAC CGAGAAAATA
GGTGCGCTGG CGGACGAACG CGACGAAATC TTTGCGGATC TGGATCTCGA CTCCGATGAC
CACGATAGGT TGGAGGGACT CTGTGAGCAG GTCGAAGCAT ACGAATCGGC TGCGGAGGAC
GTACGAGAAG CCAACATCCG GGCGAACACG GAAGCCGAAG AACTCGAAAG CTACCCGGGC
TTCGAACCGG ACCTCAAGAA GCAAGAGATT GCTGACCTCA GAGAAGACCT CCGTGAGGCG
GAACGAATTG CTGAGGATTT CGACGACCTG CAGTCACGGC TTGCCGATAT CAAGGCAGAA
ATCAGGCAGG CAAAATCCGA CGACCAAGTT GAAAATGCGC TTGCGGAGCG TGATCGGGCG
CTCGATGATC TGAAAGACCA ACTTGAGGAC GACTGTGCTG CGATGGTCGG CGACGTACTG
GTGGATCATG TTCAGGAGGC GACGATGGAG ACTAGTCGAC CGGACGTTTT CGAGCGTGCT
CGTGAAATCC TGACGACGAT CACTCGCGGC CGGTATCGAT TGGACTTCGA TGAGGCCGAA
GCCGAATTCC GTGTGTTCGA CGAAACCAAA AAGAAGGGGC TCGCACTTGA CGAGCTTTCG
AGCGGCACTC GGGTCCAAGT CCTGCTCGCT GTACGAATCG CCTTCGTCGA ACAGCAGGAA
CAGGGAGTTC AGATTCCACT TCTCCTCGAC GAGACGCTCG CCAACACCGA CGACCGAAGA
GCGAAGAGGA TTATCGAGTC AATGATCGAA CTCGCTCGAA ACGGTCGGCA GGTTTTCTAT
TTCACTGCAC AGGGCGACGA AGTGGCAAAG TGGACTGCTG CACTGGAGAG TACAAACGGT
GTCGACCACG AAATCGTCGA CCTCGCAACG GTTCGTGATG TTGACGACAC CGTCCATATT
CCCGATACAG ACTCTGTCGA ATCACACACT CCGCAGGCCC CCAGCCCCGA CAGTCACGAC
CATTCCTCGT ATGGGGACGA GCTCAAGGTA GACTCGTTCA ATCCGCACCG TGGGGTCGGG
ACGGCGCACC TGTGGTACGT GGTCGACGAT GTCGAAACCC TCCATCAACT CTTGGAGCTC
GGGATCGAAC ACTGGGGACA GATGAATAAC CTGCTCCAGT GGGGCAACGG AGACCTCTCC
TCAGTTGAGT CCGACCAGGT AACGGTTGCC GAAGAGAATT CTGCAGCGCT GAATGAGTTC
GTCGCCGCCT GGAAGGTGGG TCGTGGTGAG CCCGTTGATC GGGAGGTTCT CGAAGCTTCT
GGCGCTGTAA GTAGTAATTT CATCGACGAG GTCACTGCCC TTGCCGAATC GGTCAACGGG
GATGGGAGAA AAATCGTTGA AGCCCTGCAT AATGGCGAAG TGAACCGCTT CCGTAGCGGG
AAAGCGAATG AATTGGAGAC GTATCTCGAA GAGAACGGGT ACATCGAGCC CCGTGATACG
CTGGATCAAG GCCAGATACG AGCCCGAATC ATCGAGCGCT TCGTTGATGA AGGTGTCTCT
CCCGAAGAAG CCAAAGATAG GACTGAGAAC CTGCTTTCGC GCATGAACAA AAACTGA
 
Protein sequence
MIRDPICFEE IQIIQAPGFE TGGFSVDDLC SGINVVHGPN AAGKTTLAES LEWLCWPEIA 
DERASLVGQL SLNGEDWRVE VDNGRTSYQR DGQEANGPSL PPADQRDRYR LSLHDLLQRD
NNNESFVEII ERESAGGYDL STAYDELGYS DSPSRANRNV VQKAKGAIQE LREARTDVSE
LRQEQNELSR LRGELEAARQ AQERSELLEQ AIDYAQARNQ LEQAESRLDE FPDILDQVDG
DEIERVRSLE DDIDEWTEKK DEAGETETDA QERLDEADLP EEGLPTGRID HLKELRDDLD
SAEDRKRDLQ GDLADAQRQR ETARDDIPLD VDTGDLVDLE PVTWKNVSKF AREAEELQSE
RETREAVQRL LEDGEHPESD LPTLQRASQS LEEWLAASVS TESNDGSEAF RIAVFSAVSL
ASTGIALGLL VHPLLFSILL VAAGIFWYGL RARSQSKDGG NSRDSHRKSF EKTGLNPPAR
WTEDEVRSRL IGLYDAIAAH KLAERRSEWR DSLATDSDTL EQKEQDLEET RAKLQDQLGA
APDASDVELA VISKRVLDWQ EAHDEIEGIQ ESIETVDDQI KTAREELQAK LDPYGYDDVE
RSGEATEAIR NLENREQQRE AAQRDLDQAT ETIQEATEKI GALADERDEI FADLDLDSDD
HDRLEGLCEQ VEAYESAAED VREANIRANT EAEELESYPG FEPDLKKQEI ADLREDLREA
ERIAEDFDDL QSRLADIKAE IRQAKSDDQV ENALAERDRA LDDLKDQLED DCAAMVGDVL
VDHVQEATME TSRPDVFERA REILTTITRG RYRLDFDEAE AEFRVFDETK KKGLALDELS
SGTRVQVLLA VRIAFVEQQE QGVQIPLLLD ETLANTDDRR AKRIIESMIE LARNGRQVFY
FTAQGDEVAK WTAALESTNG VDHEIVDLAT VRDVDDTVHI PDTDSVESHT PQAPSPDSHD
HSSYGDELKV DSFNPHRGVG TAHLWYVVDD VETLHQLLEL GIEHWGQMNN LLQWGNGDLS
SVESDQVTVA EENSAALNEF VAAWKVGRGE PVDREVLEAS GAVSSNFIDE VTALAESVNG
DGRKIVEALH NGEVNRFRSG KANELETYLE ENGYIEPRDT LDQGQIRARI IERFVDEGVS
PEEAKDRTEN LLSRMNKN
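
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to clean such a block, compute GC content, and translate DNA to protein. The function names `clean_sequence`, `gc_percent`, and `translate` are hypothetical helpers introduced for illustration. The codon table is the standard amino acid assignment, which translation table 11 is assumed to share for elongation codons.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to match table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = clean_sequence("ATGATCAGGG ACCCGATCTG CTTCGAGGAA")
print(translate(first_blocks))  # MIRDPICFEE
```

The translated prefix matches the first ten residues of the protein sequence listed above.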