Gene Hlac_3627 details


Gene Information

Locus tag: Hlac_3627
Symbol:
ID: 7402539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 383592
End bp: 386888
Gene Length: 3297 bp
Protein Length: 1098 aa
Translation table: 11
GC content: 49%
IMG OID: 643710162
Product: hypothetical protein
Protein accession: YP_002567728
Protein GI: 222481492
COG category:
COG ID:
TIGRFAM ID:
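The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(383592, 386888)
print(length)                  # 3297
print(protein_length(length))  # 1098
```

Under those assumptions both results agree with the Gene Length and Protein Length values listed in the record.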


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCTCG ATTTTCTCGC CACACGTATG TGGCGACCTT GGTCCCCGGA CCACGATTAT 
TTTGAGATAG AAGGATTTGA TCTTGGCAAT GGTTCCAATC AGCTAGAAAT TGTCCTTGCT
CATTCTGAAG GTCGTCCTCG AAAAGATGAG ATGCGTCGTA TCTGGAAGGA TCGCCGGGGT
GGACGCCCGA ATCCTGTTCT TGTTGTTACT ACATACGATA CTGATTCTGT TGCGCTCTGC
GGACCATCTG GTGATTCTCC ACCAGTTTAT CACGACGTTG ATGTCGGAGT TGCTGAGCGA
ATTGTCGATG TTGCCCTTGA GAAACCGGAC CGTTTTGCGG CACATCGTTT TCTTGGAGAA
GTACTGGATC AAATAACCGA CGACCTCGTC GGTCTACGTA ACCAGGGTCT CCTCTCAACA
CACGAGCTTC GTGTCGGCGT TCCGCACCGA CAAGACTGGG ACGATGCTGC GACCAAAGCA
GATAATGCCC TCGGGGCTGA CGACGGTCGA GAGTTAGTCG AAGAACTCGG ATACGAGATC
GAACCGCTGC AAGGTGGTGG CTACGTTCTT CGAGATGGTG CAAAAAAGAC AGCAGTCGCT
GTCTTCCTCG AAGATGATGA AGCCTTCGAG CAAACGAAAG ACCGCTTCTC AGGCAAATCG
GCAGTCAGTT ATTCACTAAA CAAAGCCGAT AACGAGAACC TACAGTACGT CGTTGCCAGT
TCGGGTACCA CTCTTCGGCT CTATACGACA GACGCTGATG CCGGATTTGG ATCGCGTGGT
CGGACTGATA CCTTCGTCGA ACTCAATCCA GATCTTCTCA CCGACGATAT GGCGGGCTAC
CTTTGGCTTC TTTTCTCTGC TGAAGCACTC CGTGATGGCG GATCATTAGA AGAAATTATG
GCTCGCTCTG AGGACTACGC TGCCGACTTA GGTGAGCGAC TTCGAGAACG CATCTATGAT
GACGTAGTAC CGCAGTTGGC CGAGGCTATC GCAGAGTCGC GTGATCTCGA CGACCCAACA
AAAGAAGAAC TGGATCAGAC CTACCGGATG GCATTGGTCC TTCTTTACCG CTTGCTCTTC
ATCTCCTACG CTGAGGACGA GGACTTCCTC CCACGACGAC GTAATGGCAA CTACCGTGAG
CACTCGCTGA AGAACCTCGC TAAGCGTATT GACCGAACAC TAACCGACAA TGAACAGGAG
TTCGACGACG ACGCGACAGA CTACTGGGAT GAGGTGATGC AACTCACGAA ATATATCCAC
AATGGCCACG CTGAGTGGGG CTTACCAGAG TACGACGGTA CTTTACTCTC TTCGAACATG
GATATGTCTG AAGCTGGTGC TGAACTTGCA AAAATCGAAC TTACCAATGC GGAGTTCGGT
CCTGCTTTGG GCAGCCTACT CATCGACGAG ACACCTGACG AGCGCAAAGG TCCTATCGAC
TTTCGGAATA TTGGCGTCCG TGAGTTCGGT GTCATCTATG AGGGTCTCTT AGAATCTGAG
TTGTCTGTCG CCGACGATGA TTTAACTCTT GATGATGGGA ATTACCGGCC TGCAATAGAT
GACGAAACAG TGGATGTTGA AGAGAGAGAA ATATATCTGC ACGGCCAATC AGGAGAGAGA
AAATCTACGG GATCGTATTA TACCGGGAGT GAATTCGTTG AACATCTCTT AGATTATTCG
TTGAAGCCTG CTCTCGACGA TCACATTGAT AAACTCAAAG GGATGTCGGA CAATAAGGCA
GCGGAGCACT TCTTCGACAT TCGGGTTGCA GATATCGCAA TGGGTTCTGG TCACTTTCTA
GTTGGTGCCA TAGACCGAAT AGAGAGTCGT TTATCTGGCT TCTTAGAACA ACGAGATAGC
AAGCTACCAC GTGTCCAACA AGAATTGGAC CGTCTCGAAC AAGCAGCAAT GGATGCCTTC
GAAAACGAGG ATAATGCACC AGAAATCGAA CGGGATCAGT TACTTCGGCG GCAAGTGGCA
CGTCGGTGTA TCTACGGTGT TGACCTCAAT GATGAGGCAA CGTTGTTAGC ACGTCTTTCT
CTCTGGATTC ATACGTTCGT ACCTGGCCTC CCTCTCACTT TCTTGGACTA TAATCTCCAG
ACAGGGGATT CAATCGTCGG TATTGGCTCT CTCAATGAGA TCACAGATCT TGCGGATGTC
AAACAGAGTT CCTTGGGAAT GTTTCTCGAT GACGGTGATG ATGAGGCAAA TCTCCCTGAC
ATTGAGGAAG AGGTAGAGAT GATTGGTCAA ATGGCCGACT CCGATGCCTC AGAGGTACAA
CAAGCTCGAC AAACCCGTAA TAAAATCGAT GAGAGGCTTG AACAGACTGA AGCAGCACTC
GATATCCTCG TGGGTTCATA TCTGGACGAT GATGTTGAGA CTAGCGTTGT TACAATGGAC
TGTGACCTCA CAGGTGTATC CAGTTACGAC AAAGCACAAG AAGCACTCGG TGATCTTGAT
GTTCTTCATT TCCCAACAAC CTTCCCAGAG GTATTCACTG GTGAAGATCC TGGATTTGAC
GTAATTGTAG GTAACCCGCC ATGGGATAAA GTTCTACATG AGCCACAGCA ATTCTGGGTC
ACCCGATTCC CCGGACTGAA TGCACTCAGT AAATCAAAAC GTGAGGATAG AATCGAAGAG
CTTCGAGAGA AATATCCACA GATTGCCGAC GAAGAAGATG AAGTACAGGC AAATCGAGAA
TTGTATCAGG ACTATGTGAC CGCTGCCTAT GATGAGCAAG GTCATGGACA CAAAGATTAT
TCAAAACTCT TTGTTGAACG GGCTCTAAAC TTACAGAAGA GCAGTGGGAA GCTCGGATAT
GTTCTCCCAC GTCAATCCCT CGTACTGGGT GGATGGAAGA AACTCCGCCA GCGACTGCTT
GATGAATCTC ACTTAACTGT TTTACAGGCG AGAAACCGTG GAGGATGGAT CTTTGAGGAC
GTTCATCACA GCTACATGGT CGTTTTCCTG ACACAGAACT CAGAGACAAA TAACTCTGGA
GCACATATTT GGCCAGCTAC TAAAAGCAAA ACGGCATTAG AGAAGATCTC GATTGATAAT
GGACTGGATC TATCGTATGA TGAGGTTGTG AATCTAACTA CAGAGTCACA CGTTGTTTTG
CCATGGCTCA ACGATGAACG GGCTACGGAT ATCTTCCCTC AAATGGAAAA TGAGTCCCGT
TTGTCAGACG ACAATGGGTG GATAAGTGGA ATACACGATA GCCGTTGGGA CTTCCGTGGT
TCAGGACGAC ACGGACACCT AACTAAGGAT CAGTACTTCA CAAAGCCGCT TAGTTAG
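A GC content figure such as the one in the Gene Information section can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch; the helper name `gc_percent` is hypothetical, and it assumes that whitespace is the only non-base content in the input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence, block spacing included, into the string.
first_line = "ATGACTCTCG ATTTTCTCGC CACACGTATG TGGCGACCTT GGTCCCCGGA CCACGATTAT"
print(round(gc_percent(first_line), 1))
```

The value printed here covers only the first 60 bases, so it is not expected to match the whole-gene figure.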
 
Protein sequence
MTLDFLATRM WRPWSPDHDY FEIEGFDLGN GSNQLEIVLA HSEGRPRKDE MRRIWKDRRG 
GRPNPVLVVT TYDTDSVALC GPSGDSPPVY HDVDVGVAER IVDVALEKPD RFAAHRFLGE
VLDQITDDLV GLRNQGLLST HELRVGVPHR QDWDDAATKA DNALGADDGR ELVEELGYEI
EPLQGGGYVL RDGAKKTAVA VFLEDDEAFE QTKDRFSGKS AVSYSLNKAD NENLQYVVAS
SGTTLRLYTT DADAGFGSRG RTDTFVELNP DLLTDDMAGY LWLLFSAEAL RDGGSLEEIM
ARSEDYAADL GERLRERIYD DVVPQLAEAI AESRDLDDPT KEELDQTYRM ALVLLYRLLF
ISYAEDEDFL PRRRNGNYRE HSLKNLAKRI DRTLTDNEQE FDDDATDYWD EVMQLTKYIH
NGHAEWGLPE YDGTLLSSNM DMSEAGAELA KIELTNAEFG PALGSLLIDE TPDERKGPID
FRNIGVREFG VIYEGLLESE LSVADDDLTL DDGNYRPAID DETVDVEERE IYLHGQSGER
KSTGSYYTGS EFVEHLLDYS LKPALDDHID KLKGMSDNKA AEHFFDIRVA DIAMGSGHFL
VGAIDRIESR LSGFLEQRDS KLPRVQQELD RLEQAAMDAF ENEDNAPEIE RDQLLRRQVA
RRCIYGVDLN DEATLLARLS LWIHTFVPGL PLTFLDYNLQ TGDSIVGIGS LNEITDLADV
KQSSLGMFLD DGDDEANLPD IEEEVEMIGQ MADSDASEVQ QARQTRNKID ERLEQTEAAL
DILVGSYLDD DVETSVVTMD CDLTGVSSYD KAQEALGDLD VLHFPTTFPE VFTGEDPGFD
VIVGNPPWDK VLHEPQQFWV TRFPGLNALS KSKREDRIEE LREKYPQIAD EEDEVQANRE
LYQDYVTAAY DEQGHGHKDY SKLFVERALN LQKSSGKLGY VLPRQSLVLG GWKKLRQRLL
DESHLTVLQA RNRGGWIFED VHHSYMVVFL TQNSETNNSG AHIWPATKSK TALEKISIDN
GLDLSYDEVV NLTTESHVVL PWLNDERATD IFPQMENESR LSDDNGWISG IHDSRWDFRG
SGRHGHLTKD QYFTKPLS
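
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI base ordering (TCAG); it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The function name `translate` and the use of "X" for unrecognized codons are choices made for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence up to, but not including, the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACTCTCG ATTTTCTCGC CACACGTATG"))  # MTLDFLATRM
```

Applied to the last three blocks of the gene sequence (CAAAGCCGCT TAGTTAG, read in frame from AAG), the same function yields the C-terminal residues KPLS and then stops at the final TAG codon.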