Gene Hlac_0412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0412 
Symbol 
ID7401029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp429354 
End bp432545 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content60% 
IMG OID643707476 
Producthypothetical protein 
Protein accessionYP_002565085 
Protein GI222478848 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACG ACAATAATAC ACGTAAGAAG GCCAGTGCGG TCTTCTTCGC GGCGATCATG 
GTCGTCTCCA TGGTCGCAAT TGGTTTCGCT GCTGCTCCGG CAGCAGCCGA CCTTGATGGT
ACAAACAGTA CCATCGACTC TGCGGCCGCC GAACCCGTGA CGGCTGGGTT CGACTCGTCA
GAGCAGGACG TACAGGTAAA GGTTACAATT AGCGACAATG CCAGCGACAA CGTGACTGTC
GACCTCTCTG AGGCGGCAGA CGTGGAAGCT GTTCCGTCGC TACAGGACGT AACCATCTCC
CCCAGTACGA CTGGTAATGT TAACCTCCTC GGCTCGGAGT TTGACGAGAG CACGAACGTT
CTCGAGTTCC AAGTCAAAGG TACCACCGCA GGAGACGACG TCGCAACAGT CACTGCCTCC
TTCGAGCACG ACCTCAGCAA TGTTGAAGGT CCCGCCGGTA CTACAAACCG GCTGAGCTAC
AGCGTTGGTG CAGGCACTGA AACGGATTCC ACGATTGCTA ACTTCGAGGT TTACTCCGCT
GAGGATGTCA GCCTTGACGC ACCGCTCACC GAAGACAATG TGAACTCCGC GGATTTCACA
GTCTCGCACC CGGCTAACAC TCCGGTCGCC GGTGATGAAG TTTCACTCTT CATCACTGGA
AACGATAATT CGACTGTTGT CGATCCCACT TCGGCTACCA CAGACGCAGA CCCCACAACC
TTCACGGGTG TCAATCTGGA CAGTCAGGGA ATCGCTTCCG GTGCTGGCGA CGAGATCCAG
TTCCATGCTG TGTTGGCGGC TGACGCCGAC AACGGTACGA GCGGATTCGA GGCCAGCTTA
AATAACGTTC AGGCTGATGG TACCGCCACT GTCGATGATT CTGATGCGGA TATCATCACC
GATAGTGATT GGGACGGTTC GCAGCTCTGG GAAGGTCAGA CAGTTACTGT TGACCTTAGT
GGCAGCAATG TTGCTCCCGG TGACGAGGTC ACCGTGCGAG AGGTCACTAA CCGCAATGGT
AACGGGTACG CGACCAACAC GCGACTAGCC CGCTCCCTCA CCGTTAGCGG TGATCGGACG
ATCTCCATCG AAACGGATCG TCTCCGTGGT GAAGCCGAAT ACGTCCTTCG GACGTCTAAC
GGGCTCCTAG CAGCCGGAAC GGCCGGCGCT GGTACTGACG GAGAGTTCAG GATGGTTCTT
GGTGTATCCG ACATCGCTGA GGCGCAAGTT CTCACGCAGG ACCTCTCGGC AGAGTTTGCT
GAGGATGAAG CGAACAACGA CGAGAAGATC GACGTCGACG TTGCATCGCT GCGTAGCGAA
TTCGACGTTG AAGTGAGCGG CGACCTTGAC GACAGCGAGC TCTCCGAAGA AGAACTCGAG
AACATCTTCG ACGACGAGAG CGCAACCGGC CTCGACGACG GCGACGATGA CACGGTGCTC
ATCGAGGACG TCCAGGACGG CGAAGCCTTC GTTGCGAACT TCACTGATGT TGACGGCGGC
AACTACTCCT TCGACTTCGA CGTCGAGGAC ACTACGGCCT CCGACACCGA CTCGATCGAA
GTGACCGAGC TCGGCGAGGG TGAACTCACC CTTGGTGGCG AGAGCATCGT CACCGAACAG
CAGGGTGACG TCGCGAACAT CACCGTCACC TTTGACGGTA CCGCTGAAAC TGGTACGCTC
CTCGTCGGCG ACGAAGACGA TGTTGGCTAC CAGGGTAACA TCACGATTGA CTCGAACGGC
GAAGACGAAG TGAACGTCCT GTTCAACAGC TACGCCGCGG GTAGCTCCGG TAACGGGACG
GTCTTTGAGC TCGCGAACCC GGACGACACG GACGCTGAGC TCGACAACTT CCAGCAGAAC
CAGATTTCCG ACGTCCTCTC CGATGGCGAC TACACGCTCT CGGTGAGTAC GTCGAGCGAC
TACGACGATA CGCTGGACGA TCCCGACACG ATCGGGACGC TCGTGCTCGA ACAGCGCGCG
ACGACGAACC AGCAGATCTG GACGACCTCT GAAGGCACCG TCACTGACAT CGTCGACGCA
GCAGATGCCG ACGATGAGGA CGGGCTCGAA GAGCTGAACT CCCAGATTGA GGGTGACAAC
GTGACCCAGG CGAGCACGAT CGCCGAGGGC GACTACGTCA TCCACCAGAT CGAGGCGTCC
GGCCTGTCCG GCCTCCTCGC GCAGTACGAC GACGACCCGA CCACCGCGCT TAACGATGCA
GTGACCGACA CCGGCGCGGG TATCGGCACT GCAGTGACCG ACACCGGCGC GGGTATCGGC
ACCAACGACG GTGCGCTCAG CCTGCGTGTC CGTCAGACTC AGGCCTCGAC GACCGCCAAC
CAGGATCCCG CTGAACTGCA GAGCAACATC CTGGGTAACA TGACGGTCTT CGCAGACGAG
GAGACGAACA ACTACTACGT CGTCTACGAC CTCGACGACA CGTCGGCTGA GGATGGCGAA
GCGTTCGACG CACGCTTCCG CGTGCAGGAC GATCGCCTCC TCAACCCGTC TGACTCGGAC
CGTGACGCTC TCTCGACGAA CGAGCTCACC AACGAGTACT ACCAGAGCGT GACCGCTTCC
TTCGACGTGG CAGAGCGTGA ATTCGAGTTC GACCAGGACC CTTACAACGT GACGAACGCT
GAGGGTCAGG CCGTCTCGGG CACCTCGAAC GTTGCGCCCG GTACTGAAGT GAACGTCCGC
CTCCGCTCGG CGTCTGGTAC GAGTCCCTCG TTCATCGAGA CGAGCGAGGG CGTGCGTGTC
AACGCTGACG GCACGTGGAT GACTGAATTC GACTTCAGTG ACACCTCGGT CGGTGACGAG
TACACGTTCA CGGTCCGGCA GACGGGCCTT GACGAGAACC CGTCCGTCGA CGGTACGGTC
ATCGAAGCAG TCGATGACGG TACCGACGAT GGTGACGACG GCAACGTGAC CGACGGTGAC
GACGGTGACG ACGGTAACGT TACCGACGGT GACGATGGTG ACGACGGCAA CGTCACTGAC
GGTGACGACG GCACCGACGA TGGTGACGAC GGAACCGACG ACGGTGACGA CGGCTCCGAC
GATGGATCCG ACGGCTCCGA CGGCGGTGAC GACGGCGGCG ACTCCGAAGA CGGCACGCCC
GGCTTCGGTG CGCTCGTCGC TCTCGTCGCC CTCATCGCGG CTGCGCTCCT CGCGACGCGG
CGTAACGAGT AA
 
Protein sequence
MTNDNNTRKK ASAVFFAAIM VVSMVAIGFA AAPAAADLDG TNSTIDSAAA EPVTAGFDSS 
EQDVQVKVTI SDNASDNVTV DLSEAADVEA VPSLQDVTIS PSTTGNVNLL GSEFDESTNV
LEFQVKGTTA GDDVATVTAS FEHDLSNVEG PAGTTNRLSY SVGAGTETDS TIANFEVYSA
EDVSLDAPLT EDNVNSADFT VSHPANTPVA GDEVSLFITG NDNSTVVDPT SATTDADPTT
FTGVNLDSQG IASGAGDEIQ FHAVLAADAD NGTSGFEASL NNVQADGTAT VDDSDADIIT
DSDWDGSQLW EGQTVTVDLS GSNVAPGDEV TVREVTNRNG NGYATNTRLA RSLTVSGDRT
ISIETDRLRG EAEYVLRTSN GLLAAGTAGA GTDGEFRMVL GVSDIAEAQV LTQDLSAEFA
EDEANNDEKI DVDVASLRSE FDVEVSGDLD DSELSEEELE NIFDDESATG LDDGDDDTVL
IEDVQDGEAF VANFTDVDGG NYSFDFDVED TTASDTDSIE VTELGEGELT LGGESIVTEQ
QGDVANITVT FDGTAETGTL LVGDEDDVGY QGNITIDSNG EDEVNVLFNS YAAGSSGNGT
VFELANPDDT DAELDNFQQN QISDVLSDGD YTLSVSTSSD YDDTLDDPDT IGTLVLEQRA
TTNQQIWTTS EGTVTDIVDA ADADDEDGLE ELNSQIEGDN VTQASTIAEG DYVIHQIEAS
GLSGLLAQYD DDPTTALNDA VTDTGAGIGT AVTDTGAGIG TNDGALSLRV RQTQASTTAN
QDPAELQSNI LGNMTVFADE ETNNYYVVYD LDDTSAEDGE AFDARFRVQD DRLLNPSDSD
RDALSTNELT NEYYQSVTAS FDVAEREFEF DQDPYNVTNA EGQAVSGTSN VAPGTEVNVR
LRSASGTSPS FIETSEGVRV NADGTWMTEF DFSDTSVGDE YTFTVRQTGL DENPSVDGTV
IEAVDDGTDD GDDGNVTDGD DGDDGNVTDG DDGDDGNVTD GDDGTDDGDD GTDDGDDGSD
DGSDGSDGGD DGGDSEDGTP GFGALVALVA LIAAALLATR RNE