Gene Hlac_0718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0718 
Symbol 
ID7400191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp730092 
End bp732041 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content68% 
IMG OID643707784 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002565390 
Protein GI222479153 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.746522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAG GAAAATTTAT CCGCGCTCCA GTCTTGCTCA TCTGTATAGT GAACACCGAG 
CCCAGCGGGC GGTCGTCCGT CCTCGTTCTC GCTGGCGACG CGGACGAGGC GGCCGACGTG
GCCGACGCGT TGGAAAGCGC CGACGCCGCT CCCGCACCCG TCGACGGCTC CCCCCGACCT
TCCGTCGACG ACGCCGTCTC CCCCACCTCC GTCGACGACC TCGCGACCAC GGACGTGGCT
ATCGTCCTTG ACGCCGAGGG ACTCGACGCG ATCCGTGGTC AGAACCGGTC GTGCCGTGTG
GTCGCGTTCG TTGACGACCC CGCCACAGAG CCAGCAACCG ATCCCCTCGT CGACGTGATT
GCGACCACGC CGGCGGATCT CGATGCGACG CTCCGCTGGT TCGCGGCCAG AGGCGAGTCC
GGCATCGACC GCGACGACCC GACGCGAACG CCTTCGCGGA TCGAGCAGCT CAACGCAGAG
GTGACCCGCC TCGCGTCGGT CAGGTCGATA GAGGGGGCTC ACCGAACCGC GATCGCCGTC
GCCGCCGAAG TGTTCCCGCG GTACCACTGC GTCGTCGGCG TCCGCGACGG GGAGTGGGTC
GAACCGATCG CGACCTCGTC CGGGGTCTCG ATCGACGACT GTAACCGCGT TCGAGTCGGG
CGCGGCTCGG CCGGAACTGC GATCGATACC GGCGATCCGG TGATCGAGTC CCGAACGATA
GGCGAGGCCC CCTACGACGC GCTTCTCTCG CTTCCGGTCG GCGACGAGAC GGTACTCCAG
CTCGCGGCCG ACGAAGACGA TGGGTTCGAC GCAGGTGACC GCCGCCTCGC GGAGCTGCTC
GCGTCCCACG TCGAGGAGAC GCTCGACAGG ATCCGCGCCG ACGAAGCGCT TCGGACCGAG
CGGGACCACC TGCTCGCGCT GTTCTCGAAC GTCCCCGACC CGGCGATCGC CTACGACTAC
GTCGACGGCG AGCCGATCGT TCACCGAGTC AACGACGCGT TCGAAGAGAC GTTCGCCTAC
GACGCCGATC GCGTGGTCGG CGAGTCCGTC GACGACTATA TCGTTCCCCC CAGCGACGAG
GCCGAAGCCG AGGCCGTGGA ACTCAACGAG CGGCTTCAGT GCGGCGAAAA CGTCAGACGC
GAGGTGACCC GCGAGACCGC TGACGGCCTT CGACACTTCA TCCTCCACGT CGTGCCGATC
CGGCTCGACG CCGAGAACGT CTCCGGATAC GCTATCTACA CCGATGTGAC GGAGCGACGT
GAGCGCGAGG CGGCGCTCCG CAGACAGAAC GAACGGCTCG ACGAGTTCGC AAGTATCGTC
TCCCACGACC TCCGAAACCC GCTTTCCGTC GCAGAGGGAT ACGTCACCCT CGCGAACGAG
ACCGGCGAGA TCGCCCATCT CGAGAAAACC CTGGAGGCGC TGGACCGCAT GGACGAGCTC
GTCGGCGACC TGCTCTCGCT GGCCCGACAG GGAGAGGCGG TCGGCGAGAC CGAACCCGTC
TCGATCGAGG CGATCGCGAG GGACGCGTGG GAGAGTGTCG ACACCGACGG CATCGAGCTC
GTCGTCGACG GCGACGTGAC GATCGATGCA AGTCCGACCC GGACGCGAGA GCTGCTGGAG
AACGTGTTTC GAAACAGCGT GGAGCACGGC CGGCGCTCCT CAGGTGGGAA CGACGGCGGC
GACGGCAAGC CGCTTACGGT GCGCGTCGGC GACACTGCGT TCCGCGGCGA CGACGGGACA
ACGGGGAGCG GCTTCTTCGT CGAAGACAAC GGTTGTGGGA TCCCCGAGGG CGAGCGCGAC
CGCGTGTTCG AGAGCGGATT CACGACCGAG GAGGGCGGGA CCGGCCTCGG GCTCGCGATC
ACGAAGCGGA TCGCCGACGC GCACGACTGG CAGGTGCGCG CGCTCACCGG CGAGTCCGGC
GGGGCGCGGT TCGAGTTCAA AACGTCGTGA
 
Protein sequence
MTEGKFIRAP VLLICIVNTE PSGRSSVLVL AGDADEAADV ADALESADAA PAPVDGSPRP 
SVDDAVSPTS VDDLATTDVA IVLDAEGLDA IRGQNRSCRV VAFVDDPATE PATDPLVDVI
ATTPADLDAT LRWFAARGES GIDRDDPTRT PSRIEQLNAE VTRLASVRSI EGAHRTAIAV
AAEVFPRYHC VVGVRDGEWV EPIATSSGVS IDDCNRVRVG RGSAGTAIDT GDPVIESRTI
GEAPYDALLS LPVGDETVLQ LAADEDDGFD AGDRRLAELL ASHVEETLDR IRADEALRTE
RDHLLALFSN VPDPAIAYDY VDGEPIVHRV NDAFEETFAY DADRVVGESV DDYIVPPSDE
AEAEAVELNE RLQCGENVRR EVTRETADGL RHFILHVVPI RLDAENVSGY AIYTDVTERR
EREAALRRQN ERLDEFASIV SHDLRNPLSV AEGYVTLANE TGEIAHLEKT LEALDRMDEL
VGDLLSLARQ GEAVGETEPV SIEAIARDAW ESVDTDGIEL VVDGDVTIDA SPTRTRELLE
NVFRNSVEHG RRSSGGNDGG DGKPLTVRVG DTAFRGDDGT TGSGFFVEDN GCGIPEGERD
RVFESGFTTE EGGTGLGLAI TKRIADAHDW QVRALTGESG GARFEFKTS