Gene Htur_2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2359 
Symbol 
ID8742966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2427056 
End bp2429359 
Gene Length2304 bp 
Protein Length767 aa 
Translation table11 
GC content65% 
IMG OID646512943 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003403910 
Protein GI284165631 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGACC GAGCGGACGA CGTGAGAAAC GGAGGTGACG CGAACGAAGC GACGGCTCTC 
CGGCGGTATC AGACGCTCGT AAACGAAATC GATGACGGGG TGTATCAGCT CGATGCCGCC
GAACGGTTCG TCACGATCAA CGATCGGTTC GTGGAGTTGA CGGGCTACGC GCGCGACGCC
CTTCTCGGGG AGCACGTTTC CCTCGTGTTC GACGAGACCG ATCGGCAGCG AGTCAAACGG
GAGCTCTCCG AACTCCGTAC GACGGGGGGT CGGCGGAACG AAACGATCGA AACCGCGGTA
CACACCGCCG ACGGTGAGAC GATCCGCTGC GAACTCAGGG TGCACCCGCT CAGCGACGAG
GTCCACGGGT CCGTCGGCGT CCTCCAAGAG CGCAGCGAGC CGACGAGCGA CGGGAACAGA
ACCGGGCGTG TAGAGGCGGA AGGGCGACGC CGAAACGAGG TGACGTTCCG TCAGCTCGTC
GAGCACCTCG ATCAGGTCGT CTGGATGTCG ACCGCCGACA TGCGGGAGAC GATCTACGTG
AACTCGGCCT TCGAGGAGGT CTACGGCCGC GATCGGGAAC GCCTGTACGA GGACCCGGAG
GTCCTACTCG ACGCCGTCCA TCCGGACGAC CGGGAGTTGC TGCGGTCGGA GCTCGAGACC
GAGGTCGAGG AACCCCACGT GATAGAGTAC CGGATCGTGC AACCCGACGG CGACGTTCGG
TGGATCCACG ACCGCGTTGT TCCCGTCTAC GACGACGACG GGAACGTCTT CCGTATCGTC
GGCGAAGCGA TGGACATCAC CGAGCGGAAG GAGCACGAAC GCGAACTCGA GGACACGAAG
TCCCAGCTCG AAGCGGCGAC CGAGGCCGGC GCGGTCGGCA CGTGGGAGTG GCACGTTCGG
ACCGACGAGA TGATCGTGGG ACCGTCGTTC GCGAGGACGT TCGGCGTGAA CCCGGAGGCG
GCCCGTGACG GCGTCTCACT CGACCGGTTC GTCGAGGCCG TCCACGAGGA CGACCGCGAC
CGGGTCGCGG CCGAGATCGA GGAGGTCGTC GAAACCTGTG GCGAGTACGA GTCGGAGTAC
CGCGTCCGCG ACGCCGACGG CGAGCTCCGG TGGGTGGTCG CCCGCGGCCA CATCGAGTGC
GCCGAGAACG GCGAGGCGAT GACGTTCCCC GGCGCGCTCA CTGACATCAC CGAGCGCAAA
CGCGCCGAGT TGCGACTCGA GCAGACCACC GAGCAGCTGG CGACGCTGTT CGAAATCCTT
CCCGTCGGTG TCGTGGTCGC CGACAGCGAC GGGGGATTCG TCGAAGCTAA CGAGACGGCA
AAAGAGATCT GGGGCGGAGA CGTGTTCGAT GTCGAGTCGG TCGCGGAATA CGAACGGTAC
ACGGGGTGGT GGGCCGAGTC GGACGAGCCC GTCGAGCCCG AGGAGTGGAC GATGTCTCGA
GTGCTGGAGG GCGAGGAGGT CACCGACCCC GACATCTACG AAATCGAGAC CGTCGACGGT
GCGCGACGGA TCATTCAGGC GGAGGGAATG CCGGTTTGGG ACGCCGACGG AGACGTGACC
CGCGGCGTCG TCACCATTTC CGACATCACT GAACGGCGGG CGTATCAGCG GAAACTCGAG
GAGTCCAACG AGCGCTTGGA GCAGTTCGCC TACGCCGCCT CCCACGATCT CCAGGAGCCG
CTGCGGATGG TCACCAGCTA CCTCCAGTTG CTCGAAAGTC GGTACGCCGA CGCCTTCGAC
GAGGACGGCC GGGAGTTTTT GGAGTTCGCG GTCGACGGCG CCGACCGGAT GCGCGCGATG
ATCGACGGGC TGCTCGAGTA CTCCCGCGTC GAGACGCGGG GCGACCCGTT CGAGCCGACC
GATCTGAACG ACGTTCTCGA GGACGTTCGG AGCGACCTGC AGCTACAGAT CGAAGAGAGC
GGCGCCGAGA TCACGACAGA GGACCTCCCC CGAGTAAACG GCGACGTTGA CCAGTTACGG
CAGCTCTTTC AGAACCTGTT GTCGAACGCG ATCATCTACA GCGGCGAGGG ATCGCCACGC
GTCCGCGTCG ACGCCCGTCG GCGGGGCCGG CAGTGGGTGA TCTCAGTCGA AGACAATGGG
ATCGGGATCG ACCCCGAGGA TCAGGAGCGA ATTTTCACCG TCTTCGATCG ACTGCACAGC
CGCGAGGAGT ACGAGGGGAC GGGCATCGGA CTCGCGCTCT GTGAGCGCAT CGTCGAGCGC
CACGGCGGGG AGATCTGGGT CGACTCCGAA GCCGGCGACG GGGCGACGTT CTCGATGACG
CTTCCGGCCG CACGCGATCG ATAA
 
Protein sequence
MGDRADDVRN GGDANEATAL RRYQTLVNEI DDGVYQLDAA ERFVTINDRF VELTGYARDA 
LLGEHVSLVF DETDRQRVKR ELSELRTTGG RRNETIETAV HTADGETIRC ELRVHPLSDE
VHGSVGVLQE RSEPTSDGNR TGRVEAEGRR RNEVTFRQLV EHLDQVVWMS TADMRETIYV
NSAFEEVYGR DRERLYEDPE VLLDAVHPDD RELLRSELET EVEEPHVIEY RIVQPDGDVR
WIHDRVVPVY DDDGNVFRIV GEAMDITERK EHERELEDTK SQLEAATEAG AVGTWEWHVR
TDEMIVGPSF ARTFGVNPEA ARDGVSLDRF VEAVHEDDRD RVAAEIEEVV ETCGEYESEY
RVRDADGELR WVVARGHIEC AENGEAMTFP GALTDITERK RAELRLEQTT EQLATLFEIL
PVGVVVADSD GGFVEANETA KEIWGGDVFD VESVAEYERY TGWWAESDEP VEPEEWTMSR
VLEGEEVTDP DIYEIETVDG ARRIIQAEGM PVWDADGDVT RGVVTISDIT ERRAYQRKLE
ESNERLEQFA YAASHDLQEP LRMVTSYLQL LESRYADAFD EDGREFLEFA VDGADRMRAM
IDGLLEYSRV ETRGDPFEPT DLNDVLEDVR SDLQLQIEES GAEITTEDLP RVNGDVDQLR
QLFQNLLSNA IIYSGEGSPR VRVDARRRGR QWVISVEDNG IGIDPEDQER IFTVFDRLHS
REEYEGTGIG LALCERIVER HGGEIWVDSE AGDGATFSMT LPAARDR