Gene Htur_5232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5232 
Symbol 
ID8745780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013747 
Strand
Start bp129394 
End bp131490 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content68% 
IMG OID646515589 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_003406536 
Protein GI284176259 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCGG GTCCTCCGTC GGCGCCCTCT GCCGGGAATA CGGACTCTCA GTCGTCCGCG 
AGCGAGTTCG GCACCGGAGC GTTGCTCGCC GCAGCGGATG TCGCCGCCTT TCGCGTCGGA
GCGGACGGAA CGATCGCCGC GGTCAACGAC GCCTTCGCGT CGCTGACCGG CCACGACCGC
TCGGAACTGC TCGGAACGCC GTTTTCGGAG CTCACGGCGG CCGACGATCC GACGCTCGAG
TCGCTCGTTG ACGAGGCTGC CGGGGCGGGC CCGCTCTCGG CGCAGGTCTC GATCCGAACG
AACGCGGAGA CGTCGATCGC CGTCGACGTC CACCTCGAGG TTCGCGAGAC CGACGACGGG
CCCCGGCTCG CCGGGGTCGT CAACCGGCGG CCGACTTCCA CGGAGTCGGT TTCCGACTCG
GATCTCACCT ACGGTCGGAC GTTCGAGGCG CTGGCTAACG CGCTCCCGGA CGGCATCATC
GTCCTCGATA CGAACAGCGA CGTCCAGTAC GCTAACCCCG CCGTCGAACG GATTCTCGGC
CACGCCCCGG ACGAACTCGT CGGCTCGAGC AAGGTCAACA TCATCCCGCC GCGCCTGCGC
CAGGCCCACC TCGACGCCCT GCAGAACTAC CTCGAGACCG GCGAGCGGAA CTTAAACTGG
ACCTACGTGG AGCTGCCCGG ACAGCACAAG TCGGGCCACG AGGTCCCGTT GGGCGTCTCG
CTCAACGACT TCACCTACGA CGGCGACCGC TACTTCGTCG GGCTCTTCCG GGACATATCG
CCGCGAAAGG AGGCCGAACG GACGCTCAAG GCCAAGGTCG CCCAGCTCGA GTCGATCGCC
TACCTCGGCC GCCACGCGCT CGAGGAGGGC GACGCCGACG ACCTCCTCGA GAAGGCGACC
GAACTGGCCA GCGCGGCCCT CGAGGTCGAC TGCTGCGTCG CCTTCGAGTA CGACAGCATC
GGAACCGCCC CCGGGACGGT CGGATCGGGC GCTCCGACGG CCGACGTCGG CATCCGGATG
GCCGGCTCGG ACGGCGACGC GTTTCGGGTT CGCGCGACCA CCGATTGCGA CGAGGCCGTG
CTCGAACGCG AGTGCGAGCG ATCGGCGTCG ACCGAGTCGC TGGCCGGCGC CACGCTCGCG
TCGGACGAGC CGATCGTCGT CGAAGACGTC GCGACCGACG ACCGGATCGA CGCGCCCCAC
CTGTCCGACG AGGGGATCCA CAGCGGGCTC GGCGTGACGA TCGGTCCGAT CACCGACCCG
TGGGGGGTCC TCGTCGCCTA CGACGGCCGG GACGGGGAGT TCGCCGACCA CGACGTCGAC
TTCCTCGAGA GCGCCGCGAC GATCATCGCG ACCGCGCTCG AACGCCAGGC GTACGAACGC
CAACTGAGCG AGACGGTCGA CGAACTCGAG GCCTCGAACG AGCGCTTGGA GCAGTTCGCC
TACGCGGCCA GTCACGACCT CCAAGAGCCC CTGCGGATGG TCTCGAGCTA TCTGGGCTTG
GTCGAGAGTC GGTACGCCGA CGAGCTCGAC GACGACGCGA TGGAGTTCAT CGAGTTCGCC
GTCGACGGCG CCGATCGGAT GCGCGAGATG ATCGACGGCC TGCTCGAGTA CTCCCGGATC
GACACGCAGG GTGAACCGTT CGAGCCCGTC GATCTCGAGA CCGTTCTCGA GGACGTCCTG
ACGGACCTGC AGATGATGCT CGAGGAGTCC GACGCCGAGA TCACCGCCGA ATCGCTGCCG
ACCGTCCGCG GCGATCCCAC GCAGCTCCGG CAGTTACTGC AGAACCTGCT GTCGAACGCG
ATCGAGTACT CGGGGGACGA GCCGCCGCGG GTGCACCTCG AGGCCGAACG GTGTGGCCGA
ACGTGGCGGG TCTCGGTCGA AGATGAGGGC ATCGGGATCG ATCCCGAGGA CGCCGACCGG
ATCTTTCAGG TGTTCCAGCG CCTGCACAGC CGTGAAGAGT ACGACGGCAC CGGCATCGGG
CTGGCGCTCT GCCGGCGCAT CGTCGAGCGC CACGGCGGCG ATATCTGGGT CGACTGCGAA
TCCGGCGAGG GAGCGACGTT CTCGTTTACG CTGCCTCGAG CCAACGCGGA CGCGTAG
 
Protein sequence
MESGPPSAPS AGNTDSQSSA SEFGTGALLA AADVAAFRVG ADGTIAAVND AFASLTGHDR 
SELLGTPFSE LTAADDPTLE SLVDEAAGAG PLSAQVSIRT NAETSIAVDV HLEVRETDDG
PRLAGVVNRR PTSTESVSDS DLTYGRTFEA LANALPDGII VLDTNSDVQY ANPAVERILG
HAPDELVGSS KVNIIPPRLR QAHLDALQNY LETGERNLNW TYVELPGQHK SGHEVPLGVS
LNDFTYDGDR YFVGLFRDIS PRKEAERTLK AKVAQLESIA YLGRHALEEG DADDLLEKAT
ELASAALEVD CCVAFEYDSI GTAPGTVGSG APTADVGIRM AGSDGDAFRV RATTDCDEAV
LERECERSAS TESLAGATLA SDEPIVVEDV ATDDRIDAPH LSDEGIHSGL GVTIGPITDP
WGVLVAYDGR DGEFADHDVD FLESAATIIA TALERQAYER QLSETVDELE ASNERLEQFA
YAASHDLQEP LRMVSSYLGL VESRYADELD DDAMEFIEFA VDGADRMREM IDGLLEYSRI
DTQGEPFEPV DLETVLEDVL TDLQMMLEES DAEITAESLP TVRGDPTQLR QLLQNLLSNA
IEYSGDEPPR VHLEAERCGR TWRVSVEDEG IGIDPEDADR IFQVFQRLHS REEYDGTGIG
LALCRRIVER HGGDIWVDCE SGEGATFSFT LPRANADA