Gene Information |
Locus tag | Htur_5232 |
Symbol | |
ID | 8745780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | + |
Start bp | 129394 |
End bp | 131490 |
Gene Length | 2097 bp |
Protein Length | 698 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646515589 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_003406536 |
Protein GI | 284176259 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCGG GTCCTCCGTC GGCGCCCTCT GCCGGGAATA CGGACTCTCA GTCGTCCGCG AGCGAGTTCG GCACCGGAGC GTTGCTCGCC GCAGCGGATG TCGCCGCCTT TCGCGTCGGA GCGGACGGAA CGATCGCCGC GGTCAACGAC GCCTTCGCGT CGCTGACCGG CCACGACCGC TCGGAACTGC TCGGAACGCC GTTTTCGGAG CTCACGGCGG CCGACGATCC GACGCTCGAG TCGCTCGTTG ACGAGGCTGC CGGGGCGGGC CCGCTCTCGG CGCAGGTCTC GATCCGAACG AACGCGGAGA CGTCGATCGC CGTCGACGTC CACCTCGAGG TTCGCGAGAC CGACGACGGG CCCCGGCTCG CCGGGGTCGT CAACCGGCGG CCGACTTCCA CGGAGTCGGT TTCCGACTCG GATCTCACCT ACGGTCGGAC GTTCGAGGCG CTGGCTAACG CGCTCCCGGA CGGCATCATC GTCCTCGATA CGAACAGCGA CGTCCAGTAC GCTAACCCCG CCGTCGAACG GATTCTCGGC CACGCCCCGG ACGAACTCGT CGGCTCGAGC AAGGTCAACA TCATCCCGCC GCGCCTGCGC CAGGCCCACC TCGACGCCCT GCAGAACTAC CTCGAGACCG GCGAGCGGAA CTTAAACTGG ACCTACGTGG AGCTGCCCGG ACAGCACAAG TCGGGCCACG AGGTCCCGTT GGGCGTCTCG CTCAACGACT TCACCTACGA CGGCGACCGC TACTTCGTCG GGCTCTTCCG GGACATATCG CCGCGAAAGG AGGCCGAACG GACGCTCAAG GCCAAGGTCG CCCAGCTCGA GTCGATCGCC TACCTCGGCC GCCACGCGCT CGAGGAGGGC GACGCCGACG ACCTCCTCGA GAAGGCGACC GAACTGGCCA GCGCGGCCCT CGAGGTCGAC TGCTGCGTCG CCTTCGAGTA CGACAGCATC GGAACCGCCC CCGGGACGGT CGGATCGGGC GCTCCGACGG CCGACGTCGG CATCCGGATG GCCGGCTCGG ACGGCGACGC GTTTCGGGTT CGCGCGACCA CCGATTGCGA CGAGGCCGTG CTCGAACGCG AGTGCGAGCG ATCGGCGTCG ACCGAGTCGC TGGCCGGCGC CACGCTCGCG TCGGACGAGC CGATCGTCGT CGAAGACGTC GCGACCGACG ACCGGATCGA CGCGCCCCAC CTGTCCGACG AGGGGATCCA CAGCGGGCTC GGCGTGACGA TCGGTCCGAT CACCGACCCG TGGGGGGTCC TCGTCGCCTA CGACGGCCGG GACGGGGAGT TCGCCGACCA CGACGTCGAC TTCCTCGAGA GCGCCGCGAC GATCATCGCG ACCGCGCTCG AACGCCAGGC GTACGAACGC CAACTGAGCG AGACGGTCGA CGAACTCGAG GCCTCGAACG AGCGCTTGGA GCAGTTCGCC TACGCGGCCA GTCACGACCT CCAAGAGCCC CTGCGGATGG TCTCGAGCTA TCTGGGCTTG GTCGAGAGTC GGTACGCCGA CGAGCTCGAC GACGACGCGA TGGAGTTCAT CGAGTTCGCC GTCGACGGCG CCGATCGGAT GCGCGAGATG ATCGACGGCC TGCTCGAGTA CTCCCGGATC GACACGCAGG GTGAACCGTT CGAGCCCGTC GATCTCGAGA CCGTTCTCGA GGACGTCCTG ACGGACCTGC AGATGATGCT CGAGGAGTCC GACGCCGAGA TCACCGCCGA ATCGCTGCCG ACCGTCCGCG GCGATCCCAC GCAGCTCCGG CAGTTACTGC AGAACCTGCT GTCGAACGCG 
ATCGAGTACT CGGGGGACGA GCCGCCGCGG GTGCACCTCG AGGCCGAACG GTGTGGCCGA ACGTGGCGGG TCTCGGTCGA AGATGAGGGC ATCGGGATCG ATCCCGAGGA CGCCGACCGG ATCTTTCAGG TGTTCCAGCG CCTGCACAGC CGTGAAGAGT ACGACGGCAC CGGCATCGGG CTGGCGCTCT GCCGGCGCAT CGTCGAGCGC CACGGCGGCG ATATCTGGGT CGACTGCGAA TCCGGCGAGG GAGCGACGTT CTCGTTTACG CTGCCTCGAG CCAACGCGGA CGCGTAG
|
Protein sequence | MESGPPSAPS AGNTDSQSSA SEFGTGALLA AADVAAFRVG ADGTIAAVND AFASLTGHDR SELLGTPFSE LTAADDPTLE SLVDEAAGAG PLSAQVSIRT NAETSIAVDV HLEVRETDDG PRLAGVVNRR PTSTESVSDS DLTYGRTFEA LANALPDGII VLDTNSDVQY ANPAVERILG HAPDELVGSS KVNIIPPRLR QAHLDALQNY LETGERNLNW TYVELPGQHK SGHEVPLGVS LNDFTYDGDR YFVGLFRDIS PRKEAERTLK AKVAQLESIA YLGRHALEEG DADDLLEKAT ELASAALEVD CCVAFEYDSI GTAPGTVGSG APTADVGIRM AGSDGDAFRV RATTDCDEAV LERECERSAS TESLAGATLA SDEPIVVEDV ATDDRIDAPH LSDEGIHSGL GVTIGPITDP WGVLVAYDGR DGEFADHDVD FLESAATIIA TALERQAYER QLSETVDELE ASNERLEQFA YAASHDLQEP LRMVSSYLGL VESRYADELD DDAMEFIEFA VDGADRMREM IDGLLEYSRI DTQGEPFEPV DLETVLEDVL TDLQMMLEES DAEITAESLP TVRGDPTQLR QLLQNLLSNA IEYSGDEPPR VHLEAERCGR TWRVSVEDEG IGIDPEDADR IFQVFQRLHS REEYDGTGIG LALCRRIVER HGGDIWVDCE SGEGATFSFT LPRANADA
|
| |
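The coordinate and length fields in the record above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes inclusive, 1-based coordinates (consistent with 131490 - 129394 + 1 = 2097) and a single terminal stop codon that is not counted in the protein length. The helper names (`clean`, `gene_length`, `gc_content`, `translate`) are hypothetical and are not part of any database API. The codon table is the standard amino-acid assignment, which translation table 11 shares; alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Illustrative helpers only; the names below are hypothetical.
BASES = "TCAG"
# Standard amino-acid assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    length = gene_length(129394, 131490)
    print(length)           # 2097, matching "Gene Length"
    print(length // 3 - 1)  # 698, matching "Protein Length" (stop codon excluded)
    # The first 21 bases of the gene sequence give the first 7 residues of the protein.
    print(translate("ATGGAATCGG GTCCTCCGTC G"))  # MESGPPS
```

Passing the full gene sequence to `gc_content` and `translate` would allow the "GC content" and "Protein sequence" fields to be cross-checked in the same way.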