Gene Information |
Locus tag | Htur_1021 |
Symbol | |
ID | 8741608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 1053596 |
End bp | 1056478 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646511599 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_003402586 |
Protein GI | 284164307 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAC AGGGCGAACG CAGTGACCAA CCGGGGACGC CGGGTCTCGA AAGCGGCCTC GAGGCATTGC GTCAGAGTCC TGAGTTTCGC GGCCCGGTCG AATCCCTCGA TGGTCACGAC CACGCCAACG ACCATTTCGC ACTCATATAT AAGAACCGGG ACGAGCAGTT CGCGGCCGCC ATCCCGTTCA TCCGCCAGGG GCTCGAGCAG GGCGAGCGAT GCCTGTACGT CGCGGATGAC AACTCGAAAG CAGAAGTACT GGAAGCGATG CGGGCCCGCG GCATTGACGT GGACGGTGCC CTCGACTCGG GTGCGCTCTC CGTCCACACG GAGGCGGACA CGTACCGCAG GACCGGCACG TTCAATCAGG ATGCGATACT GGAGTTCTGG GAGGACTCCC TAACGGAGGC AACAGACGAG GACGGCTACA CGGGAATCAG GGCAGCCACC GAGATGACGT GGGCGCTGGA TGAGAATACG AGCCCCGATC AATTGGTTGA GTACGAAGCG GCCCTCAACT CTCTGTTCCA GAACGAGGAC TACACCGTCA TGTGTCAGTA TAATCGCGAG CGGTTCCCAC CCGAGGTGCT CGAAGACGTC ATCCAGACCC ACCCGCTCCT CATCCACGAC AACGTGATCT CTCACAACGT CTACTACACC CCACCCGAGG AGGTCTTCGG CCCCGAGCAG CCAGCCGACA GGGTCGATCG GATGATGGGG ACGCTGCGCG AGCGAACCGA GGCGAAGACG GAACTCCAGC GATCGAAGGA ACACGAGCAG GCCCAGCAGA GATTGTACGA GATCGTTGCC GATTCCGACC TGCCGTTCGA CGACAAACTC CAGGCGGTAC TCGAGCTCGG CTGCGAGCGG TTCGACCTCG AATACGGCGG GATCGCCCGC ATCGATCCGG CGACCGATCT GTTCGAGGTG GAGACCATAC GCGGGGACCA CGACCACCTC GTCCCGGGCG AGCAGTATCC GCTCTCCGAA ACCTACTGCC GATCGGTGGC GGACGACGGA GAAACCGCTG TCGTCACCGA CCCCGTAAGC GAGGGGTTCG AGGGGAAACT GTGCTACGAG CGCTTCGGCG TCCAGACGTA CCTCGGAACC CGACTCGAAG TCGATGGTGA CGACGATCGG ACGTTCTTCT TCGCCTCGAA CGAACCACGG GAGGAGGGGT TCTCTGAGGC CGAACGCACG TTCCACCACC TGATGGGGCA GTGGGTGGAG TACGAACTCG AACGGAGGCA GGCCGCGGAG GCGCTGCGCG AACAAACCCA CACTCTCGAA ACGATCAACC AGGTGGGCAA CTCGTTGGCG GCCGAACTCG ACCTCGAGAA TCTGGTGCAG AAGGTGACAG ACGCCGGTAC GGAAATAACC GGCGCGGAGT TCGGTGCCTT CTTCTATAAC GTCATCGACG ACCAGGGAGA ATCCTACACG CTCTATACCC TCTCGGGCGT TCCCGACGAG GCGTTCGAAG ACTTCCCGAT GCCGCGCAAT ACGGAGGTCT TCGGCCCGAC CTTTCACGGC GAGGGGGTCG TCCGTTCGGA CGACATCACC AACGATCCGC GCTACGGTAA CAACGCGCCC TACAACGGGA TGCCCGAAGG CCATCTGCCC GTCTGCAGTT ACCTGGCGGT TCCCGTAATC TCGAACTCCG GTGAAGTACA CGGCGGGCTT TTCTTCGGCC ATTCTGAACC GGGCGTCTTC ACCGAGAAGG ACGAAAACAT CATCACCGGG ATTTCCGCTC AAGCGGCCGT AGCCATCGAT AACGCTCGTC TGTACGAAAC GGCGCGCGAA 
AGCGAGCAGC GATTCCGGGC GCTGGTCACC GCGAGTTCCG AAGCCGTGTT TCGCATGGGC CCCGATTGGG ACGAAATGCA ACACCTCGAA GCCCAGGGCT TCCTCGCCGA CACGAACGAA CCGACCAGCG ACTGGCTCGA CAAATACATT CACCCGAACG ACCAGGAGCG CGTCATGGAG GCCGTCAACG AAGCCGTCCG GACAAAGAGC ACGTTCGAGC TCGAACACCG AGTAGAGCAG GTCGATGGCA GCCTGGGCTG GTCGTTCACG CGTGCGGTAC CGATGCTGAA CGAGGACGGT GACATCGAGG AATGGATCGG GATGGCGAGC GACGTTACCG AGCGCAAGCG TCGCCAGCAG GAACTCGAAC AAACTAACGC GCAACTGGAA CGCTCGAACG CCGAATTGAA GCGGTTCGCC TACGCCGCCT CCCACGACCT CCAGGAGCCG TTACGGATGG TGTCGAGTTA TGTCCAACTG CTCGAACAAC GATATGCCGA CGATCTCGAT GCCGACGCAC AGGAGTACAT CGAGTTCGCC GTCGATGGTG CCGACCGGAT GCGCGAGATG ATCGATGCGT TGCTGCAGTA TTCACGGCTC AACACGAGTG ACAAAGAATT CGAACCCGTG GACTGTAATG ACGTGCTCGC CCAGGCGACG GATAATCTTC AAATCGCCAT CGAAGAGAGC AGCGCCGAGA TTACCTCGGA TTCACTGCCC ACGGTCATGG GCGACGAGCA GCAACTGGTG CAGCTGTTCC AAAATCTGCT CGATAATGCT ATTACGTACG CTGGTGATGA GCCGCCGCAT ATTCACGTCA CCGCTGAGAA GCAAAACGAT GAATGGGTGC TGTCGGTCCA GGATAACGGA ATCGGGATCG GTTCGGAAAA GGCTGAGGAG ATCTTTGAGG TGTTCAACCG CCTCCACACC ACTGACGAGT ATGCCGGCAC TGGTATCGGC CTCGCACTCT GCCAACGGAT CGTTGATATT CACAACGGCC GCATTTGGGT TGAGTCGGAA CTCGGTGAGG GGTCGACTTT CTCATTCACA GTTCCCGAGA AGAAAGCGAG CAAATCCGCA TAG
|
Protein sequence | MSQQGERSDQ PGTPGLESGL EALRQSPEFR GPVESLDGHD HANDHFALIY KNRDEQFAAA IPFIRQGLEQ GERCLYVADD NSKAEVLEAM RARGIDVDGA LDSGALSVHT EADTYRRTGT FNQDAILEFW EDSLTEATDE DGYTGIRAAT EMTWALDENT SPDQLVEYEA ALNSLFQNED YTVMCQYNRE RFPPEVLEDV IQTHPLLIHD NVISHNVYYT PPEEVFGPEQ PADRVDRMMG TLRERTEAKT ELQRSKEHEQ AQQRLYEIVA DSDLPFDDKL QAVLELGCER FDLEYGGIAR IDPATDLFEV ETIRGDHDHL VPGEQYPLSE TYCRSVADDG ETAVVTDPVS EGFEGKLCYE RFGVQTYLGT RLEVDGDDDR TFFFASNEPR EEGFSEAERT FHHLMGQWVE YELERRQAAE ALREQTHTLE TINQVGNSLA AELDLENLVQ KVTDAGTEIT GAEFGAFFYN VIDDQGESYT LYTLSGVPDE AFEDFPMPRN TEVFGPTFHG EGVVRSDDIT NDPRYGNNAP YNGMPEGHLP VCSYLAVPVI SNSGEVHGGL FFGHSEPGVF TEKDENIITG ISAQAAVAID NARLYETARE SEQRFRALVT ASSEAVFRMG PDWDEMQHLE AQGFLADTNE PTSDWLDKYI HPNDQERVME AVNEAVRTKS TFELEHRVEQ VDGSLGWSFT RAVPMLNEDG DIEEWIGMAS DVTERKRRQQ ELEQTNAQLE RSNAELKRFA YAASHDLQEP LRMVSSYVQL LEQRYADDLD ADAQEYIEFA VDGADRMREM IDALLQYSRL NTSDKEFEPV DCNDVLAQAT DNLQIAIEES SAEITSDSLP TVMGDEQQLV QLFQNLLDNA ITYAGDEPPH IHVTAEKQND EWVLSVQDNG IGIGSEKAEE IFEVFNRLHT TDEYAGTGIG LALCQRIVDI HNGRIWVESE LGEGSTFSFT VPEKKASKSA
|
| |
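
The fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database's own tooling: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. The function names (`gene_length`, `protein_length`, `translate`) are hypothetical and introduced only for this example. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard codon table is used here.

```python
from itertools import product

# Standard codon assignments, shared by translation table 11.
# Codons are enumerated in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # Values copied from the record above.
    length = gene_length(1053596, 1056478)
    print(length, protein_length(length))
    # First three 10-base groups of the gene sequence.
    print(translate("ATGAGTCAAC AGGGCGAACG CAGTGACCAA"))
```

Under these assumptions, the coordinates reproduce the listed gene length of 2883 bp and protein length of 960 aa, and the first ten codons of the gene sequence translate to the first ten residues of the listed protein sequence.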