Gene Htur_1021 details


Gene Information

Locus tag: Htur_1021
Symbol:
ID: 8741608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 1053596
End bp: 1056478
Gene Length: 2883 bp
Protein Length: 960 aa
Translation table: 11
GC content: 60%
IMG OID: 646511599
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_003402586
Protein GI: 284164307
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
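
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1053596
end_bp = 1056478
gene_length_bp = 2883
protein_length_aa = 960

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: three bases per codon, and the last codon is a stop codon
# that does not contribute a residue to the protein.
computed_protein_length = computed_gene_length // 3 - 1

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```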


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCAAC AGGGCGAACG CAGTGACCAA CCGGGGACGC CGGGTCTCGA AAGCGGCCTC 
GAGGCATTGC GTCAGAGTCC TGAGTTTCGC GGCCCGGTCG AATCCCTCGA TGGTCACGAC
CACGCCAACG ACCATTTCGC ACTCATATAT AAGAACCGGG ACGAGCAGTT CGCGGCCGCC
ATCCCGTTCA TCCGCCAGGG GCTCGAGCAG GGCGAGCGAT GCCTGTACGT CGCGGATGAC
AACTCGAAAG CAGAAGTACT GGAAGCGATG CGGGCCCGCG GCATTGACGT GGACGGTGCC
CTCGACTCGG GTGCGCTCTC CGTCCACACG GAGGCGGACA CGTACCGCAG GACCGGCACG
TTCAATCAGG ATGCGATACT GGAGTTCTGG GAGGACTCCC TAACGGAGGC AACAGACGAG
GACGGCTACA CGGGAATCAG GGCAGCCACC GAGATGACGT GGGCGCTGGA TGAGAATACG
AGCCCCGATC AATTGGTTGA GTACGAAGCG GCCCTCAACT CTCTGTTCCA GAACGAGGAC
TACACCGTCA TGTGTCAGTA TAATCGCGAG CGGTTCCCAC CCGAGGTGCT CGAAGACGTC
ATCCAGACCC ACCCGCTCCT CATCCACGAC AACGTGATCT CTCACAACGT CTACTACACC
CCACCCGAGG AGGTCTTCGG CCCCGAGCAG CCAGCCGACA GGGTCGATCG GATGATGGGG
ACGCTGCGCG AGCGAACCGA GGCGAAGACG GAACTCCAGC GATCGAAGGA ACACGAGCAG
GCCCAGCAGA GATTGTACGA GATCGTTGCC GATTCCGACC TGCCGTTCGA CGACAAACTC
CAGGCGGTAC TCGAGCTCGG CTGCGAGCGG TTCGACCTCG AATACGGCGG GATCGCCCGC
ATCGATCCGG CGACCGATCT GTTCGAGGTG GAGACCATAC GCGGGGACCA CGACCACCTC
GTCCCGGGCG AGCAGTATCC GCTCTCCGAA ACCTACTGCC GATCGGTGGC GGACGACGGA
GAAACCGCTG TCGTCACCGA CCCCGTAAGC GAGGGGTTCG AGGGGAAACT GTGCTACGAG
CGCTTCGGCG TCCAGACGTA CCTCGGAACC CGACTCGAAG TCGATGGTGA CGACGATCGG
ACGTTCTTCT TCGCCTCGAA CGAACCACGG GAGGAGGGGT TCTCTGAGGC CGAACGCACG
TTCCACCACC TGATGGGGCA GTGGGTGGAG TACGAACTCG AACGGAGGCA GGCCGCGGAG
GCGCTGCGCG AACAAACCCA CACTCTCGAA ACGATCAACC AGGTGGGCAA CTCGTTGGCG
GCCGAACTCG ACCTCGAGAA TCTGGTGCAG AAGGTGACAG ACGCCGGTAC GGAAATAACC
GGCGCGGAGT TCGGTGCCTT CTTCTATAAC GTCATCGACG ACCAGGGAGA ATCCTACACG
CTCTATACCC TCTCGGGCGT TCCCGACGAG GCGTTCGAAG ACTTCCCGAT GCCGCGCAAT
ACGGAGGTCT TCGGCCCGAC CTTTCACGGC GAGGGGGTCG TCCGTTCGGA CGACATCACC
AACGATCCGC GCTACGGTAA CAACGCGCCC TACAACGGGA TGCCCGAAGG CCATCTGCCC
GTCTGCAGTT ACCTGGCGGT TCCCGTAATC TCGAACTCCG GTGAAGTACA CGGCGGGCTT
TTCTTCGGCC ATTCTGAACC GGGCGTCTTC ACCGAGAAGG ACGAAAACAT CATCACCGGG
ATTTCCGCTC AAGCGGCCGT AGCCATCGAT AACGCTCGTC TGTACGAAAC GGCGCGCGAA
AGCGAGCAGC GATTCCGGGC GCTGGTCACC GCGAGTTCCG AAGCCGTGTT TCGCATGGGC
CCCGATTGGG ACGAAATGCA ACACCTCGAA GCCCAGGGCT TCCTCGCCGA CACGAACGAA
CCGACCAGCG ACTGGCTCGA CAAATACATT CACCCGAACG ACCAGGAGCG CGTCATGGAG
GCCGTCAACG AAGCCGTCCG GACAAAGAGC ACGTTCGAGC TCGAACACCG AGTAGAGCAG
GTCGATGGCA GCCTGGGCTG GTCGTTCACG CGTGCGGTAC CGATGCTGAA CGAGGACGGT
GACATCGAGG AATGGATCGG GATGGCGAGC GACGTTACCG AGCGCAAGCG TCGCCAGCAG
GAACTCGAAC AAACTAACGC GCAACTGGAA CGCTCGAACG CCGAATTGAA GCGGTTCGCC
TACGCCGCCT CCCACGACCT CCAGGAGCCG TTACGGATGG TGTCGAGTTA TGTCCAACTG
CTCGAACAAC GATATGCCGA CGATCTCGAT GCCGACGCAC AGGAGTACAT CGAGTTCGCC
GTCGATGGTG CCGACCGGAT GCGCGAGATG ATCGATGCGT TGCTGCAGTA TTCACGGCTC
AACACGAGTG ACAAAGAATT CGAACCCGTG GACTGTAATG ACGTGCTCGC CCAGGCGACG
GATAATCTTC AAATCGCCAT CGAAGAGAGC AGCGCCGAGA TTACCTCGGA TTCACTGCCC
ACGGTCATGG GCGACGAGCA GCAACTGGTG CAGCTGTTCC AAAATCTGCT CGATAATGCT
ATTACGTACG CTGGTGATGA GCCGCCGCAT ATTCACGTCA CCGCTGAGAA GCAAAACGAT
GAATGGGTGC TGTCGGTCCA GGATAACGGA ATCGGGATCG GTTCGGAAAA GGCTGAGGAG
ATCTTTGAGG TGTTCAACCG CCTCCACACC ACTGACGAGT ATGCCGGCAC TGGTATCGGC
CTCGCACTCT GCCAACGGAT CGTTGATATT CACAACGGCC GCATTTGGGT TGAGTCGGAA
CTCGGTGAGG GGTCGACTTT CTCATTCACA GTTCCCGAGA AGAAAGCGAG CAAATCCGCA
TAG
 
Protein sequence
MSQQGERSDQ PGTPGLESGL EALRQSPEFR GPVESLDGHD HANDHFALIY KNRDEQFAAA 
IPFIRQGLEQ GERCLYVADD NSKAEVLEAM RARGIDVDGA LDSGALSVHT EADTYRRTGT
FNQDAILEFW EDSLTEATDE DGYTGIRAAT EMTWALDENT SPDQLVEYEA ALNSLFQNED
YTVMCQYNRE RFPPEVLEDV IQTHPLLIHD NVISHNVYYT PPEEVFGPEQ PADRVDRMMG
TLRERTEAKT ELQRSKEHEQ AQQRLYEIVA DSDLPFDDKL QAVLELGCER FDLEYGGIAR
IDPATDLFEV ETIRGDHDHL VPGEQYPLSE TYCRSVADDG ETAVVTDPVS EGFEGKLCYE
RFGVQTYLGT RLEVDGDDDR TFFFASNEPR EEGFSEAERT FHHLMGQWVE YELERRQAAE
ALREQTHTLE TINQVGNSLA AELDLENLVQ KVTDAGTEIT GAEFGAFFYN VIDDQGESYT
LYTLSGVPDE AFEDFPMPRN TEVFGPTFHG EGVVRSDDIT NDPRYGNNAP YNGMPEGHLP
VCSYLAVPVI SNSGEVHGGL FFGHSEPGVF TEKDENIITG ISAQAAVAID NARLYETARE
SEQRFRALVT ASSEAVFRMG PDWDEMQHLE AQGFLADTNE PTSDWLDKYI HPNDQERVME
AVNEAVRTKS TFELEHRVEQ VDGSLGWSFT RAVPMLNEDG DIEEWIGMAS DVTERKRRQQ
ELEQTNAQLE RSNAELKRFA YAASHDLQEP LRMVSSYVQL LEQRYADDLD ADAQEYIEFA
VDGADRMREM IDALLQYSRL NTSDKEFEPV DCNDVLAQAT DNLQIAIEES SAEITSDSLP
TVMGDEQQLV QLFQNLLDNA ITYAGDEPPH IHVTAEKQND EWVLSVQDNG IGIGSEKAEE
IFEVFNRLHT TDEYAGTGIG LALCQRIVDI HNGRIWVESE LGEGSTFSFT VPEKKASKSA
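
The gene and protein sequences above are shown in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the DNA, and compute a GC fraction. It is an illustration under stated assumptions, not the tool that produced this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for every codon (the two tables differ only in which codons may act as start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence in this record.
first_row = "ATGAGTCAAC AGGGCGAACG CAGTGACCAA CCGGGGACGC CGGGTCTCGA AAGCGGCCTC"
print(translate(first_row))  # MSQQGERSDQPGTPGLESGL
```

The translated first row agrees with the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.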