Gene Htur_4049 details


Gene Information

Locus tag: Htur_4049
Symbol:
ID: 8744677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 302969
End bp: 305620
Gene Length: 2652 bp
Protein Length: 883 aa
Translation table: 11
GC content: 61%
IMG OID: 646514615
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003405562
Protein GI: 284167284
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
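
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; the variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 302969
end_bp = 305620

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon that does not appear in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2652, matching "Gene Length"
print(gene_length % 3)  # 0, a whole number of codons
print(protein_length)   # 883, matching "Protein Length"
```

Under these assumptions the reported gene length (2652 bp) and protein length (883 aa) agree with the start and end coordinates.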


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGATC AAGCCGGGAC TACACAGGGG GGATTTTGGA CGGACGCTAC CGGCGAGCGC 
ACAGTCGATC GGTATTCTAC ACTTGTCAAT ATGGTTGACG ATGGCATCTA CCAGCTTGAC
GCCGAGGGGC GGTTCGTCGC GGTCAACGAT GCCATAGTCA GTTTGACTGG CTACGCCCGG
GAAGAGCTTC TCGGCAAGCA CGCTTCAACC GTGATTGACG ACGAGGACGT CAGTCGCATC
CAGCGCGAGA TCTACCAGCG ATTTACAGAC GGCGACCGCG AGGGCGAGCC GCTCGAGTTC
ACCGCCCGGA TGGCCGACGG CGAGACCATC CCATGCGAAT TGGAGCTACA CCTGCTCGTC
GAGGATGGGA CGTTCCAGGG AACCATCGGG GTCGTGCGCG ACATCGCGGA CCGCAAACAG
ACAGAACAGG AGTTCCGTGA GCACGAACTC GAGCTGTTCC GTACCCTGCT TAATCACTCG
AACGACAGCG TACTGGTAGT GGATCCGGAG ACGGGCCGCT ATCTCGACGT CAACGACACT
GCCTGTGAGC GGCGGGGATA CTCGCGTGAA GAATTCCTTG ATCTCACGGT CATGGATCTC
GAAACCGAGA TTCCCGACCA AGAGGCGTGG CGGTCGTTCG TTGAGGAACT ACGTGCCGAG
GGGCAACTGA CCTTCGACGG ACACCACCGC CGCAAGGACG GTACGACGTA TCCGGTTGAG
GTCAATACTT CCTACGTTGA TCTGGATAAG GAGTATGTCC TCGCCATTGC TCGCGACGTT
ACCAAGCGCC GGGAGTACGA ACGGTACCTC GAAGATGCTA AGTCACAGTT AGAGGCAGCA
ACAGAAGCCG GCGCGGTTGG GACCTGGGAA TGGCATATCC CCGAGGACGA GATGGTCGTC
GGTACGTCGT TCGCTCGGAC GTTCGGCATC GAGCCGGAGG TAGCCTCCGA GGGTGTGCCC
CTCGATCAAT TTATCGAAGC CATCCATGAG GATGACCGCA AGCGAGTCGA GGTGGCGATC
GAGGAGGCCG TCGAGACCTG TGGCGACTAC GAGGAAGAAT ACCGCGTGTG GGATGCTGAC
GACAATCTCC GGTGGGTTGT CGCTCGCGGG CACGTCGAGT GCGACGATGG CGAGCCAGTC
CGCTTCCCGG GGGCACTCAC CGACATCACC AAACGCAAAC GCGCCGAATT AAAACTCCAG
CAGAACAACG AACAGCTGGA GACCCTCTTC GAAGTCCTTC CTGTCGGTGT CGTGGTCGCG
GAGGCCGACG GGCAAATCAT CCAGGCCAAC GACATTGCAC ACGAGATCTG GGGCGGTGAT
GTCTTCGACG CTGAGTCCGT CGAGGAGTAC GAGCAGTACC CGGTCTGGGA TGCTGAGTCC
GGCGAGCGCG TCCAGCCAGA CGAGATGACG CTCGCGAGGG TCGTCAACGG CGAGGAGGTG
CTCGACCCGG ACATCTACGA GATCGAGGCA GTCGACGGCG AGCGCCGAGT CATCCGGCTG
GAGGGGATGC CCGTTCGTGA CGAACATGGC GAGGTGACCC GCGGCGTGGC TACTCTAACC
GACATCACCG ACCGCCGTGA GGCCCAACGT GCGCTTGAGG AGTCCGAGCG CCGCTACCGA
ACGCTGGTCG AAAACTTCCC GAACGGAGCA GTAGGCCTCT TCGATGATAA CCTACGATAC
ACCGCTGTTG GCGGTCAGCT CTTCGACAGA CTCGACTACG ACCCGGAGGA TCGGATTGGA
TACCGCTTTA CCGAACTCCA CACTCCGGAT CTGGTTGAGG AGTTAGAACC GCACTTTCGG
GCCGCACTCG ATGGTGAGGC AAGCACCTTC GAAATCGAAT ACCAGGGCCG GTACCTGCAC
GCCCATACAC TCCCTGTCAG AGACGCCGAC GACGAGGTGT ACGCGGGCAT GCTCGTGATC
CGAGACGTGA CTGAGCGACG GGAATACGAG CGTATGCTGG AGGAATCGAA CGACCGCCTC
GAACAGTTCG CCTATGCCGC CTCCCACGAC CTCCAGGAAC CGCTCCGAAT GGTCTCGAGC
TATCTGCAAT TGCTTGAACG GCGCTACGAC GACGATCTCG ACGAGGACGG ACAGGAATTC
CTCGAGTTTG CTGTTGACGG CGCCGACAGG ATGCGGGAGA TGATCGACGG CCTGCTCAAG
TATTCCCGTG TGGAGACGCG TGGCGATCCG TTCGAGACCG TCAGCCTCGA CGATGTCCTC
ACGGAGGTCT GTGACGACCT GCAAGTAATG ATCCAAGAGA GCAACGCTGA GATCACCACT
GAAGACCTCC CCCGCGTGGA GGGCGACCAC GGGCAGTTGC GGCAGGTATT CCAGAACCTG
CTGGACAATG CCATCGAGTA CAGCGGCGAC GACCCACCGC GGATCTACAT CGACGCCGAG
CGCGACGGCG ACCAGTGGCT CGTGTCCGTC GAGGACAACG GCGTCGGTAT CGATCCGGAG
GATACTGACC GCGTCTTCGA GGTCTTCCAG CGCCTCCACG CTCGCGGCGA ACACGCAGGC
ACTGGTATCG GTCTCGCGCT GGTTGAGCGC ATCATTGAGC GCCACGGCGG CGATGTTTGG
GTCGAGTCCG ACCCCGGCCA GGGCTCGACG TTCTCGTTTA CACTACCTGT AGCGAGTGAT
TTCGAGACGT AA
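
The gene sequence above is laid out in rows of six 10-base blocks separated by spaces. A minimal sketch of how such rows might be joined and summarized is shown here. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tool, and only the first and last rows are used as sample input, so the printed values describe those rows rather than the whole gene.

```python
def clean_sequence(text):
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAGCGATC AAGCCGGGAC TACACAGGGG GGATTTTGGA CGGACGCTAC CGGCGAGCGC"
last_row = "TTCGAGACGT AA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

print(len(first))               # 60 bases per full row
print(first.startswith("ATG"))  # True: the CDS begins with a start codon
print(last.endswith("TAA"))     # True: the CDS ends with a stop codon
print(len(last))                # 12 bases in the final, partial row
```

Passing all rows of the gene sequence through `clean_sequence` and then `gc_content` would give a figure to compare with the GC content reported in the Gene Information fields.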
 
Protein sequence
MSDQAGTTQG GFWTDATGER TVDRYSTLVN MVDDGIYQLD AEGRFVAVND AIVSLTGYAR 
EELLGKHAST VIDDEDVSRI QREIYQRFTD GDREGEPLEF TARMADGETI PCELELHLLV
EDGTFQGTIG VVRDIADRKQ TEQEFREHEL ELFRTLLNHS NDSVLVVDPE TGRYLDVNDT
ACERRGYSRE EFLDLTVMDL ETEIPDQEAW RSFVEELRAE GQLTFDGHHR RKDGTTYPVE
VNTSYVDLDK EYVLAIARDV TKRREYERYL EDAKSQLEAA TEAGAVGTWE WHIPEDEMVV
GTSFARTFGI EPEVASEGVP LDQFIEAIHE DDRKRVEVAI EEAVETCGDY EEEYRVWDAD
DNLRWVVARG HVECDDGEPV RFPGALTDIT KRKRAELKLQ QNNEQLETLF EVLPVGVVVA
EADGQIIQAN DIAHEIWGGD VFDAESVEEY EQYPVWDAES GERVQPDEMT LARVVNGEEV
LDPDIYEIEA VDGERRVIRL EGMPVRDEHG EVTRGVATLT DITDRREAQR ALEESERRYR
TLVENFPNGA VGLFDDNLRY TAVGGQLFDR LDYDPEDRIG YRFTELHTPD LVEELEPHFR
AALDGEASTF EIEYQGRYLH AHTLPVRDAD DEVYAGMLVI RDVTERREYE RMLEESNDRL
EQFAYAASHD LQEPLRMVSS YLQLLERRYD DDLDEDGQEF LEFAVDGADR MREMIDGLLK
YSRVETRGDP FETVSLDDVL TEVCDDLQVM IQESNAEITT EDLPRVEGDH GQLRQVFQNL
LDNAIEYSGD DPPRIYIDAE RDGDQWLVSV EDNGVGIDPE DTDRVFEVFQ RLHARGEHAG
TGIGLALVER IIERHGGDVW VESDPGQGST FSFTLPVASD FET