Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4049 |
Symbol | |
ID | 8744677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 302969 |
End bp | 305620 |
Gene Length | 2652 bp |
Protein Length | 883 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 646514615 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003405562 |
Protein GI | 284167284 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATC AAGCCGGGAC TACACAGGGG GGATTTTGGA CGGACGCTAC CGGCGAGCGC ACAGTCGATC GGTATTCTAC ACTTGTCAAT ATGGTTGACG ATGGCATCTA CCAGCTTGAC GCCGAGGGGC GGTTCGTCGC GGTCAACGAT GCCATAGTCA GTTTGACTGG CTACGCCCGG GAAGAGCTTC TCGGCAAGCA CGCTTCAACC GTGATTGACG ACGAGGACGT CAGTCGCATC CAGCGCGAGA TCTACCAGCG ATTTACAGAC GGCGACCGCG AGGGCGAGCC GCTCGAGTTC ACCGCCCGGA TGGCCGACGG CGAGACCATC CCATGCGAAT TGGAGCTACA CCTGCTCGTC GAGGATGGGA CGTTCCAGGG AACCATCGGG GTCGTGCGCG ACATCGCGGA CCGCAAACAG ACAGAACAGG AGTTCCGTGA GCACGAACTC GAGCTGTTCC GTACCCTGCT TAATCACTCG AACGACAGCG TACTGGTAGT GGATCCGGAG ACGGGCCGCT ATCTCGACGT CAACGACACT GCCTGTGAGC GGCGGGGATA CTCGCGTGAA GAATTCCTTG ATCTCACGGT CATGGATCTC GAAACCGAGA TTCCCGACCA AGAGGCGTGG CGGTCGTTCG TTGAGGAACT ACGTGCCGAG GGGCAACTGA CCTTCGACGG ACACCACCGC CGCAAGGACG GTACGACGTA TCCGGTTGAG GTCAATACTT CCTACGTTGA TCTGGATAAG GAGTATGTCC TCGCCATTGC TCGCGACGTT ACCAAGCGCC GGGAGTACGA ACGGTACCTC GAAGATGCTA AGTCACAGTT AGAGGCAGCA ACAGAAGCCG GCGCGGTTGG GACCTGGGAA TGGCATATCC CCGAGGACGA GATGGTCGTC GGTACGTCGT TCGCTCGGAC GTTCGGCATC GAGCCGGAGG TAGCCTCCGA GGGTGTGCCC CTCGATCAAT TTATCGAAGC CATCCATGAG GATGACCGCA AGCGAGTCGA GGTGGCGATC GAGGAGGCCG TCGAGACCTG TGGCGACTAC GAGGAAGAAT ACCGCGTGTG GGATGCTGAC GACAATCTCC GGTGGGTTGT CGCTCGCGGG CACGTCGAGT GCGACGATGG CGAGCCAGTC CGCTTCCCGG GGGCACTCAC CGACATCACC AAACGCAAAC GCGCCGAATT AAAACTCCAG CAGAACAACG AACAGCTGGA GACCCTCTTC GAAGTCCTTC CTGTCGGTGT CGTGGTCGCG GAGGCCGACG GGCAAATCAT CCAGGCCAAC GACATTGCAC ACGAGATCTG GGGCGGTGAT GTCTTCGACG CTGAGTCCGT CGAGGAGTAC GAGCAGTACC CGGTCTGGGA TGCTGAGTCC GGCGAGCGCG TCCAGCCAGA CGAGATGACG CTCGCGAGGG TCGTCAACGG CGAGGAGGTG CTCGACCCGG ACATCTACGA GATCGAGGCA GTCGACGGCG AGCGCCGAGT CATCCGGCTG GAGGGGATGC CCGTTCGTGA CGAACATGGC GAGGTGACCC GCGGCGTGGC TACTCTAACC GACATCACCG ACCGCCGTGA GGCCCAACGT GCGCTTGAGG AGTCCGAGCG CCGCTACCGA ACGCTGGTCG AAAACTTCCC GAACGGAGCA GTAGGCCTCT TCGATGATAA CCTACGATAC ACCGCTGTTG GCGGTCAGCT CTTCGACAGA CTCGACTACG ACCCGGAGGA TCGGATTGGA TACCGCTTTA CCGAACTCCA CACTCCGGAT CTGGTTGAGG AGTTAGAACC GCACTTTCGG GCCGCACTCG ATGGTGAGGC AAGCACCTTC GAAATCGAAT ACCAGGGCCG GTACCTGCAC GCCCATACAC TCCCTGTCAG AGACGCCGAC GACGAGGTGT ACGCGGGCAT GCTCGTGATC CGAGACGTGA CTGAGCGACG GGAATACGAG CGTATGCTGG AGGAATCGAA CGACCGCCTC GAACAGTTCG CCTATGCCGC CTCCCACGAC CTCCAGGAAC CGCTCCGAAT GGTCTCGAGC TATCTGCAAT TGCTTGAACG GCGCTACGAC GACGATCTCG ACGAGGACGG ACAGGAATTC CTCGAGTTTG CTGTTGACGG CGCCGACAGG ATGCGGGAGA TGATCGACGG CCTGCTCAAG TATTCCCGTG TGGAGACGCG TGGCGATCCG TTCGAGACCG TCAGCCTCGA CGATGTCCTC ACGGAGGTCT GTGACGACCT GCAAGTAATG ATCCAAGAGA GCAACGCTGA GATCACCACT GAAGACCTCC CCCGCGTGGA GGGCGACCAC GGGCAGTTGC GGCAGGTATT CCAGAACCTG CTGGACAATG CCATCGAGTA CAGCGGCGAC GACCCACCGC GGATCTACAT CGACGCCGAG CGCGACGGCG ACCAGTGGCT CGTGTCCGTC GAGGACAACG GCGTCGGTAT CGATCCGGAG GATACTGACC GCGTCTTCGA GGTCTTCCAG CGCCTCCACG CTCGCGGCGA ACACGCAGGC ACTGGTATCG GTCTCGCGCT GGTTGAGCGC ATCATTGAGC GCCACGGCGG CGATGTTTGG GTCGAGTCCG ACCCCGGCCA GGGCTCGACG TTCTCGTTTA CACTACCTGT AGCGAGTGAT TTCGAGACGT AA
|
Protein sequence | MSDQAGTTQG GFWTDATGER TVDRYSTLVN MVDDGIYQLD AEGRFVAVND AIVSLTGYAR EELLGKHAST VIDDEDVSRI QREIYQRFTD GDREGEPLEF TARMADGETI PCELELHLLV EDGTFQGTIG VVRDIADRKQ TEQEFREHEL ELFRTLLNHS NDSVLVVDPE TGRYLDVNDT ACERRGYSRE EFLDLTVMDL ETEIPDQEAW RSFVEELRAE GQLTFDGHHR RKDGTTYPVE VNTSYVDLDK EYVLAIARDV TKRREYERYL EDAKSQLEAA TEAGAVGTWE WHIPEDEMVV GTSFARTFGI EPEVASEGVP LDQFIEAIHE DDRKRVEVAI EEAVETCGDY EEEYRVWDAD DNLRWVVARG HVECDDGEPV RFPGALTDIT KRKRAELKLQ QNNEQLETLF EVLPVGVVVA EADGQIIQAN DIAHEIWGGD VFDAESVEEY EQYPVWDAES GERVQPDEMT LARVVNGEEV LDPDIYEIEA VDGERRVIRL EGMPVRDEHG EVTRGVATLT DITDRREAQR ALEESERRYR TLVENFPNGA VGLFDDNLRY TAVGGQLFDR LDYDPEDRIG YRFTELHTPD LVEELEPHFR AALDGEASTF EIEYQGRYLH AHTLPVRDAD DEVYAGMLVI RDVTERREYE RMLEESNDRL EQFAYAASHD LQEPLRMVSS YLQLLERRYD DDLDEDGQEF LEFAVDGADR MREMIDGLLK YSRVETRGDP FETVSLDDVL TEVCDDLQVM IQESNAEITT EDLPRVEGDH GQLRQVFQNL LDNAIEYSGD DPPRIYIDAE RDGDQWLVSV EDNGVGIDPE DTDRVFEVFQ RLHARGEHAG TGIGLALVER IIERHGGDVW VESDPGQGST FSFTLPVASD FET
|
| |