Gene Information |
Locus tag | Huta_2351 |
Symbol | |
ID | 8384650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2394201 |
End bp | 2397176 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973424 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_003131250 |
Protein GI | 257053417 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
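
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2394201
end_bp = 2397176

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2976
print(protein_length)  # 991
```

Under these assumptions the computed values agree with the Gene Length (2976 bp) and Protein Length (991 aa) fields.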
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.248597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGTCG AAGGGTCTGG TGAGATCACT GTCCTGCACG TCGACGACGA CCCGGACCTC GCGGCGGTCG CCGGCGAGTA CGTGGCGCGT GAGGACGACC GGATATCCGT CGACGTTGCG ACCAGCGCCG ACGAGGGCCT CACGATGCTC GCCGACGCGT CGATCGACTG TATCGTCTCG GACTACGACA TGCCCGGCCA GGACGGCATC GAGTTCCTCG AAACAGTCCG CGAGGCACAC CCGGACCTTC CGTTTATCCT GTACACGGGG AAGGGATCAG AGGAAGTCGC CAGCGACGCG ATCGGCGCTG GCGTGACGGA TTACCTCCAG AAGGAGTCCG GCCCCGACCA CTACCGACTC CTGGCCAACC GGATCGTGAA CGCGGTCGAG CGATACCGGG CGGGCGAGCG CGTCGAGCGG GCCGAGCAGC GCTATCGATC GCTGTTCGAG GAGATGAACG AGGGGGCGGC GTTGCACGAA CTCGTCTACG AGGCGGGCGA GCCGGTCGGG TACGAGATCA TCGACGTCAA CCGGAACTTC GAGGCGATCC TCGACATTCC GGCCGAGCAG GCCATCGGCC AGCGTGCGGT CGACGTCTAC GACGTCGAGG AGCCGCCGTT CCTCGACCGG TACGCTCAGG TGGCCGAGAC GGGCGAGTCG ATGGAGTTCG AGACCTCCTT TCGGCCGCTG GAGATGCACT TCCACGTCTC GGTGTTTGCA CCGAGAGAAG GCCAGTTCGC GACCGTGTTC TCGGACGTGT CCGACCAGCG AGAGACCGAA CAGCACCTCC GCCACGAGCG GGCGTTGTAC CACGCCCAGA GCGAGGCCAC CCTCGATGGG TATCTCGTCG TCGACGAGGA GCGCCGCATC GCCTCGTACA ACTCCCGGCT GCTCGAGTTG TGGGACATCC CCGAGGATCT CATCGAGAGT CGAGACGACG AGGCGGTCCT CGATCACGTG GTCGAGAAAA CGGTCGATCC CGATGAGTTC CGCGAGGTGG TCGAGTCGCT CTACGACCAG CCAAACGCCG AGAGCAGAGA CGAGATCGAA CTGGCCGACG GGCGCTGGTT CGACCGGTAC TCGACGCCGG TCGTCGGCGA GGACGGAACC CGCTACGGCC GACTGTGGGT CTTTCGGGAC GTCACCCAAC GCAAGGAACG CGAACGCGAG CTGACCCGAC TCTCGGAACG ACTCGAACTC GCCGTCGAAG GGGCGAATCT GGGAGTCTGG GACTGGGATA TGACCACTGA CGCGGTCGAG TTCAACGAGC AGTGGGCCGA GATGCTCGGC CACTCGGTAT CGGAGATCGA ACCACACCTC GACGCCTGGG AACGGCGGGT TCACCCTGAC GACCTCCCGG CGGTCGAGGC GGCACTCGAC GCCCACATCG AGGATGAAAC ACCGCTGTAC GACACCGAAC ACCGGATGCG GACGGCCGAG GGCGACTGGA AGTGGATCAG GGACGTCGGC CGGGTCGTGG ATCGCGGCGG GGACGGGGAA CCACGTCGCG CGGTCGGTAT CCACATCGAC ATCGACGACC GGAAGCGACG CGAACGGCAA CTGCAGTTGT TCCGAAAGGC GGTCGAACAG ACCGCCCACG CCGTCTACGT TACCGACGCC GACGGGACGA TCGAGTACGT CAACCCGGCC TTCGAGGACG TGACCGGGTA TCCCGAACAA GAGGCGCTGG GCAGCGATCC ACACATCCTC CAGTCCGGGG AGTACGACGA GGACTACTAC GAGGCGTTCT GGGAAACGAT CACTGACGGC GAGCGCTGGC GCAAGGAGAT GATCGACCGA 
GACGCCGACG GCGAGCGGAT CGTCCTCGAA CAGTCGATCG CGCCGATCAC GGACGCGGAC GGTGACCCGG AGAAGTTCGT CGCCGTCGCC CAGGACGTCA CCGAGCGCAA GGAGGCCGAA CGCGACCTCG AGCGCGCTCG CGAGGAACTC CGGCAGGTCA TCGATCTCGT GCCGGATCTC ATCTTCGCGA AGGACCGCGA GGGACGGTAC TTGCTGGCCA ACGAGGCGAC TGCCGAGGCG TACGGGCTCT CGCCCGAGGA TGTCGAGGGC GAGCTCGAAT CGAACGTCAT CCCGGACGTG GAGGATTCCG AAGCGTTCCG CGAGGACGAC CTCGCCGTGA TCGAATCGGG CGAGCGCCAG GTGATCCCCG AAGAAGAACT GACGACCGCC GACGGCGAGA CCCGGATTCT GGAGACGACG AAGATCCCCT ACGAGGTCTC GGGCAGCGGC GAGGACGCCG TCCTCGGGTA CGGGCGGGAC ATCACGGATC TCAAAGAGTA CGAACGGGAA CTCGAGCGCC AGCGGGACAA CCTCGAAGTG CTCAATCAGG TCGTCCGCCA CGACATTCGC AACGAACTGC AACTCGTCGA GGCCTACGCG GACCTGCTCC AGCGACACGT CGATGGCGTG GAGGAAAACT ACGCAAACAG AGTTCTCAGA GCCGCCCGTG GTGCCGCCGA CATCCTCGAG ACTGCCCGAG ACGTGACGGA TATCATGCTC CAGGCCGACG CCGATCAGCA GCCGGTCGAC CTCGCGAGCA CCCTCCGGAA CGAGGTCGAG GACCTCCGAT CGCAGTACGA ACGGGTGGCC GTGACCGTCG AGGGGGCGAT TCCGGACGTT TCAGTCCGCG CCGACGACAT GCTCGCGTCC GTCTTCCGGA ACCTGCTGAC CAACGCCGTC CAGCACAACG ACAGCGAGAG CCCGGACGTG ACTGTCGCCG CCGACACCGA CGGGGAGCGC GTGACGGTTC GGATCGCCGA CAACGGGCCG GGGATCCCAG AGGAACGCCG GGAACGTATC TTCCAGCAGG GGGAAACCGC CCTCAACAGC GACGGGACGG GACTCGGGCT GTATCTCGTC GCGACGCTTG TCGAGCGCTA CGGCGGGACG GTCGCCGTCG AGGACAACGA TCCCACCGGA GCCGTGTTCG TCGTTGAACT GCCGATGGCA GCGTGA
|
Protein sequence | MPVEGSGEIT VLHVDDDPDL AAVAGEYVAR EDDRISVDVA TSADEGLTML ADASIDCIVS DYDMPGQDGI EFLETVREAH PDLPFILYTG KGSEEVASDA IGAGVTDYLQ KESGPDHYRL LANRIVNAVE RYRAGERVER AEQRYRSLFE EMNEGAALHE LVYEAGEPVG YEIIDVNRNF EAILDIPAEQ AIGQRAVDVY DVEEPPFLDR YAQVAETGES MEFETSFRPL EMHFHVSVFA PREGQFATVF SDVSDQRETE QHLRHERALY HAQSEATLDG YLVVDEERRI ASYNSRLLEL WDIPEDLIES RDDEAVLDHV VEKTVDPDEF REVVESLYDQ PNAESRDEIE LADGRWFDRY STPVVGEDGT RYGRLWVFRD VTQRKERERE LTRLSERLEL AVEGANLGVW DWDMTTDAVE FNEQWAEMLG HSVSEIEPHL DAWERRVHPD DLPAVEAALD AHIEDETPLY DTEHRMRTAE GDWKWIRDVG RVVDRGGDGE PRRAVGIHID IDDRKRRERQ LQLFRKAVEQ TAHAVYVTDA DGTIEYVNPA FEDVTGYPEQ EALGSDPHIL QSGEYDEDYY EAFWETITDG ERWRKEMIDR DADGERIVLE QSIAPITDAD GDPEKFVAVA QDVTERKEAE RDLERAREEL RQVIDLVPDL IFAKDREGRY LLANEATAEA YGLSPEDVEG ELESNVIPDV EDSEAFREDD LAVIESGERQ VIPEEELTTA DGETRILETT KIPYEVSGSG EDAVLGYGRD ITDLKEYERE LERQRDNLEV LNQVVRHDIR NELQLVEAYA DLLQRHVDGV EENYANRVLR AARGAADILE TARDVTDIML QADADQQPVD LASTLRNEVE DLRSQYERVA VTVEGAIPDV SVRADDMLAS VFRNLLTNAV QHNDSESPDV TVAADTDGER VTVRIADNGP GIPEERRERI FQQGETALNS DGTGLGLYLV ATLVERYGGT VAVEDNDPTG AVFVVELPMA A
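
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample string is only the first two and last two blocks of the gene sequence, so its GC percentage is not expected to match the 65% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Short sample only: first two and last two blocks of the gene sequence above.
sample = "ATGCCGGTCG AAGGGTCTGG\nGCCGATGGCA GCGTGA"
dna = clean_sequence(sample)

print(len(dna))             # number of bases after removing whitespace
print(dna[:3], dna[-3:])    # first and last codon of the sample
print(round(gc_percent(dna), 1))
```

Pasting the full gene sequence into `sample` would give a value that can be compared with the GC content field in the Gene Information table.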
|
| |