Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1893 |
Symbol | |
ID | 4710687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2081049 |
End bp | 2084006 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639856366 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001003459 |
Protein GI | 121998672 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0591] Na+/proline symporter [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.426931 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCTTTG ACCTGCTCAC CCTCTTCCTG GTGGGCGTCG TCTACCTGGG GCTGCTCTTC GTCATCGCCT TCGCCGTGGA CCGGCGCTGG TTCCCGCGCA GCGTGGTCCG CCATCCGGTC TTCTATGTCT TCGCTCTGGG GGTCTACGCC ACCACCTGGA GCTACTACGG CTCGGTAGGC TTCGCCGACG AAAACGGCCT GATCTTTGCC ACCATCTACC TGGGCGTCAC CGCGGCCTTC CTGCTCACGC CGGTGCTCCT CGCCCCCCTG CTGCGCCTGA CCAGCAACCA ACAGCTGACC TCGGTGGCCG ACGTCTTCGC CTTCCGCTTC TCCAGCCAGT TCGCCGGCAT TCTGGTCACC GCCATCATGC TCGCCGGCAT CCTGCCGTAC ATCGCCCTGC AGATCCGCGC CGTGGCCGAG TCGACCCTGG TCCTGACCGG CGGGGGCCAC GATCCTCGGC CGCTGGCCCT GGCCTTCTGC CTGATCGTGA CGGTGTTCGC CATCATCTTC GGTGCCCGGC ACGTCACCCC GCGGGTCAAG CACACCGGGC TGGTCGTCGC TATCGGCCTG GAGAGCCTGG TCAAACTGGG TGCCCTGCTG ATGGTGGCCT GGGTCGCCGT CGACCTAGCC TTCGACGGCC CGGCGGGGCT GTCGGCGTGG CTCAGCGAGA ACCCGCAACA GCTGGACGAC TTCTACCAGC CGGCGCTGGA GGGCCCGTGG CTCAGCCTGC TGACCCTGAC CTTCGCCGCC GCCTTCCTGC TGCCACGCCA GTTCCACATC CTGTTCACCG AGAACCTCTC GCCGCGCAGC CTCACCCGGG CCAGCTGGGC CTTCCCGCTG ATCCTCCTGC TGCTGGCCCT GGCCGTCCCG CCGATCCTCT GGTCCGGCAC GGCGCTGAAC ACGGACACCG CCCCGGAGTA CTACGTGCTG GGACTGGCGG CCCTGTCCGG CTCGGAGCCG CTGACCCTGC TGGTCTATCT GGGCGGCATC TCGGCGGCGA CAGCGATGAT TGTCGTCTCG ACCCTGGCGC TGTCGTCGAT GACGCTCAAC CACCTGGTGA TGCCGCTGAT CAAGTCGCTG CCGCACCAGG AGCCGGACCT TTACGCCACG CTGCGCTGGA CCCGGCGATT CCTGATCACC ATGCTCATCG TCTTCGGGTT CACCTTCTAC GAGCTGCTCG ACCGCACCGA GGGCCTGGTT GAGTGGGGGC TGATGTCCTT CCTGGCCATG GCCCAGTTTC TGCCCGGGAT CATCGGCGTC CTCTACTGGC AGCGCGCCAC CGTCTACGGC TTTGTGGCGG GCCTGCTCGG CGGCTCGCTG ATCTGGCTGG ACACCATCCT GCTGCCGGCG CTGGCCGGCA CCGAGCCGTT CTTCCTGCTC GGCTTCCCCC AGGCCCGGGA GAGCGCGACC GAAATCTACG GCATCGCCAC CTTCTGGTCC CTGGCCCTGA ACGGCCTGCT GTTCGTCGGG GTCTCGCTGT TCACCCCCCA GCGCTCCGCG GAGCGGCAGG CCGCCGAGGT CTGTCGGGAT CAGACGGCAG CCATCGCGCC GGGGACGCTG CAGGCCGCCT CGCCGTCGCA GTTCGTGATC CAGCTGGCGC CGGTCACCGG TGAAGAGGCG GCCCGCCAGG AGGTGGACAA GGCCCTGCAC GACCTGGGGC TGGCCTGGAG CGAGAACCGT CCCGACCAGC TCCGCGCCCT GCGCGATCAG GTCGAGCGCA ATCTCTCCGG CATGATGGGA CCGATGCTCG CGCGCATGAT CGTCGACAAG CGGCTGCAGC TCGACCGCAC GGCCCGTACC GCGCTGGCCC ACAACATCCG CCAGATCGAG GAGCGCCTGG AGTCCTCGCG CAACCGCTTC CGGGGCCTCG CCGCGGAACT CGACCGGCTG CGCCGCTACC ACCGCCAGAT CCTCGAGGAC CTCCCCCTGG GCGTCGTGGC GGTGACCGCC CACGGCCGCG TGGTGCGCTG GAACACCGCC ATGCAGGGCC TGTCCGGGAT CTCGGCGAAC ACCGCGCTGG GCTCGCGGAT CTGTGACCTC CATCCCCCGT GGGATCAGCT GCTGAACCGG TTCCTGACCA TCGAGCAACC GCAGCACCAG GAGCAGTTCC CGCAGCCCGG CGGCGAGCAG CGCTGGCTGT CGCTGCACAA GACGTGCATC CGCGAGTCCG GCCGCGGCCG GACCGGCGAC ACCCTGATGA TCATCGAGGA CATCACCCAC ATCCGCCGCC TGGAGCGCGA GCTGGCCCAC AGCGAACGAC TGGCCTCCAT CGGCCGCCTG GCTGCGGGCG TGGCCCACGA GATCGGCAAC CCGGTCACCG GCATCGACTC CCTGGCGCAG AACCTGCGCC ACGAGTCCGA TCCACAGCTC CTGCGCGAGA GCGTCGACGA GATCCTGGAA CAGACGCGGC GGATCAATAA CATCGTGCAG ACGCTGATCG GCTACGCCCA CGCCGGCAAC ACCGACGAGC GCACGCCGGA CCCGGTGCCG CTCAGCGAGA CCGTCGAGGA GGCGCGCCGG CTGGTCCAGT TAAGCAAGCG TGGTCAGGAC CTGGAGATCG ACAACCAGCT CTGGCCCGAA CTCCAGGTCC GCGGTGATCG GCAACGGCTG GCCCAGGTCT TCGTCAACCT CTTCTCGAAC ACCGCCGATG CCTGCGGCCC CGGCGGACGG ATCGCGATCT CGGCCCGCCG CTACGGCGAT CGCGTGCAGA TCCGGGTGGT GGACAACGGC CCGGGCATCC CGGCCGAGCT GGTCGACAAG GTGATGGAGC CGTTCTACAC CACCAAGCCG GTGGGCCAGG GCACCGGACT CGGGCTGCCG CTGGTCTACA ACATCGTCAG TGAACAGGGG GGCGAGTTTT CGATCACGGC CGACAACGGG GGCACCACAG CCTGGATCAC CCTGCCGGTA CTCGACCCCG CGCCGCCGAC CGAGCGCGCC GAGAGCAAGG AGGAGTAA
|
Protein sequence | MTFDLLTLFL VGVVYLGLLF VIAFAVDRRW FPRSVVRHPV FYVFALGVYA TTWSYYGSVG FADENGLIFA TIYLGVTAAF LLTPVLLAPL LRLTSNQQLT SVADVFAFRF SSQFAGILVT AIMLAGILPY IALQIRAVAE STLVLTGGGH DPRPLALAFC LIVTVFAIIF GARHVTPRVK HTGLVVAIGL ESLVKLGALL MVAWVAVDLA FDGPAGLSAW LSENPQQLDD FYQPALEGPW LSLLTLTFAA AFLLPRQFHI LFTENLSPRS LTRASWAFPL ILLLLALAVP PILWSGTALN TDTAPEYYVL GLAALSGSEP LTLLVYLGGI SAATAMIVVS TLALSSMTLN HLVMPLIKSL PHQEPDLYAT LRWTRRFLIT MLIVFGFTFY ELLDRTEGLV EWGLMSFLAM AQFLPGIIGV LYWQRATVYG FVAGLLGGSL IWLDTILLPA LAGTEPFFLL GFPQARESAT EIYGIATFWS LALNGLLFVG VSLFTPQRSA ERQAAEVCRD QTAAIAPGTL QAASPSQFVI QLAPVTGEEA ARQEVDKALH DLGLAWSENR PDQLRALRDQ VERNLSGMMG PMLARMIVDK RLQLDRTART ALAHNIRQIE ERLESSRNRF RGLAAELDRL RRYHRQILED LPLGVVAVTA HGRVVRWNTA MQGLSGISAN TALGSRICDL HPPWDQLLNR FLTIEQPQHQ EQFPQPGGEQ RWLSLHKTCI RESGRGRTGD TLMIIEDITH IRRLERELAH SERLASIGRL AAGVAHEIGN PVTGIDSLAQ NLRHESDPQL LRESVDEILE QTRRINNIVQ TLIGYAHAGN TDERTPDPVP LSETVEEARR LVQLSKRGQD LEIDNQLWPE LQVRGDRQRL AQVFVNLFSN TADACGPGGR IAISARRYGD RVQIRVVDNG PGIPAELVDK VMEPFYTTKP VGQGTGLGLP LVYNIVSEQG GEFSITADNG GTTAWITLPV LDPAPPTERA ESKEE
|
| |