Gene Hhal_1893 details


Gene Information

Locus tag: Hhal_1893
Symbol:
ID: 4710687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2081049
End bp: 2084006
Gene Length: 2958 bp
Protein Length: 985 aa
Translation table: 11
GC content: 69%
IMG OID: 639856366
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001003459
Protein GI: 121998672
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0591] Na+/proline symporter
        [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
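
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. Both are common conventions but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2081049
end_bp = 2084006

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, and the final stop codon is not counted.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2958 985
```

Under those assumptions the result agrees with the listed Gene Length (2958 bp) and Protein Length (985 aa).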


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.426931
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGACCTTTG ACCTGCTCAC CCTCTTCCTG GTGGGCGTCG TCTACCTGGG GCTGCTCTTC 
GTCATCGCCT TCGCCGTGGA CCGGCGCTGG TTCCCGCGCA GCGTGGTCCG CCATCCGGTC
TTCTATGTCT TCGCTCTGGG GGTCTACGCC ACCACCTGGA GCTACTACGG CTCGGTAGGC
TTCGCCGACG AAAACGGCCT GATCTTTGCC ACCATCTACC TGGGCGTCAC CGCGGCCTTC
CTGCTCACGC CGGTGCTCCT CGCCCCCCTG CTGCGCCTGA CCAGCAACCA ACAGCTGACC
TCGGTGGCCG ACGTCTTCGC CTTCCGCTTC TCCAGCCAGT TCGCCGGCAT TCTGGTCACC
GCCATCATGC TCGCCGGCAT CCTGCCGTAC ATCGCCCTGC AGATCCGCGC CGTGGCCGAG
TCGACCCTGG TCCTGACCGG CGGGGGCCAC GATCCTCGGC CGCTGGCCCT GGCCTTCTGC
CTGATCGTGA CGGTGTTCGC CATCATCTTC GGTGCCCGGC ACGTCACCCC GCGGGTCAAG
CACACCGGGC TGGTCGTCGC TATCGGCCTG GAGAGCCTGG TCAAACTGGG TGCCCTGCTG
ATGGTGGCCT GGGTCGCCGT CGACCTAGCC TTCGACGGCC CGGCGGGGCT GTCGGCGTGG
CTCAGCGAGA ACCCGCAACA GCTGGACGAC TTCTACCAGC CGGCGCTGGA GGGCCCGTGG
CTCAGCCTGC TGACCCTGAC CTTCGCCGCC GCCTTCCTGC TGCCACGCCA GTTCCACATC
CTGTTCACCG AGAACCTCTC GCCGCGCAGC CTCACCCGGG CCAGCTGGGC CTTCCCGCTG
ATCCTCCTGC TGCTGGCCCT GGCCGTCCCG CCGATCCTCT GGTCCGGCAC GGCGCTGAAC
ACGGACACCG CCCCGGAGTA CTACGTGCTG GGACTGGCGG CCCTGTCCGG CTCGGAGCCG
CTGACCCTGC TGGTCTATCT GGGCGGCATC TCGGCGGCGA CAGCGATGAT TGTCGTCTCG
ACCCTGGCGC TGTCGTCGAT GACGCTCAAC CACCTGGTGA TGCCGCTGAT CAAGTCGCTG
CCGCACCAGG AGCCGGACCT TTACGCCACG CTGCGCTGGA CCCGGCGATT CCTGATCACC
ATGCTCATCG TCTTCGGGTT CACCTTCTAC GAGCTGCTCG ACCGCACCGA GGGCCTGGTT
GAGTGGGGGC TGATGTCCTT CCTGGCCATG GCCCAGTTTC TGCCCGGGAT CATCGGCGTC
CTCTACTGGC AGCGCGCCAC CGTCTACGGC TTTGTGGCGG GCCTGCTCGG CGGCTCGCTG
ATCTGGCTGG ACACCATCCT GCTGCCGGCG CTGGCCGGCA CCGAGCCGTT CTTCCTGCTC
GGCTTCCCCC AGGCCCGGGA GAGCGCGACC GAAATCTACG GCATCGCCAC CTTCTGGTCC
CTGGCCCTGA ACGGCCTGCT GTTCGTCGGG GTCTCGCTGT TCACCCCCCA GCGCTCCGCG
GAGCGGCAGG CCGCCGAGGT CTGTCGGGAT CAGACGGCAG CCATCGCGCC GGGGACGCTG
CAGGCCGCCT CGCCGTCGCA GTTCGTGATC CAGCTGGCGC CGGTCACCGG TGAAGAGGCG
GCCCGCCAGG AGGTGGACAA GGCCCTGCAC GACCTGGGGC TGGCCTGGAG CGAGAACCGT
CCCGACCAGC TCCGCGCCCT GCGCGATCAG GTCGAGCGCA ATCTCTCCGG CATGATGGGA
CCGATGCTCG CGCGCATGAT CGTCGACAAG CGGCTGCAGC TCGACCGCAC GGCCCGTACC
GCGCTGGCCC ACAACATCCG CCAGATCGAG GAGCGCCTGG AGTCCTCGCG CAACCGCTTC
CGGGGCCTCG CCGCGGAACT CGACCGGCTG CGCCGCTACC ACCGCCAGAT CCTCGAGGAC
CTCCCCCTGG GCGTCGTGGC GGTGACCGCC CACGGCCGCG TGGTGCGCTG GAACACCGCC
ATGCAGGGCC TGTCCGGGAT CTCGGCGAAC ACCGCGCTGG GCTCGCGGAT CTGTGACCTC
CATCCCCCGT GGGATCAGCT GCTGAACCGG TTCCTGACCA TCGAGCAACC GCAGCACCAG
GAGCAGTTCC CGCAGCCCGG CGGCGAGCAG CGCTGGCTGT CGCTGCACAA GACGTGCATC
CGCGAGTCCG GCCGCGGCCG GACCGGCGAC ACCCTGATGA TCATCGAGGA CATCACCCAC
ATCCGCCGCC TGGAGCGCGA GCTGGCCCAC AGCGAACGAC TGGCCTCCAT CGGCCGCCTG
GCTGCGGGCG TGGCCCACGA GATCGGCAAC CCGGTCACCG GCATCGACTC CCTGGCGCAG
AACCTGCGCC ACGAGTCCGA TCCACAGCTC CTGCGCGAGA GCGTCGACGA GATCCTGGAA
CAGACGCGGC GGATCAATAA CATCGTGCAG ACGCTGATCG GCTACGCCCA CGCCGGCAAC
ACCGACGAGC GCACGCCGGA CCCGGTGCCG CTCAGCGAGA CCGTCGAGGA GGCGCGCCGG
CTGGTCCAGT TAAGCAAGCG TGGTCAGGAC CTGGAGATCG ACAACCAGCT CTGGCCCGAA
CTCCAGGTCC GCGGTGATCG GCAACGGCTG GCCCAGGTCT TCGTCAACCT CTTCTCGAAC
ACCGCCGATG CCTGCGGCCC CGGCGGACGG ATCGCGATCT CGGCCCGCCG CTACGGCGAT
CGCGTGCAGA TCCGGGTGGT GGACAACGGC CCGGGCATCC CGGCCGAGCT GGTCGACAAG
GTGATGGAGC CGTTCTACAC CACCAAGCCG GTGGGCCAGG GCACCGGACT CGGGCTGCCG
CTGGTCTACA ACATCGTCAG TGAACAGGGG GGCGAGTTTT CGATCACGGC CGACAACGGG
GGCACCACAG CCTGGATCAC CCTGCCGGTA CTCGACCCCG CGCCGCCGAC CGAGCGCGCC
GAGAGCAAGG AGGAGTAA
 
Protein sequence
MTFDLLTLFL VGVVYLGLLF VIAFAVDRRW FPRSVVRHPV FYVFALGVYA TTWSYYGSVG 
FADENGLIFA TIYLGVTAAF LLTPVLLAPL LRLTSNQQLT SVADVFAFRF SSQFAGILVT
AIMLAGILPY IALQIRAVAE STLVLTGGGH DPRPLALAFC LIVTVFAIIF GARHVTPRVK
HTGLVVAIGL ESLVKLGALL MVAWVAVDLA FDGPAGLSAW LSENPQQLDD FYQPALEGPW
LSLLTLTFAA AFLLPRQFHI LFTENLSPRS LTRASWAFPL ILLLLALAVP PILWSGTALN
TDTAPEYYVL GLAALSGSEP LTLLVYLGGI SAATAMIVVS TLALSSMTLN HLVMPLIKSL
PHQEPDLYAT LRWTRRFLIT MLIVFGFTFY ELLDRTEGLV EWGLMSFLAM AQFLPGIIGV
LYWQRATVYG FVAGLLGGSL IWLDTILLPA LAGTEPFFLL GFPQARESAT EIYGIATFWS
LALNGLLFVG VSLFTPQRSA ERQAAEVCRD QTAAIAPGTL QAASPSQFVI QLAPVTGEEA
ARQEVDKALH DLGLAWSENR PDQLRALRDQ VERNLSGMMG PMLARMIVDK RLQLDRTART
ALAHNIRQIE ERLESSRNRF RGLAAELDRL RRYHRQILED LPLGVVAVTA HGRVVRWNTA
MQGLSGISAN TALGSRICDL HPPWDQLLNR FLTIEQPQHQ EQFPQPGGEQ RWLSLHKTCI
RESGRGRTGD TLMIIEDITH IRRLERELAH SERLASIGRL AAGVAHEIGN PVTGIDSLAQ
NLRHESDPQL LRESVDEILE QTRRINNIVQ TLIGYAHAGN TDERTPDPVP LSETVEEARR
LVQLSKRGQD LEIDNQLWPE LQVRGDRQRL AQVFVNLFSN TADACGPGGR IAISARRYGD
RVQIRVVDNG PGIPAELVDK VMEPFYTTKP VGQGTGLGLP LVYNIVSEQG GEFSITADNG
GTTAWITLPV LDPAPPTERA ESKEE