Gene Hhal_0474 details

Gene Information

Locus tag: Hhal_0474
Symbol:
ID: 4711473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 542169
End bp: 544388
Gene Length: 2220 bp
Protein Length: 739 aa
Translation table: 11
GC content: 69%
IMG OID: 639854933
Product: CheA signal transduction histidine kinases
Protein accession: YP_001002064
Protein GI: 121997277
COG category: [N] Cell motility
              [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases
TIGRFAM ID:
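
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 542169
END_BP = 544388

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 2220
print(expected_protein_length(length_bp))  # 739
```

Both results match the Gene Length and Protein Length fields of the record.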


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCATCG ACCTCGACGA CGACATCGTG CAGGACTTCC TGGTCGAGTC TCGGGAGATC 
CTCGACCGGC TGGGCGAGCA GCTGGTGGAC CTGGAGCAGC ACGGCGAGGA TAAGGACCTG
CTCAACTCGG TCTTCCGCGG CTTCCACACC ATCAAGGGCG GCGCCGGGTT CCTGGGGCTC
GACCCGCTGG TGGACGTCTG CCACCGCACC GAGGACGTCT TCAACCTGCT GCGTAATGGC
GACAAGCAGG TCTCCCAGGA ACTGATGGAC TCCGTCCTGC GCGCCCTCGA TGTGGTGGTG
GCGCAGTTCG GACAGATCGA GAGCGGCAGC GACCCGGAGC CCGCCCCGGC GGAGCTGCTC
CAGGAGCTCG ACCGCCTCAA GAAGCCGGAT GCCGGCGGCT CTGCGCCGCC GGCTGCCGGT
GCCGAGCCGG CGCCGGCCGC CGCACAGCCG GCAACGCCGG GGCAGGGCGG CGATGGGAGT
GACGCCGAGT TCGACGAGCT GCTGACAGCC CTCGATGAGG GGGATGAGGC CTCCGTCGAT
ACCGGCGGAG ACGAGATCTC CGACGCCGAG TTCGAGCAGC TCCTCGACGA ACTGCACGGC
AAGGGGCAGC ACCAGGGCAA GCCGGCGGCC GAGCAGCCGG CCGCCGACGC CCCCGCCGGG
TCGGACTCGG GTGGGGGCGG TGAGGAGATT ACCGAGGAGG AGTTCGAGCA GCTGCTCGAC
AGTCTCTACG GCCCCGGCGC CGCGCCGGGC AAGGATGCGC CGCCGCCGGC GGTGGACGGC
GCGCAGGAGA GTACCCCGGA GCCGGCCCCG CCCCAGCCGT CGCCGTCTAG TCCATCCGCC
CCCGAACAGC CGGCGGCCGA GTCGCCGCAG CCTGCCACGG GCCAGGGGAG CGGTGGTGCC
GAGAAGCCCG CCCGGCAACC CGCCGCCGGT GGCGGTGGCA ACAAGGGCGG CGGGGGTGGC
GGCGCCGCCG ATAAAGGTGG CGCCGCCGGT GCCGATGCCG GCGGTGGCGG CAACAAGGTC
GAGGCCTCGG TGCGGGTGGA TACCGCCAAG CTCGACGAGA TCATGAACCT GGTCGGCGAG
CTGGTGCTGG TGCGCAACCG CCTCTCCAAT CTGCGCAACG AGGTCGACGA CGACCGCCTC
GGCCAGGCGG TCAGCGATCT CGAGCTGGTC ACCTCCGACC TGCAGTCTTC GGTGATGAAG
ACGCGCATGC AGCCGATCAA GAAGGTCTTC GGGCGTTTCC CCCGAGTGAT CCGCGACCTG
GCGCGCCAGC TCGACAAGGA GGTCACCCTG GAGACCCACG GCGAGGACAC CGACCTGGAC
AAGAACATGG TCGAGGCCCT GGCCGATCCC ATGGTCCACC TGGTGCGCAA CGCCGTCGAC
CACGGCGTGG AGTACCCCGA TGAGCGCGAG GCCGCCGGCA AGCCGCGCCA GGGCACGGTG
ACCCTGGCCG CCGAGCAGGA GGGCGACCAC ATCCTGCTGA CCATCGCTGA CGACGGGCGC
GGCATGGACC CGGACAAGCT GCGCGGCCTG GCGGTGAAGA AGGGCCTGAT GGACGAGGAG
TCGGCCTCGC GGCTCGACGA CAAGGAGGCG TTCAGCCTCA TCTTCCACGC CGGTTTCTCC
AGCAAGGAGG AGATCTCCGA CGTCTCCGGA CGCGGCGTGG GCATGGACGT GGTCAAGAAC
AGCCTCTCGC GGCTCAACGG CACGGTGGAC ATCGACTCGG AACTGGGCCG CGGCACGGTC
ATGCGCATCC AGCTGCCCCT GACCCTGGCC ATCCTGCCGA CGCTGATGGT CAAGATCTCC
GGGCGCAAGT TCGCCCTGCC CATGTCGGTA GTCCGCGAGA TCTTCGAGCT CCAGGACGTG
CGTACCAACG TGGTCAACAA CCGCCTGGTG GCCATCGTCC GCGACAAGGC CATGCCACTG
TTCTTCCTCG AGCGCTGGCT CAGCCAGGCG GGCGAGCCGG TGTTGACCAC CAAACGCGAG
ATGGCCAACG GTGAGGAGTC CGAGGACGAG CGCCAGGTCA TCACGGTGAT CGTCGGCAAT
CAGGCGGTGG GCTTCGTCGT CGATGAGGTC ATCGGTCTCG AGGAGGTGGT CATCAAGCCC
CTCGGTGCGC TGCTCCACGG CCTGCCCGGG TTGGCTGGGT CCACGATTAC CGGCGACGGG
CAGATCGCCC TGATCGTCGA CATTCCGAGC CTGATCAAGG CGTACGGCAA GCGGCTGTAG
 
Protein sequence
MAIDLDDDIV QDFLVESREI LDRLGEQLVD LEQHGEDKDL LNSVFRGFHT IKGGAGFLGL 
DPLVDVCHRT EDVFNLLRNG DKQVSQELMD SVLRALDVVV AQFGQIESGS DPEPAPAELL
QELDRLKKPD AGGSAPPAAG AEPAPAAAQP ATPGQGGDGS DAEFDELLTA LDEGDEASVD
TGGDEISDAE FEQLLDELHG KGQHQGKPAA EQPAADAPAG SDSGGGGEEI TEEEFEQLLD
SLYGPGAAPG KDAPPPAVDG AQESTPEPAP PQPSPSSPSA PEQPAAESPQ PATGQGSGGA
EKPARQPAAG GGGNKGGGGG GAADKGGAAG ADAGGGGNKV EASVRVDTAK LDEIMNLVGE
LVLVRNRLSN LRNEVDDDRL GQAVSDLELV TSDLQSSVMK TRMQPIKKVF GRFPRVIRDL
ARQLDKEVTL ETHGEDTDLD KNMVEALADP MVHLVRNAVD HGVEYPDERE AAGKPRQGTV
TLAAEQEGDH ILLTIADDGR GMDPDKLRGL AVKKGLMDEE SASRLDDKEA FSLIFHAGFS
SKEEISDVSG RGVGMDVVKN SLSRLNGTVD IDSELGRGTV MRIQLPLTLA ILPTLMVKIS
GRKFALPMSV VREIFELQDV RTNVVNNRLV AIVRDKAMPL FFLERWLSQA GEPVLTTKRE
MANGEESEDE RQVITVIVGN QAVGFVVDEV IGLEEVVIKP LGALLHGLPG LAGSTITGDG
QIALIVDIPS LIKAYGKRL
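
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code. Table 11 also permits alternative start codons, which this sketch does not handle; this gene begins with ATG, so that does not matter here. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGGCCATCG ACCTCGACGA CGACATCGTG"))  # MAIDLDDDIV
```

The output corresponds to the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` in the same way should give a direct comparison with the full protein sequence.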