Gene Hhal_0336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0336 
Symbol 
ID4711296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp386904 
End bp388877 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content69% 
IMG OID639854799 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_001001932 
Protein GI121997145 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGTC GCGACGTCCG GGGTGCGAAG CAGGGAGTGG GCGGGCACGA GATCGCGCCC 
GAGGAATTGC TGGAGATGCT CGAGGGGGCG CCGGAGCTGA TCGGCGTCGC CGATCTGTGC
TGCGGCCATG AGTTGTACCG CAACCCGGCC ATGCGCCGGG TCATGGGCGA GCCGCGCCCG
GATATCCCCC TGGAGCAGCG CATTGCGCAA TCCCACCCGG AGTGGGCCGT GCGGCGGATC
CTCGACGAGG CGATCCCGGC CGCGCGGCGC GACGGGGTCT GGCAGGGCGA GACGACGGTG
CGCGCTGTCG ACGGGACGGA GTTTGCCGCG TTGCAGACGA TCATCGTCCA CGCCGATGAA
CACGGCGAGG TACAACGCTG CTCAACCATC ATCCGTGACG TCACCGAGCA GAAGGCGACC
GAGCACGCGC TGCGCAAGAG CGAGGAGCGC TACCGCCGGT TCCTCGAAGA TTTTCTCGGT
GTGGCCTACC AGATCGATCC AGACGATTAC CGGGTCGTCC TGCTGAGAGG GGCGGTGGAA
GCCATTACTG GCTACGACCG CGCCTACCTG CTCGATCACG GGTACTGCTG GGGGGATCTG
ATCCACGCCG ATGACTGGGA GCGGGTCGCT GCCGAAGACG CCGCGATCCG GCGCCGGGGT
GAGCGGGTGG GCGATCTGCG CTACCGCATC CGCCACCGCG ACGGATCCAC CCGCTGGGTG
CGGGATATCT GGCAGATGGT GGAGCTACCA GCCAATGGCC GGCAGGTCTT CCAGGGGGCG
CTCTACGACA TCACCGAGCG CATGGAACTC GAGCAGGCCC GCGCCCAGGC CGATCGGGAT
CGCACCACGT TCCTGGCGGC GGTCAGTCAC GACCTGCGCA CGCCGCTCAA TGCCGTGGTC
GGCTTCACGG GGTTGTTGCA GGACACCGAG CTGGATGCCG AGCAGCGCCG TTACGTGGAG
CTTTGCCGGA CTGCCAGTCA GACCCTGCTC GGCCTCGTCG ATACCCTGCT CAGCCTCTCC
CGGCTGGAGG CCGGGGAGAT GCAGCTGGAG TGTGCGCCCT TCGACCTGCG GGCCTTCCTG
GAAGACGAGG TCGGGCTGCT GGAGGCCCTG GCTGCCGAGA AGGGACTGAG CCTGAGCCTG
CACATCGACC CGGCGGTTCC CGGGTGGGTG GAGGGGGATG CGGTGCGCTT CGGGCAGGTG
CTCTACAACC TGGCGAACAA CGCCATCAAG TTCACCGAAC AGGGCAGCGT GGTCATCACC
GTGGAGCCGG CGGACGGCGA CGGGCTGTTG CGGGTCGCGG TCCGCGATAC CGGGCCGGGG
ATCTCCGAGG CGGATCAAAA GCGCATCTTC CGCGCCTTCA CCCAGGGCGG TGATGTCTTC
CGCCGGCAGG AGGGCAGCGG GCTGGGTTTG ACCATCTGCA AGGAGCTGGT GCGGCTGATG
GGCGGTGATC TTGGTCTGAA GAGCGCCCCC GGGGCCGGCT CCACCTTCCA GTTTACCGCC
CGGTTGCCGG CGGTGGAGCC TTCAGCGGCG GATGGCGCCG AAGCCACGGT CGCCGTGGAC
CGGTCGCACC CGCAGGCGGA CCGGCCGCTG CGCGTGCTGG TCGCCGAGGA CGAGCCGACC
AACGCCCTGC TGATCCGCAC CCTCCTGGAA CAGCGGGGCT GCCAGGTGAC GGTGGTCGAG
GACGGCCAGG CGGCGATCGA TCACTGTGCA GCCGAGCGCC CCGATCTGGT GCTGCTGGAT
GTGCAGATGC CCAGCGTGGA CGGCTGTCAG GCGCTGGGCC GGATCCGCCA GCACGAGTCG
CAGATCGGCG CGGCCCGCAC ACCGGTGGTG ATGTGCACCG CCCACGCCGT CGAGCAGATC
TGGGCGGACT GCGCCCGGCA GGGCAGCGAC TACATCCTGA CCAAGCCGAT CGATCGCGAT
GAGCTGTCCC GTGTGCTGGC CTGGGTGCGG CAGCCCGCCG TAGACGGACA CTGA
 
Protein sequence
MASRDVRGAK QGVGGHEIAP EELLEMLEGA PELIGVADLC CGHELYRNPA MRRVMGEPRP 
DIPLEQRIAQ SHPEWAVRRI LDEAIPAARR DGVWQGETTV RAVDGTEFAA LQTIIVHADE
HGEVQRCSTI IRDVTEQKAT EHALRKSEER YRRFLEDFLG VAYQIDPDDY RVVLLRGAVE
AITGYDRAYL LDHGYCWGDL IHADDWERVA AEDAAIRRRG ERVGDLRYRI RHRDGSTRWV
RDIWQMVELP ANGRQVFQGA LYDITERMEL EQARAQADRD RTTFLAAVSH DLRTPLNAVV
GFTGLLQDTE LDAEQRRYVE LCRTASQTLL GLVDTLLSLS RLEAGEMQLE CAPFDLRAFL
EDEVGLLEAL AAEKGLSLSL HIDPAVPGWV EGDAVRFGQV LYNLANNAIK FTEQGSVVIT
VEPADGDGLL RVAVRDTGPG ISEADQKRIF RAFTQGGDVF RRQEGSGLGL TICKELVRLM
GGDLGLKSAP GAGSTFQFTA RLPAVEPSAA DGAEATVAVD RSHPQADRPL RVLVAEDEPT
NALLIRTLLE QRGCQVTVVE DGQAAIDHCA AERPDLVLLD VQMPSVDGCQ ALGRIRQHES
QIGAARTPVV MCTAHAVEQI WADCARQGSD YILTKPIDRD ELSRVLAWVR QPAVDGH