Gene Hhal_2151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2151 
Symbol 
ID4709706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2357301 
End bp2360192 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content66% 
IMG OID639856626 
ProductHpt sensor hybrid histidine kinase 
Protein accessionYP_001003717 
Protein GI121998930 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.614764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAACC ACGCATCGCA TCCCTCCGGG CGTCTGAGTC GCCGATGGCT GCGGGGTCGG 
CCACACCAGA GAGTCCGCCA TCATCTGCAG TGGCCGGTCA TCCCGGCGGG GCGTTGGCTC
GGTCGCGCTG CGGTCCTGCT TGGCGGTGTG CTGTTCGTTG TGTCGGCGCA CGCCGAGCGA
AGCGTGACGC TGGGCATCCT CTCCTTCGAG TCTCGGGAGG TGGCCGAGGA GCGCTGGCAA
CCGCTGGTCG AGTACCTGGA CGGTGAAATC GACGACGTCC AGATCGAGGG CGCCATCGCT
GGTTACGCAG AGCTGGACGG ACTCCTGGAG AACGAGCAAC TCGACTTCCT GCTCACCAAC
CCGATGCACT ACGTGCGACT GCGCGACCAG TACGCCTTGT CCGGTGCGCT GGCGACGCTG
GTCCCCGAGC GACAGGGCGA GCGGTTACGC AGTTTCGGCG GCGTGGCCTT CACCCGGGCC
GAGCACCCTG AGGTGTCGGG GTGGGGCGAT GTGCCCGACC ACACCGTAGC GGCCGTGCAC
GAGGACTCCC TGGGCGGGTA CCAGGCCCAG GCCATGGAGT TGAAGCGGCG TGATCTCTCC
ATGCCGACGG GGGACGCAAT CCACTTCACC GGTATGCCCC ACCGCCAGGT CGTCGATCGC
GTCATGGGTG GAGAAGCCGA TATCGGCTTT GTCCGGATTA GCGTTCTGGA GCGCCTCTGG
GAGGAAGGCG AACTGGCGCG GGACGCGGTG CGCATCATCG GTGAGCAGGC GTATGCCGGC
TTCCCCTTCG TGGCTTCAAC CGCGTTGTAC CCCGAGTGGC CCCTCGTTCA CCTGGAAGAG
GCGGAGGGTC AGCAAAGCAG CATAGGCCGG GAGATGACCA CGGCTCTGCT TCGACTCGGG
CCCGACCATC CTGCGTCCCG GGCTGCCGGA ATCGCCGGCT TCTCGGTGCC CATGGATTAC
GAGCCGGTGG AGGAACTGGC GCGCGCGCTG CGCCTGCCGC CCTACGATGC GCCTCGGGCC
CTGAGCCGGG ACGAGCTATG GGATCTCTAC CGCACCCCCC TGGTTGCCGT CAGTACAGCA
CTGCTGGTCT TTGCGGCCAT GGCGTTCGGG CTGATGGTCA CCAACCGCCG GCTCAAACGG
GCTCGGCAGC AGGCCGAGCA GGCCACGCGC GCCAAGAGTG AATTCCTGGC CAACATGAGC
CACGAGATCC GCACACCCAT GAACGCCGTG GTGGGCCTGA GCCAGCTGCT GCTCGATACC
GAACTGAACG AGCGCCAGCT CGATTACCTG AAAAAGATCC AGCGCTCGTC GCGGATGCTG
CTCGGCATCA TCAACGACAT CCTCGACTTC TCCAGAATCG AGTCGGGGCA GCTGGAACTC
GACACAGGCC CCTTCGATCT CGGCGAAGTG ATCGACCACC TCGCTACCCT CTTCTCTGAA
GCGGCTCGGG AGAAAGAGAT CGAGCTGATC TACGCCATTC AGCCCGAGCT GCCGCGCATG
CTCGTGGGGG ATTCGCTGCG CATCACCCAG GTCCTGACCA ACCTGCTTAG CAATGCGATC
AAGTTCACGC CCCGCGGGGG CACTGTGGAA TTGGAAGTCC GTGAGGAGGC GGGTGCCGGC
AGTGCCGCCG CGAACATCCG CTTCCAGGTC CGCGATAGCG GCATCGGTAT GGATGCTGAG
CAGCTGGAAC GGGTCTTTCA GCCGTTCGTG CAGGCCGACG CCTCGACGAC CCGCCGGTAT
GGCGGTACGG GTCTGGGACT GGGCATCGGA CGACGGCTGG TGGAGCGGAT GGGGGGGGAT
CTGGAGGTGA CGTCGCAGCC TGGGCAGGGC AGCACCTTCT TCTTCACCCT GCAACTCCCC
ATAGGCGAGG TGCAGGGCAG CGCCCCTGCG TATCCCGCGA CCCGCAGCCG GCGCGGGCCG
CTGGACGTTT GCCACCTGGA GACGCACGAA GCGCACAGCG GTGAGGTGGC CATTGGACAG
GCGCGCGCCG AACCGGGCGA GAAGGCCGCC GACTGCCCCA AACCATCGCC GTCCCGGGTG
CCTGATCTGA CGGGCTTTTC GATCCTTCTG GTGGAGGATG ACGCGGTCAA CCAGGAAGTG
GCCACGCAGT TTCTGGAGAA GACCGGGGCA CGGGTCCGAG TGGCGGAAAA CGGGATGGCA
GCCCTGGATG CTGTCGATGA CGCGGAGCCC GATCTCATCC TGATGGATCT ACAGATGCCG
CTGATGGATG GCTTCGAGGC TTCGCGTCGG CTGCTGGAGA CTGGGTGCTC CCGAACGATC
ATCGCGCTAT CGGCGGCGGT GCTGGATGAG GATCGTCGCC GCGCCGAGGC AGCCGGGATG
CGCGGGCTGA TCGCCAAACC GATGGAGCAG GAGGTGCTCT ACGCCACGCT AATGGCCGAG
TTGAAGCCTG CGGCAGGGGC GATACGGCGG CCAGCACAGA CCCAGGGGGC AGCGGCCGAG
CCGCAGCAGA CCCACACCGA CGCTGCCGCG CCAGCGACCC TGCCCGCTGA GCTCCCGGGG
TTTGATCGTG CGCGCGGGCT CGAGATCGTT GGCGGAGATC AGTCGACCTA CGCCCGGTTA
TTGCAGTTGT TCAAGAGCCA GTTACCGGAC TATCACGCAT CGCTCATCGA GCCGCTGCGT
ACCGACTCCG TCCCTGATCG GCAGGGGCTG CATCGGATCG CCCATACCCT CAAGGGCAGT
GCCGCAAGTG TATGCGCGGT GGAGCTTGGT GAACGGGCTA GCCAGGTTGA AGCCGCCTTG
AAGCACGACG AGGCAGTCAG CGAGACGCAG ATCGATGCGT TGGAACAGGC GCTCAAGGAG
GCGGAGCAGG TCCTCCGGGA GCGGATGGCA CCTCCGCCCG GTGGACCGCT ACCGTTCGAT
GAGCGCCTCT AG
 
Protein sequence
MNNHASHPSG RLSRRWLRGR PHQRVRHHLQ WPVIPAGRWL GRAAVLLGGV LFVVSAHAER 
SVTLGILSFE SREVAEERWQ PLVEYLDGEI DDVQIEGAIA GYAELDGLLE NEQLDFLLTN
PMHYVRLRDQ YALSGALATL VPERQGERLR SFGGVAFTRA EHPEVSGWGD VPDHTVAAVH
EDSLGGYQAQ AMELKRRDLS MPTGDAIHFT GMPHRQVVDR VMGGEADIGF VRISVLERLW
EEGELARDAV RIIGEQAYAG FPFVASTALY PEWPLVHLEE AEGQQSSIGR EMTTALLRLG
PDHPASRAAG IAGFSVPMDY EPVEELARAL RLPPYDAPRA LSRDELWDLY RTPLVAVSTA
LLVFAAMAFG LMVTNRRLKR ARQQAEQATR AKSEFLANMS HEIRTPMNAV VGLSQLLLDT
ELNERQLDYL KKIQRSSRML LGIINDILDF SRIESGQLEL DTGPFDLGEV IDHLATLFSE
AAREKEIELI YAIQPELPRM LVGDSLRITQ VLTNLLSNAI KFTPRGGTVE LEVREEAGAG
SAAANIRFQV RDSGIGMDAE QLERVFQPFV QADASTTRRY GGTGLGLGIG RRLVERMGGD
LEVTSQPGQG STFFFTLQLP IGEVQGSAPA YPATRSRRGP LDVCHLETHE AHSGEVAIGQ
ARAEPGEKAA DCPKPSPSRV PDLTGFSILL VEDDAVNQEV ATQFLEKTGA RVRVAENGMA
ALDAVDDAEP DLILMDLQMP LMDGFEASRR LLETGCSRTI IALSAAVLDE DRRRAEAAGM
RGLIAKPMEQ EVLYATLMAE LKPAAGAIRR PAQTQGAAAE PQQTHTDAAA PATLPAELPG
FDRARGLEIV GGDQSTYARL LQLFKSQLPD YHASLIEPLR TDSVPDRQGL HRIAHTLKGS
AASVCAVELG ERASQVEAAL KHDEAVSETQ IDALEQALKE AEQVLRERMA PPPGGPLPFD
ERL