Gene Haur_4240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4240 
Symbol 
ID5736094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5406080 
End bp5409274 
Gene Length3195 bp 
Protein Length1064 aa 
Translation table11 
GC content50% 
IMG OID641281395 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001547000 
Protein GI159900753 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTAGCTA ACCTGTATAG CTCAGATCAA CAACGCTCAC GCTTTATCTG GTTAATGGGC 
ATTCAAGCGG TCTTGGTGGT TGTGTTGCTG ATCGCCTCAA CCCTCTTGAC GGTTGGCATG
AACCGTTTAC GCGATCGTGA TGAAACGTTG CTGAGCTATC GGGTGCGTTT AAATGACGTT
AATATTGCTC TGTTATCGTT GCACAACAAT TTGCGCGGCT ATACCAGCAC TGGCTCAGAT
ATTTTCTACG CGGAAATTCA AAAAGAACAG GCAGTTGTTA CCGAGGGTTT GCAATTTCTC
TTAGTCAACC CGCCCGACCC GAGCTTGCCC CAAGTCGCTG AGCAAGTCGA TCAATGGCTG
CTCAATACCT ATCAGCCAGT GCTGGATGCA GTGGCCAACC AAGAAATGCT GTTGGCCGAA
TTGACGCTTG AGCAAGGTCG CCCACAAATC GAGGCGGTAG TTAACACGAC AACCAGCTTG
CGCACTGAAT TGCGCCAGCG CTCGGAAGAT TATCGCAACC AAATTGATAC TTACAATTGG
ATTAAACTGG CCTCGATTGG TTTGCTCTCA GCATTAATTG TGGTTTCGAT GATTGTGACA
GTGCGCTTCT GGCGCACCCA ACAGCACTTA ATTCACGAAA TTGAAGATAA AGGCAGTCAA
CTGCTGCAAA GCAACCATGA ATTAAATTTC AATACCAATC AACTTTCAAT TATCAACAGT
ATTTTGGGCG TGCGGATCAA CGAGTCGCGG GTGTTGCGCG AAATCAGCGA TTATTTGGTC
AATAATCCTA CCCCCGCTGA GGCCTATAGC TTTGTGGCGC AAACTGTTGG TAATGTGCTG
AATACCTGGT GTAGCATTGC GCTGCGTTTG CCCCCGCCCC GCGAGGAATT TCTCGATGTG
GTGGCTTCAT ATCATTCGTT GCCCAGTCGC CAAGCCTACA TCGACCAAAT GGTGCAAACA
GTCCAATTTC GCATCGATAA TGGTCTGTAT TCGCCAGTCT TTACCCAGCA TGCTCCCTTG
GTTGAACTCT TCAATGTGCC CGTTGAGCAA CGTCACCCCA ACTATTTGAC CAACGAGGTT
CGCGAACACT TGGAGCCATT CACGCTCTAT TCGTATATCG CTGTGCCGAT TAAAATTCAG
GATGAAGCAG TTGGCCTGAT TTCGGCAGCC TCAGATTCGC CTGAACGGCT GTTTGACCAT
GATCAAACCT TGTTTGTGCG CCAAGTCGCT GATCGTTTGG CCGCTTGGCT TGAAAATATT
CAATTATTTT ATTTGCTCAA ACAACAGGCC AACGAGCTGC AAACGAGCTT CGATAGCCTC
GACGATATTG TAGTTTCCTA CGATAGCCGT GGTCATCAAA CGCGAATCAA CGAGGCTGGC
ACGCAGTTTT TTGCTGGTCG CCACTTTGAT TTTCTCTCCA ATAATTTGGT TTGGCGCACG
GCTAAGGGCA ATGTATTGGG CTTCGATGAG CATCCGATTC AGCAGGCGTT GGCTGGCACA
ACCGTGCGTG ATGTTGAAGT TTCGTTATCA CGCTCCGATG GTGTGCCGAT TATTCACGAA
GTCAGCGTTT CGCCGCTGCG ATCTGCCGAT GGTTCAATTG AAGGCATTGT GCTGGTGGCG
CGAGATTTGA GCGCTCGCAA GGAGCTTGAT CGGCTCAAAG AAGAACTTGT TGCCAATATG
AGCCACGAGT TGCGCACGCC GCTCACTGCC ATTTTGGGCT ATAGCGAATT GCTGCTCAAA
CGCCGCACCG AAGTGCTTAC GCCTTGGCAT ACCACCAAGA TTGAGGGGAT TCGCACTGGC
GGCCAGCGTT TGCTGAGCCT CGTCAACGAT CTGCTAGATA TTGCCAAGCT TGATGCTGGC
CGCATCGAGT TGCAACGCCA AACCACCATC ATTAATAGCT TGTTGCAAGA GCAAGTGGCG
ATTTTGCAGC CGATGCTTCG CGAGAAACAG CAAACCCTCA CCTTGCAACT GGGCCAGCAC
ATCCCCTTGC TCATGATCGA TCCTGAGCGG ATTGGCCAAG CAGTGACCAA TTTGTTGAGT
AACGCCATTA AATTTACGCC AGAGCAAGGT ACAATCACCT TAGCCTCAAC CGCCTTGAAC
ATTGATGAGC ATGGCCAAAT TGACTGGCTG GATCAAGTGT TGGCGACCGA AGTGCCGCCG
ATGTTGGCAG GCCAATATGT CTTGATTCAG GTCAGCGATA GTGGGGTGGG CGTGCCAGCC
GAAGCGCTCG TAAAATTGTG GGATCGTTTT TATCAAGTTG AGGGTGGCTC GACGCGCCGT
TTTGGGGGCA CAGGCTTAGG TTTATCGATC GTTCAGCAAT TAGTTGAATT ACATGGTGGA
CGGACATGGG CGACTAGCGC AGGCGAAAAT CAAGGCAGCA GCTTCACAAT TATGTTGCCA
GTCAGCCGCG GTGCCCAATT TGTGAGCCTC ACTCAAGGCT TACGGCGTTC GATTTTGGTG
ATTGAAAACG ATCACCAAAC GGCCCAATAT TTGGAAGAAC AATTGCAAGC GGCTGGTTTT
GAAGTGATTG TGGCGACTGA TCATCATAGT GCCTTGACTT GGGCCAAAGA TCATTCACCA
GCGGCAATCA CCCTCGATTT GCTCATGCCC AATAGCGAAA GCTGGGAAAC CTTGGCAGCC
TTGCGCGAAA TTGACCATTT AGCGCAAGTT CCCGTCTTGA TCGCAAGCGA TGCTTCGGTA
TACAATGAGC TACCAGGGGT TGGTGTCTCC ACATATGTAG TCAAGCCAAT TGATAGCCAA
ATTCTGATTC GGATTATTCG TCAACTGATT GGGGCTCAAG GCCAAACAGG GTTTATCTTG
GTGGTTGATG ATGATTATGA TATGGCCGAA CTGCTGTGTG CTACTTTGCA AGAGCATGGC
TACGTTACCC AAGCCTCATA CGATGGGGCG GCGGCGCTCG ATCTGATTCA ACAGGGCAAT
TATCCCCAAC TAATTTTGCT TGATTTGATG ATGCCAGAGG TTGACGGCTT TCAGCTACTC
GAAAAACTGC GGGCAAATCC TGAAACTCGT AATATTCCCG TGATCATTGT GACTGCCCGT
GACTTAACCA ATGAAGAAAT TCGCCAATTG CGCCAAGCTG CGCAAGCCAT TCAGACCAAA
CATACCCTCA GTATGCGTAA GCTGGTTGCT GAAGTCCAAC GGTTTGCCCC GTTGAAGGAA
TCGGATACCC CATGA
 
Protein sequence
MLANLYSSDQ QRSRFIWLMG IQAVLVVVLL IASTLLTVGM NRLRDRDETL LSYRVRLNDV 
NIALLSLHNN LRGYTSTGSD IFYAEIQKEQ AVVTEGLQFL LVNPPDPSLP QVAEQVDQWL
LNTYQPVLDA VANQEMLLAE LTLEQGRPQI EAVVNTTTSL RTELRQRSED YRNQIDTYNW
IKLASIGLLS ALIVVSMIVT VRFWRTQQHL IHEIEDKGSQ LLQSNHELNF NTNQLSIINS
ILGVRINESR VLREISDYLV NNPTPAEAYS FVAQTVGNVL NTWCSIALRL PPPREEFLDV
VASYHSLPSR QAYIDQMVQT VQFRIDNGLY SPVFTQHAPL VELFNVPVEQ RHPNYLTNEV
REHLEPFTLY SYIAVPIKIQ DEAVGLISAA SDSPERLFDH DQTLFVRQVA DRLAAWLENI
QLFYLLKQQA NELQTSFDSL DDIVVSYDSR GHQTRINEAG TQFFAGRHFD FLSNNLVWRT
AKGNVLGFDE HPIQQALAGT TVRDVEVSLS RSDGVPIIHE VSVSPLRSAD GSIEGIVLVA
RDLSARKELD RLKEELVANM SHELRTPLTA ILGYSELLLK RRTEVLTPWH TTKIEGIRTG
GQRLLSLVND LLDIAKLDAG RIELQRQTTI INSLLQEQVA ILQPMLREKQ QTLTLQLGQH
IPLLMIDPER IGQAVTNLLS NAIKFTPEQG TITLASTALN IDEHGQIDWL DQVLATEVPP
MLAGQYVLIQ VSDSGVGVPA EALVKLWDRF YQVEGGSTRR FGGTGLGLSI VQQLVELHGG
RTWATSAGEN QGSSFTIMLP VSRGAQFVSL TQGLRRSILV IENDHQTAQY LEEQLQAAGF
EVIVATDHHS ALTWAKDHSP AAITLDLLMP NSESWETLAA LREIDHLAQV PVLIASDASV
YNELPGVGVS TYVVKPIDSQ ILIRIIRQLI GAQGQTGFIL VVDDDYDMAE LLCATLQEHG
YVTQASYDGA AALDLIQQGN YPQLILLDLM MPEVDGFQLL EKLRANPETR NIPVIIVTAR
DLTNEEIRQL RQAAQAIQTK HTLSMRKLVA EVQRFAPLKE SDTP