Gene Haur_4656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4656 
Symbol 
ID5736503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5947907 
End bp5949940 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content50% 
IMG OID641281820 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001547415 
Protein GI159901168 
COG category[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACG TTGCAACTAC CCAAAATCAC GCACGTCCAA TCAGCGGTTT TCAGTTGTTG 
GCAATTGCCG AGGCAATCAA CAACGCAGCA AATCTTAGTT CTTTGGTAGC TACAGTGGCC
GAGTTATGTG CTACTGGTTT TGGGGTCGAA GTCGTTAAGT TGGGGCTGCT TGATAACGAA
CACCATAGCG GTTCGTCCAC GCCATATCTC TATGGCAACC CCAGCAAAGC CACCCAAAAA
TTGCTCGATC AAGCGATTGC CCAAACCATC GAACAAAAAA CTGCCCTGTT GCCAAGCAAT
GACGAGAGTG CCCAAGCCCA ATTGTTGGTG TTGCCGTTAT TGGCCAGCCA GCAATTAATT
GGCTTTTTGG CCTTGCTCTT GCCCAAAAAA GGCCGTTGGT CGGCTGAAGC AATCGAGGCT
GCTCAGTTGT TGGCGCATAA TTTGGCTCTA GCGATTTCAG CGGTTCAGCT TAAAGACTAT
ACCGCTAAAC GTAATCAAGA AATTAATACG CTCAACGATA TCGCCGCCAC AATCACTTCA
TCGCTTGATC CACGCCAAGT CTATCGCTTG GTGGTCAAGA AGATCAACGA ATATTTTCAG
GTTGAGGCTG GTTCACTGCT ACTGCTCGAT CCCGTGACCA ACGAATTGGT CTTTGTGATG
ACGCTTGAGG CGGGCGAAGA AAAATTGGCG GGCGTGCGGG TTCCACCTGG CCAAGGTTTG
GTTGGGGCAG CAATTACCAC CCGCCAGCCA GTCGTCGTGC TCGATGCCCA AAACGACCCA
CGTTTCTATC GGCGGGTTAG CGAGGATGTT GGATTTGTGA CGCGCTCGGT CTTGTGCGTG
CCCATGCTGG TGAAAAATCG TGAAATTGGG GTGATTCAGT TGTTGAATAA GCTAGAAGGC
GTATTTAACA CTGAAGATAC CCAACGCCTG CAAGCTATGG CTAACACGGT GGGCGTGGCG
ATCGATAACG CTAATCTGTT TCACGAAGTT TCGCAAAACC GCAATCGCCT CCAAGCCTTG
CTCAACTCCA CCACCGACGG TATTTTGATG ATCGACCCTG ATGATGTGGT GTTGACTGCT
AATCCAATGC TGGGTGAGTT GTTTGGCTGG GAATGGCGCA ATATCATCGG CGAGGCTGGC
AGCGATATTC TGGCTCGCAT CAAAGAGCAA TCGCGGGTGG TCAACGAGTT GCCCAATAGC
GAAACCTGCG AAATTGAAGT GTTGCGGCCT CGTACCCGCT ATGTGCGCCA AGAGCCATTG
CCAGTGCGCA ATAATTTTGG CAATGTGATT GGCACGCTGA TTGTGTTCCA CGATATTACC
GAGGAATATC AGCTAGCCCA AATTCGCGAA GATTATATGG GCATGTTGGT GCACGATTTG
CGTGCGCCAC TCACGGCGAT CATCAATGGT ATGACCATGG TGCGCCGTGG TTTTGCTGGC
CCAATTAACG ACCAGCAACG CGAATTGCTA GATATTGCCA ACAATAGCAG CCAAGAAATG
GTTGGCATGA TCAATACCTT GCTGGATATT AGCAAGATGG AAGCTGGCGA ATTGGTGCTG
AATCGTGCGC CATGCTCAGC CTACGAAATT GTTGATCGTG CTTCGGAACG TTTGATTAAC
TCAGCTCGCA GCGTTGATAT CAGCATCAAT CTGGATATGG CCCTGAATTT GCCAATTATC
GATGCTGACC AAGATAAAAT TGTGCGGATC TTGCAAAACC TGCTGGATAA CGCGATCAAA
TTTACGCCAG TTGGCGGAAG TGTTACAATC CGCGTGCGCC AATTAACTGA TAATGAAAAG
CAGACGATCT GCTGGAGTGT AATTGATGCC GGACCGGGCA TTCCCGAAAG CTATCGTGCC
AAGATTTTCG ATAAGTTTGT GCAGGTTGCT GGCCAGAGAA AAGGCACGGG CTTGGGCCTG
GCCTTTGCCA AACTTGCATC AGAAGCCCAC GGCGGGCGGA TTTGGGTTGA AAGTGTTGAG
GGCGAAGGTA GTACATTCTC GTTTACCATT CCGTATGAGC CAGCAGTGAA ATAG
 
Protein sequence
MADVATTQNH ARPISGFQLL AIAEAINNAA NLSSLVATVA ELCATGFGVE VVKLGLLDNE 
HHSGSSTPYL YGNPSKATQK LLDQAIAQTI EQKTALLPSN DESAQAQLLV LPLLASQQLI
GFLALLLPKK GRWSAEAIEA AQLLAHNLAL AISAVQLKDY TAKRNQEINT LNDIAATITS
SLDPRQVYRL VVKKINEYFQ VEAGSLLLLD PVTNELVFVM TLEAGEEKLA GVRVPPGQGL
VGAAITTRQP VVVLDAQNDP RFYRRVSEDV GFVTRSVLCV PMLVKNREIG VIQLLNKLEG
VFNTEDTQRL QAMANTVGVA IDNANLFHEV SQNRNRLQAL LNSTTDGILM IDPDDVVLTA
NPMLGELFGW EWRNIIGEAG SDILARIKEQ SRVVNELPNS ETCEIEVLRP RTRYVRQEPL
PVRNNFGNVI GTLIVFHDIT EEYQLAQIRE DYMGMLVHDL RAPLTAIING MTMVRRGFAG
PINDQQRELL DIANNSSQEM VGMINTLLDI SKMEAGELVL NRAPCSAYEI VDRASERLIN
SARSVDISIN LDMALNLPII DADQDKIVRI LQNLLDNAIK FTPVGGSVTI RVRQLTDNEK
QTICWSVIDA GPGIPESYRA KIFDKFVQVA GQRKGTGLGL AFAKLASEAH GGRIWVESVE
GEGSTFSFTI PYEPAVK