Gene Haur_4547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4547 
Symbol 
ID5736943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5817743 
End bp5820664 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content49% 
IMG OID641281709 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001547306 
Protein GI159901059 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTAT TATGGCTTTT TTCCACACTG GCCGAGTTTA TGAATAGTTT TAGCGCTACG 
TCCTTCCAAC CAGCCATGGA GCAACCAACG CTCGCTGAAT TAACCCAAAC TATCGCTCGT
TTAGAGGCTG AGCTGGCTGC TACCAAGCAG GCCTTGCTCG CCAGCGAACA GCGCAATCTG
CAATTTTTTC AACAAACTCA AGCGATTGAG TTAATTGTTG AGCCTGATAC TTGGCGAGTT
TTGGCGGCCA ATGCTGCTGC TTGCCGTTTT TATGGCTATA GCCAAGCCGA GTTTAGTAAT
CTCAACGTTC GCGATTTAAA TGTATTGACC AGTAAAGAAC ATCGGGTGGC CATTCGCAAC
GGTGAGCTTG GTGGTAGCAA TCGCTATGAG TTTCGCCATC GCTTACGTTC AGGAATGGTC
TGCGATGTTG AAGTTTATAT CACTCCCTTT GAATATCAAC AGCAACCAGC CTTATTTTGT
TTAATTCACG ACGTAACTGC CCATAAACAG ACGATTCGCC AAATGCAACG CCAAAATGCC
TATCTTGGCT CGCTGCACGA TGTGACCTTG GCCTTGATCG ATCAGGCCGC CCTGCCCGAT
TTGCTCGAAA TTATTCTGTG GAAGGCAACC GATTTGGTCG GCAGCGAATA TGGTGTGATG
TATATTCGCA GCGCCGATGG TCAAGCCATG GAATCTGTCG TGAGTCGCGG GGCGATCAAC
GTAATCCAGC ACATCGAATT AGGCCAAGGT GCAGTTGGCA CGGTCGGCCT TAGCGGTCAG
CCATTAATTA TTGATGATTA CTCCGCCTGG AGTGAGCGTT TATTTACGCC GCAAGATGGT
TTTGGTAATG GCCTGTTGGT ATTACCATTA TTCCGTGAAC AAACAGTCGC TGGCGCACTA
GCGATTATCT ACGAGCCAAG TCAGATTCAA ATTAGCATTA CGATGCTTGA AATGGTGCAA
CAATTTGCCC GCTTTGCCTC GCTTGCACTG GAGAAGCATA TGCTCTACAC CGCAGCCCAA
ACTGAACTGG CCGAACGTCG GCGGGTCGAA CAGCAGTTGC GTTCGCTGAA TGAGCGGTTT
GAATTGGCCC AAACGGTGCT GAATGGCGGC ATCTACGATT GGGATATGAT TCAGCAGACC
ACAGTGGTGA ATCGGAGCTT TACAACGGTG TTTGGCTATA CCAGTGAAGA GGTTGCCAGC
AGCCCAACCT GGTGGGAAGA ACATCTGCAC CCCGATGATC GTGAGTTGTT CATCCAACGA
ACTGCCGATA TTTTCGCTGG CCATAGCGAT GTTTTTTCGG CTGAATATCG CTTTCTCGAT
CAACATCAAC ATTATCGCTT TGTGCAGGAT CGCGGCCATG TGCTGCGTGA TCCCGATGGC
CAAGCTTTGC GTATGGTTGG CACACTGATC GATATCTCCG AGGAATATCG CCATATTGTT
GAGATGCTGC CCTTGCCGCT GTTTGTAGTG CAGGATGATC GAATTGTCTA TGCCAATCAG
GCTGGACTGG CGTTGACCCA AACCACCAGC AGCGAATTGC TTAGCCAGTC GGTGCTTAGT
GGCTTACGGC CACAAGATCC TGATGATTTT TATGTGCTGA TTCGCCATGC CAGCACCATG
CCCACCAGTT TTATTGAAAC AACCTTTTTG CGCCGTGATG GCACGCCAAT TTATGGCGAA
TGTTCAACCT CAGCAATTTT ATTTGAGCAT CGAAATTCAG TCCAAGTGAT TATTCGCGAT
ATTAGCGAAC GCAAGCAGGC CGAGCAAAAA CGCCTAGAAA TTGAACAACA CTTTTCAGAA
ACCCAAAAAT TAGAAAGCCT TGGCGTGTTT GCTGGCGGGA TTGCCCACGA TTTCAATAAT
GTGTTCATGT CAATTCTTGG TAATACCCAT TTAGCTATGC TCGAACTGCC TGAGCAAGCC
AACCTGCGTC CGCTTTTAGC CGAAATCGAA CAATCGGCCA AACGGGCAGC GCAAATCACC
AAGCAAATGG TTGCCTACAC CGGCCAAAAT CTTGATGTGC GGCGCAGCCT GATGATCAAC
GAATTAGTTC AGACCTCAAT TCGTTTGCTG CCCTCAAGCC TAACCCAGCA ACGTCATATT
CAGAGCAACT TAGCTGATAA CTTGCCGCTG ATCGAGGGTG ATCAACAACA TCTGCGCCAA
GCGATCAGCA ATGTACTGAT CAATGCGCTT GAAGCGTCGA GCAGCGGAAC GCTTACAGTT
ACCACCAGCT TGCGCGATTT GCAAGGAGCA CCCGAGGGCC ACACCTATAC GACAGGCACA
TTTTTGCCCG CCCAATATAT CGCGATTGAA ATTAGCGATC AAGGCTTAGG CATGGATCAT
GCGACGATCT CGCGCATGTT CGACCCATTT TTTACCACCA AATTTACTGG ACGCGGGTTG
GGTTTAGCGG TTACTTTAGG GATTGTGCGC GGCCATCGCG GTGGCATTGC GGTCAAGAGT
CGCCCTCAAC AGGGCACGAC AATAGGAATT TATCTGCCAA TTATGACAGC AGTACAAACT
GAACTAGCCA GCGAACAACC AGCCTTAGTC GATTCTGGCC CAGGCAAAAT TTTGGTGATC
GACGACGAGC CTGATGTACG AATCGTGATT GATCGGATTT TGCGTCGTTT GGGCTACGAA
ACCTTACTAG CCGAGGGTGG TAATCGGGGC ATTAGCCTCT TCCGCGACCA TCATCAAGAA
ATTGCGGGCG TTTTTCTCGA CCTAACCATG CCCGATCTCG ATGGTCAAAG CACGCTCGAA
GTGCTCAAAC AGATTCAGCC TGAGATTCGC GTGGTGATTA TGAGCGGCTA TAATGAACAA
CAAGTGAGCC AACAATTTGG TACTACTGGT ACAACCCAAT TTCTGGCAAA ACCATTTATG
ATTGAGGAGA TCCGCGATAA GCTCACGGCA TTACGGCTCT AG
 
Protein sequence
MKLLWLFSTL AEFMNSFSAT SFQPAMEQPT LAELTQTIAR LEAELAATKQ ALLASEQRNL 
QFFQQTQAIE LIVEPDTWRV LAANAAACRF YGYSQAEFSN LNVRDLNVLT SKEHRVAIRN
GELGGSNRYE FRHRLRSGMV CDVEVYITPF EYQQQPALFC LIHDVTAHKQ TIRQMQRQNA
YLGSLHDVTL ALIDQAALPD LLEIILWKAT DLVGSEYGVM YIRSADGQAM ESVVSRGAIN
VIQHIELGQG AVGTVGLSGQ PLIIDDYSAW SERLFTPQDG FGNGLLVLPL FREQTVAGAL
AIIYEPSQIQ ISITMLEMVQ QFARFASLAL EKHMLYTAAQ TELAERRRVE QQLRSLNERF
ELAQTVLNGG IYDWDMIQQT TVVNRSFTTV FGYTSEEVAS SPTWWEEHLH PDDRELFIQR
TADIFAGHSD VFSAEYRFLD QHQHYRFVQD RGHVLRDPDG QALRMVGTLI DISEEYRHIV
EMLPLPLFVV QDDRIVYANQ AGLALTQTTS SELLSQSVLS GLRPQDPDDF YVLIRHASTM
PTSFIETTFL RRDGTPIYGE CSTSAILFEH RNSVQVIIRD ISERKQAEQK RLEIEQHFSE
TQKLESLGVF AGGIAHDFNN VFMSILGNTH LAMLELPEQA NLRPLLAEIE QSAKRAAQIT
KQMVAYTGQN LDVRRSLMIN ELVQTSIRLL PSSLTQQRHI QSNLADNLPL IEGDQQHLRQ
AISNVLINAL EASSSGTLTV TTSLRDLQGA PEGHTYTTGT FLPAQYIAIE ISDQGLGMDH
ATISRMFDPF FTTKFTGRGL GLAVTLGIVR GHRGGIAVKS RPQQGTTIGI YLPIMTAVQT
ELASEQPALV DSGPGKILVI DDEPDVRIVI DRILRRLGYE TLLAEGGNRG ISLFRDHHQE
IAGVFLDLTM PDLDGQSTLE VLKQIQPEIR VVIMSGYNEQ QVSQQFGTTG TTQFLAKPFM
IEEIRDKLTA LRL