Gene Haur_4649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4649 
Symbol 
ID5736496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5939300 
End bp5940901 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content50% 
IMG OID641281813 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_001547408 
Protein GI159901161 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2203] FOG: GAF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCGA ATGAACCTGG TCAGGTGACA AGCCCCGAAG ATTCATATCA GCGTTTGCGA 
ACCGAGTATG AACGGTTGCA GTTGCTCTAT AGCCTCGCTC AACAGTTTGC GACGATGTTA
AGTTTACCCA ATGTTTTGCA GCATGTGCTC TCAGCCACAA CCCGCTTTAC CAATGCTGTG
CGTGGTAGCA TTTTCCTCTA TGGCGAGTAC GATGAAGTTC AAATGCATTT GCTTTCTGAG
CGTTTTCAAC AAGCTCGCTT GACTCCTTTT ACTAATCGTA TTTTACGCGA AGGCTTGGCA
AGTTGGGTGC TCAAGCATCG CCAAAGTGCC TTAATCGCCG ATACCAATCT CGATGAACGT
TGGATCGATT ATCCCAGTGA TACGTTGGAT GTACGCTCAG TGCTGTGTGT GCCGCTTTTG
CGTGGGCGAC GAGTGCGCGG CGTGCTGACC TTGGTGCATC CTGAAATTGG TTTTTTTACC
ATGGATGATG AGGCGCTGCT GAATGTGATT GCCCAGCATG CAGCCATGGC GATTGAAAAT
GCCCAGCTGA TCGCTGATAT TAATGATGAG CGGCGCAAGT TCGAGGGAGC TTTCACCGCT
ATGGAAGAAG GCTTGATTTT GGTTGATGGC GATGGTCGGA TTCACTTTGT TAATCCCCAA
GCCTTAGCCT TTTTTGCGGT CACTCCACCC GTGCCCGAAC ACCTGAACGA GCTTTCGCAG
CAAGTTTTGG GCTTATTCAA AGAGGCCCAA CAAACTGGTG ATAGTGTGCG TGCCGAGATT
GTGCTTGAGC AGAATCCCAC CACCGATCTG GCCATCCACA TTGCCTACAT CCCCATTTTC
AGTGAACAAG AAGATTGGTG GACGATCGTG CTCCACGATA TTACGCGGCT GAAGGAGTAT
GATCGGCTCA AAACTCAGTT TGTCGCCAAT GCTTCGCATG AATTGCGCAC GCCTTTGGCC
AATATTAAGT TGTATGCCCG GTTGGCCCAG CAAGTCAAAG CTAAAACCAA GTTACCTCAA
TATCTCGAAA CTATTGCGAG CGAAGCAGGC CGCTTGGAAG CAATCGTTGA AGATTTGTTG
ACCCTGACCC GCTTGGATAG CGGCTTGATG CATAGTAGTC CTGAATGGGT TGATATTATT
GAATTGTTGC GCAACTTAGC CCAAACCTAT CGCCCGCTGG CCGAAGCGCG GGAACAACGC
TTGATTTTTC AAGAATGTAC CAGCAATTTG CCCAAGCTTT GGTTAGCCCC CGATCAATTT
ATGCGGGTGG TCGTGAACTT GCTAAGCAAT GCCCTAAAAT TCACACCAAC TGGCGGCACA
GTCACGCTAT CGGTTGATCG CCAAACCCAA GCGGGGCAGG CTGGCACGCT TATCACCGTA
GCCGACACTG GGCCAGGCAT CGCACCTGAA CATCAGCAAC GCTTATTTGA ACGTTTTTAT
CGCGGTAGTA ACCCAACCGA GAGTGGTAGC GGTTTAGGTT TAGCCATTGT GCGTGAGTTG
TTGGCCTTGA TGGGAGGCAC AATTTCGGTA GCCAGCACCG TTGGTCAAGG TTCGCAATTC
ACCTGTTGGT TGCCCTTAGA ACAACACACT CACACCGCAT GA
 
Protein sequence
MAANEPGQVT SPEDSYQRLR TEYERLQLLY SLAQQFATML SLPNVLQHVL SATTRFTNAV 
RGSIFLYGEY DEVQMHLLSE RFQQARLTPF TNRILREGLA SWVLKHRQSA LIADTNLDER
WIDYPSDTLD VRSVLCVPLL RGRRVRGVLT LVHPEIGFFT MDDEALLNVI AQHAAMAIEN
AQLIADINDE RRKFEGAFTA MEEGLILVDG DGRIHFVNPQ ALAFFAVTPP VPEHLNELSQ
QVLGLFKEAQ QTGDSVRAEI VLEQNPTTDL AIHIAYIPIF SEQEDWWTIV LHDITRLKEY
DRLKTQFVAN ASHELRTPLA NIKLYARLAQ QVKAKTKLPQ YLETIASEAG RLEAIVEDLL
TLTRLDSGLM HSSPEWVDII ELLRNLAQTY RPLAEAREQR LIFQECTSNL PKLWLAPDQF
MRVVVNLLSN ALKFTPTGGT VTLSVDRQTQ AGQAGTLITV ADTGPGIAPE HQQRLFERFY
RGSNPTESGS GLGLAIVREL LALMGGTISV ASTVGQGSQF TCWLPLEQHT HTA