Gene Haur_4530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4530 
Symbol 
ID5736381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5795565 
End bp5797043 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content51% 
IMG OID641281692 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001547289 
Protein GI159901042 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0351302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGG CTATGCAGTT GCACGCACGC GGTGGGTTGG CGCTCAATCA GCTAATTCGG 
CTGGTTTATC TGATGGCAAT TGCCGTTGGG ATGTTGGGGC TTGGTCTGAT TGGCTATGGC
CTTTGGCTGG CGCGTTTCGA TTGGTGGCAT TGGTTTTTGA TCAGCCTTGG GGCTTTATGG
TTGCTGCTGC ACATTGGCTT AATCGTGATG CTGCAACGCA AACGCCGCGC AATTCAGCAA
TTAACTGCTG TCGTCGATGT GCTAGCAACC GCCGATTTAA GTGTGCGAGC ACCCAATCTA
GGCACTGATG AGTTGGGCGA ATTGGCGCAA TCGATCAATT TGGCCGCCGA GCATTTTACC
AGCTTGCTCG ATACCCAGCG CCGCGAAACC CAGCGCGAAC GGGCAATTTT GGCGGCAATC
GACGATGGCG TGATTGTTTG CGATCAGCTT GGTCAGATTA TTTTGCTCAA TACTGCTGCC
TACAATATTA TTGCCATGGC CGAAGCTGAA CGCCTGCAAA CCAACCACGA ACAGCCTAAT
TTGAGTTTGC GCTTTTATGC TGCGCTCGAA GCCGTGCAAC CAGCCTTGGA TCAAGCGCTT
GGCCGTCCGC ATATCAAGCC TGCCGAACGA GTTTGTTTTG CTGGCCGCAC CTATCGCCTC
AGTGCTAACC CGATTTGGAT CGACGATCGG CGGATTGGGG CGGTGGCAAT TTTGCAAGAT
ATTAGTGCCC GCGTCGAGAG CGAACGCTTA CGCAGCGATT TTATGGCTTT GGCTGCCCAT
GAATTACGTT CGCCGTTGAC CAGTATTCGT GGCTTTGCTG ATATGCTATT GTGGAGCAAC
CCCGAGCATT TTAGTGCCGA AGAAATTAGC TACATCGAAG GCATCGGGCG CAACATCCAA
CGCCTAACCG AGCTGATGAA CGATGTGGTT GAGTTGGCGC GATTGGAAAC CCAACGCAAT
GAGCATACAC CGCAGCCTGT TGATTTACGC CAGGTGTTGA GCGCAGTGCT TGATGAATTT
CGGCCACGCG CTGAACGCAA AAAATTACAA TTAGATTGTT TATTGCCGAA TGAATTACCG
CTGTTGAGCC TTGACCCGCT GCATATTCGC CAAATTAGCC ATCACCTGAT TAGCAATGCG
ATCAAATATA CCCCTGAGCA AGGCCAGATT CAAATTGAAG TGCAACAACG GATCGATGAT
ATCTTAGTGG TGGTACGCGA TACGGGGATT GGCATTTCGC TGCGCGAACA GCCGCGCATT
TTTGGGCGTT TTTTTCGCAA CGATAATCCA CTTTCACGCG CCGCAGGTGG CACAGGCTTA
GGCTTATCGA TCGCCAAAGC CCTCGTCGAA ATGAACCATG GTTCAATCTA TTTTGAAAGC
ATCGAGGAAG AAGGTACAAC CTTTTATGTG GCGTTTCCCT TGGCTTTGGT ATGCGAACCT
TTGCCCTACA ATCCAGCCGC CTTGGCCGAT GCAGCCTAG
 
Protein sequence
MRMAMQLHAR GGLALNQLIR LVYLMAIAVG MLGLGLIGYG LWLARFDWWH WFLISLGALW 
LLLHIGLIVM LQRKRRAIQQ LTAVVDVLAT ADLSVRAPNL GTDELGELAQ SINLAAEHFT
SLLDTQRRET QRERAILAAI DDGVIVCDQL GQIILLNTAA YNIIAMAEAE RLQTNHEQPN
LSLRFYAALE AVQPALDQAL GRPHIKPAER VCFAGRTYRL SANPIWIDDR RIGAVAILQD
ISARVESERL RSDFMALAAH ELRSPLTSIR GFADMLLWSN PEHFSAEEIS YIEGIGRNIQ
RLTELMNDVV ELARLETQRN EHTPQPVDLR QVLSAVLDEF RPRAERKKLQ LDCLLPNELP
LLSLDPLHIR QISHHLISNA IKYTPEQGQI QIEVQQRIDD ILVVVRDTGI GISLREQPRI
FGRFFRNDNP LSRAAGGTGL GLSIAKALVE MNHGSIYFES IEEEGTTFYV AFPLALVCEP
LPYNPAALAD AA