Gene Haur_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1142 
Symbol 
ID5733034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1306365 
End bp1308260 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content49% 
IMG OID641278281 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_001543918 
Protein GI159897671 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2202] FOG: PAS/PAC domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCGGA AATTGAAATG GCCACTCGCT GGAGCATGGC AACGCCGCTT GATTAAAGCA 
ACTCGCCGTA AAGATGTAAC GCTTGAGCAA TTTCCGGTTG GCTTGGTTTG GTTTCAAGGC
TCACAGGTTT TTTTTAATCA AGCGGTCACG GCGATGATTG GCTATAGCAA TGCGGAAATT
GCGACAACTG AGCAATGGTT TAGCACCTTG TATGGCCCTG AAGCCGCCAG CATGCGCCAA
TTGTATGCAA CAACCTCGGC AGCGACCTTA GGCCAGACAA TCCATGGCTT TGTCGTGAAT
CGCCAGAACC AAAGTTGCCT TTTAGAATGT ACGTTGGCTT CCCATGGCCA TCGTCAGGTT
TGGTTGGTAC GCGATATTAC CGAGAGCAAT CGGCTTGAGC GCTTGCTGTT GCAAACTGAA
CAAACCGCCC GCGTCGGCGG CTGGGAGATC GATTTGCGCA CCAACCAAGT ATTTTGGACG
CGCGAAATGT ACCATATTTT GGATACCACA GCACACGAAT ACACGCCCAC GATTGAGAAT
CAGAATTTTT TCCATACCAA TGCCACGTTA ATCCAACTTG AGGCAATTTT TCGCCAAATG
ATCGAGCAGC GTGGCTCGTT TGATATGAGT GTGGAAATGC GGACGTTTCG GGGGCGTTCG
TTCTGGGGTC GTTTTACTGG GCGAGTTGAG CTAGAGTTTG GCCAGCCAAT CAAAATTTAT
GGCTCGTTGC AAGATGTAAC TGAGCACCAT CAATTAACTG AGGCCTTGCG GGTGGCCGAG
CACGACTATC GCACAATTTT TGAAACCACC AAAATTGGCA TCTTTCGCAT TACTCCCGAT
GGACGGGTGT TACGGGCTAA TCCGGCGTTG GTGCGCTTGG CGGGCTTTGC CCATGAACAT
GAATTGGTCG ATTATGTGGC CGATTTAACC ACGATGTATG TTGAGCCGCA GCGCTTTGAA
TACTTGCGTG AGCTGCTGCA AACCAATGGC TCGTATGATG AAATTGAATC GGAAGTTTAT
CGGCCTGCCA CTGGCGAACG GATTTGGATC AGCGAGACCT CGCGTTTGGT GTATGCTGAG
GATGGCTCGA TGCTCTACGC CGAAGGCACG ATTCAAGAAA TTACGGCACG CAAACAGGTC
GAAGAGGCGT TGCGCCATGC CCGTGATGCT GCCGAAGCCG CCAATCATGC CAAAAGCACT
TTTTTGGCCA ATATGAGCCA CGAGCTGCGC ACGCCACTGA ATGCAATTAT TGGCTATAGC
GAATTGCTGA TGGACGATAC TGATTTTGAT GATCCGACGA TGGTTGAGCA GTTTCGCCAT
GATATTGCGC AAATTAATGA TGCAGGTCAT CAATTGCTCA ATTTGGTCAA CGATGTGCTT
GATTTGGCCA AAGTTGAGGC TGGCAAATAT CAAGTTGCTG CTGAAACCTT CGATCTCAAC
AGCCTTGTAC GTGATTTGAT TGCCACAATT AACCCAATGG CTCAGAAAAA TGCCAATAGC
CTTTACTTTG AGCCAAACAA ACATCTGCCG TTAATTCATA CTGATCGCTC GATGTTGCGC
CAGATTTTGC TGAATTTATT GAGTAATGCC GCCAAATTTA CCAAAGCAGG CAGCATCAAC
ATCAGCGTCA GCTTTGATCC AGCCAGCCAA CATGTGCAAT GTCGGGTGCG CGATACTGGC
ATTGGTATGA ACGATGAGCA AATGCAGCGT TTGTTTGAGC CATTTACCCA AGGTGATGCC
TCGACGACGC GGCGCTATGG TGGCACTGGC TTGGGCTTGG CGCTTTGTCG CCATTTTATC
GAACTATTGA ATGGCTCAAT TCAAGTTGAA AGTGTCTTTG GCCAAGGCTC GATCTTTACC
ATTGTCTTTC CATGCTTGGT TGAGGCAATT GATTAG
 
Protein sequence
MPRKLKWPLA GAWQRRLIKA TRRKDVTLEQ FPVGLVWFQG SQVFFNQAVT AMIGYSNAEI 
ATTEQWFSTL YGPEAASMRQ LYATTSAATL GQTIHGFVVN RQNQSCLLEC TLASHGHRQV
WLVRDITESN RLERLLLQTE QTARVGGWEI DLRTNQVFWT REMYHILDTT AHEYTPTIEN
QNFFHTNATL IQLEAIFRQM IEQRGSFDMS VEMRTFRGRS FWGRFTGRVE LEFGQPIKIY
GSLQDVTEHH QLTEALRVAE HDYRTIFETT KIGIFRITPD GRVLRANPAL VRLAGFAHEH
ELVDYVADLT TMYVEPQRFE YLRELLQTNG SYDEIESEVY RPATGERIWI SETSRLVYAE
DGSMLYAEGT IQEITARKQV EEALRHARDA AEAANHAKST FLANMSHELR TPLNAIIGYS
ELLMDDTDFD DPTMVEQFRH DIAQINDAGH QLLNLVNDVL DLAKVEAGKY QVAAETFDLN
SLVRDLIATI NPMAQKNANS LYFEPNKHLP LIHTDRSMLR QILLNLLSNA AKFTKAGSIN
ISVSFDPASQ HVQCRVRDTG IGMNDEQMQR LFEPFTQGDA STTRRYGGTG LGLALCRHFI
ELLNGSIQVE SVFGQGSIFT IVFPCLVEAI D