Gene Haur_3458 details

Gene Information

Locus tag: Haur_3458
Symbol:
ID: 5735319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4348577
End bp: 4351453
Gene Length: 2877 bp
Protein Length: 958 aa
Translation table: 11
GC content: 53%
IMG OID: 641280605
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001546222
Protein GI: 159899975
COG category: [T] Signal transduction mechanisms
COG ID: [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
        [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
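
The coordinate and length fields above are related by simple arithmetic, and a short check can confirm that they agree. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 4348577, 4351453
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values match the reported Gene Length (2877 bp) and Protein Length (958 aa).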


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGAGT TGCCATTACA GATCTTGGTG GTCGATGACG AGCCACGCTT AAGTGGTCAG 
TTGCGGTTGT TATTGGAAGG GCAAGGCTAC AGCGTAACCA CGGCGGTTGG GGGTGCTGCG
GGCATTGCCA CGCTCCAAGC AAGCGACGTT GATGTCGTGC TGACTGATGT CACCATGCCT
GATGTTGATG GCTATACGGT GTTGCAATGG GTGCGCGAAA ATCGCCCAGA CATGCCGATT
ATCGTCATGA CTGGCTATGG TTCGCTGGAA AGTGCCACCC GTGCGTTGCG CTTAGGGGCT
TACGATTATA TTTTGAAGCC CTTTTCGTTG CCAACAGTGC AAGCAGCCTT GAATCGCGCT
TCGGCGGCGG TTTCCAAGCG GCGTGCCGAT GGTCAGCGCA CCGCCGAGCT TTCAGCAATT
GCGGCAATCG CCCGCCAAAT GGGGCGCTCG CTAGAGCCAG TGGCCATGGC CGATGCCACC
TTGAGCGCAA TTGCCGAATC AACTGGCGCT CAAGTGGCCT TGCTCTACAC CGCCGCTGAA
AATGGCATGC GCTGGCATTT ATGGCGACAC GTTGGCCTGA ATGAGCAACT TATTCAACAA
TTAGCCGAAT TATTTCCGCC CAGCCCGCTG AGCAAAGCCA CAATCGTTTG GGAAGATCGT
CCGCCTTGGT TGGCCGAGAT GCCCTACGAA TTGCTTCAAG GCCTGTGCAG CAGCGTTGGC
GGCAAAGTCG CTTGGGGCGC ATTGCCACTC GATGCCGGCC CGCAAGCGCT TATGTCGCTG
GTCTTGGTTG GCTCAAGTCG CAAAACTCAA GCTTGGTCGC CGCCATTTCT CATTTCGTTG
GGCAATGCCT TGAGCATGGC CTTGGTCAAT ACGCGCTTGT TCAATGCAGT GCGCGAGGAA
CGCGACCGCT TGCGCTTGCT CTATGGCATC AGCCGTGAGT TGGCTAGCTC GCTCGACCCC
GATCAATTAC TTTCGCGAAT TATTCAACAC ACGGTGGCAG CAGTTAACGC CGAACGTGGC
AGCATTATCA TTTCATCGCA AGATGGCCGC ACGACTCAGC GGATTGTTGC CCGTTATGGC
ATGGATCAAT CGGTGACTGA ATCGGTAGCG GCGGCCTTGT TGCAAGCCGG GCTTTCGGGC
TGGGTCTTTC GCCAACGTGA AGCCGCGCGA ATTGCCGATG TGCGCGTTGA TAAACGCTGG
GTCGAATTGC CCTCAACTCG TGGTCGGGTG CGTTCGGCCT TGGCAGTGCC CTTGTTGCGC
GAAGATCAAG TGCTTGGCGT GATGACCTTG ACGCACCCAC GGATCGATCA TTTTAGCGCT
GCCGACTTGG AATTGGTGCG TTCGGTTTCG GCTCAAGCAG CCGTCGCGAT TGAAAATGCC
AATTTGTTTG CTGAGCTAGA GCAGCGGGTA TTCGATCTTG AGGGCTTAAA TTCGACCAGC
CGCGAATTGG CTAGCTCGCT TGATCCATTG GAAGTAGCGC GAAAAGTGGC TTATCGCTGT
GCTGAAATGC TCGATGCCTC AATGGTAGCA TTGCTGCATG TCGATGATAA ACAAGGCACA
TTGCCCTTGG TTTCCTTGAT TAACGGCCAA GAACAGCCGT TATTGAAATT GGGGCCGATT
AGCGCTGCTC TGAATGATCC AGAACCTGTG CTATTAAACT CGGCTGGCAG CCAAATTGAA
CTGTTGGTGA ATAACGAAGA ACCCTTGGGC GAAAGCTGGA TTGGCGTGCC CTTGATGCTG
GGTGATGGGA TCAACGGCTT ATTGATTGCC GCCGATGAGC GCCGCGATGC CTTCGATGCT
TACGAACGAC AATTACTCAC AGCCTTGGCA GGCCAAGCAG CGGTAGCAAT GGAAAGTGCG
CGACTGTATG TCACTGCATC CGAGGAGCGA ACGCTCTTGG CAGCAGTGAT TGAATCGGTC
AGTGATGGCA TTTTGCTGAC TGATGAAGGC CAAATTGTGG TTGCTAACCC TGCCGCTGGC
GCAATTGCGG GCGTTTCTAA TAGTCGCTTG GTTAATCAAC CCTTGCTGAC CTTCTTCCCA
ATGTTGGCAA TGCTAGCTCG CCGCGAAGAT CACGAAAGCA AAGAAATTGC CATCAGCAAC
CGTTATTATG CGGTCAATAC TGCGCCATTG CAAAACAGTT CCTTGGGCGG CCAAGTGATC
GTCTTGCAGG ATATTACCCA TTTCAAAGAG CTAGACCAAA TCAAGAGCCG CTTTGTTTCG
ATGGTTTCGC ACGACCTCAA GTCGCCGCTG ACCGCAATTC AAGGCTATGC CCAATTGGTT
GCCGACGGGC ATATGGGCAC GGTCAACGAG ATGCAGCGTG ATGCCTTGCA AGCAGTTGTG
CGCAACACAG GCGCAATGAC TGCCTTGATC AGCGATTTGC TCGATTTGGG CAAAATCGAA
GCAGGGATTG GCATTTCGCC GCAAGAAACT GATTTGGCGG TGGTGTTGCG CGAAGTTATC
GACGAGCTGA AATTGCGGGC CAAAATGGGT CAAATCAGCG TCCAGCCTGA AATTCCGCCC
AGCTTGCCCT TGGTGGCTGA TCCCTCGCGG ATGCGCCAAG TGTTTACCAA CATTCTTTCG
AATGCAATTA AATACACGCC AAGCGGGGGG CAGGTCCAAA TTCGCGCCAA TAATGGCGAT
GCTAAAATGC ATGTGCAAAT TCAAGATAGC GGCTTAGGAA TTCCCGAAGA TTCCTTGCCG
CATATTTTTG AGCGCTTCTA TCGTGTCAAG CGGGATATTG ATTCGCCGAT TGAAGGGACT
GGCCTTGGCT TAGCAATTAC CAAAAGCATT GTCGATGAGC ATGGCGGCAC GATCGAGGTG
CAAAGTGTGA TCGGCGAGGG CACAACCTTC AATGTATATC TTCCCCAACA CAAATAA
 
Protein sequence
MNELPLQILV VDDEPRLSGQ LRLLLEGQGY SVTTAVGGAA GIATLQASDV DVVLTDVTMP 
DVDGYTVLQW VRENRPDMPI IVMTGYGSLE SATRALRLGA YDYILKPFSL PTVQAALNRA
SAAVSKRRAD GQRTAELSAI AAIARQMGRS LEPVAMADAT LSAIAESTGA QVALLYTAAE
NGMRWHLWRH VGLNEQLIQQ LAELFPPSPL SKATIVWEDR PPWLAEMPYE LLQGLCSSVG
GKVAWGALPL DAGPQALMSL VLVGSSRKTQ AWSPPFLISL GNALSMALVN TRLFNAVREE
RDRLRLLYGI SRELASSLDP DQLLSRIIQH TVAAVNAERG SIIISSQDGR TTQRIVARYG
MDQSVTESVA AALLQAGLSG WVFRQREAAR IADVRVDKRW VELPSTRGRV RSALAVPLLR
EDQVLGVMTL THPRIDHFSA ADLELVRSVS AQAAVAIENA NLFAELEQRV FDLEGLNSTS
RELASSLDPL EVARKVAYRC AEMLDASMVA LLHVDDKQGT LPLVSLINGQ EQPLLKLGPI
SAALNDPEPV LLNSAGSQIE LLVNNEEPLG ESWIGVPLML GDGINGLLIA ADERRDAFDA
YERQLLTALA GQAAVAMESA RLYVTASEER TLLAAVIESV SDGILLTDEG QIVVANPAAG
AIAGVSNSRL VNQPLLTFFP MLAMLARRED HESKEIAISN RYYAVNTAPL QNSSLGGQVI
VLQDITHFKE LDQIKSRFVS MVSHDLKSPL TAIQGYAQLV ADGHMGTVNE MQRDALQAVV
RNTGAMTALI SDLLDLGKIE AGIGISPQET DLAVVLREVI DELKLRAKMG QISVQPEIPP
SLPLVADPSR MRQVFTNILS NAIKYTPSGG QVQIRANNGD AKMHVQIQDS GLGIPEDSLP
HIFERFYRVK RDIDSPIEGT GLGLAITKSI VDEHGGTIEV QSVIGEGTTF NVYLPQHK