Gene ECH74115_0476 details

Gene Information

Locus tag: ECH74115_0476
Symbol: phoR
ID: 6969745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 483455
End bp: 484750
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 53%
IMG OID: 643384524
Product: phosphate regulon sensor protein
Protein accession: YP_002269038
Protein GI: 209398260
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR02966] phosphate regulon sensor kinase PhoR
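
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record.
start_bp = 483455
end_bp = 484750
gene_length_bp = 1296
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop agree with protein length
```

Under these assumptions all three checks print True: 484750 - 483455 + 1 = 1296, and 1296 / 3 - 1 = 431.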


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGGAAC GGCTGTCGTG GAAAAGGCTG GTGCTGGAGC TGCTACTTTG CTGCCTCCCG 
GCTTTCATCC TGGGTGCATT TTTTGGTTAC CTGCCCTGGT TTTTGCTGGC ATCGGTAACA
GGACTGCTTA TCTGGCATTT CTGGAATTTA TTGCGCCTTT CATGGTGGCT GTGGGTGGAT
CGCAGTATGA CCCCGCCACC GGGGCGTGGT AGCTGGGAAC CGCTACTATA CGGCTTACAC
CAGATGCAGC TGCGAAATAA AAAACGCCGC CGTGAACTGG GCAATCTGAT TAAACGCTTT
CGTAGCGGCG CGGAGTCGCT GCCCGACGCG GTGGTGCTGA CCACGGAAGA GGGCGGTATT
TTCTGGTGTA ACGGTCTGGC GCAACAGATT CTTGGTTTGC GCTGGCCGGA AGATAACGGG
CAGAACATCC TTAACCTGCT GCGTTACCCG GAGTTTACGC AATATTTGAA AACGCGTGAT
TTTTCTCGCC CGCTCAATCT GGTGCTCAAC ACCGGGCGGC ATCTGGAAAT TCGCGTCATG
CCTTATACCC ACAAACAGTT GCTGATGGTG GCGCGTGATG TCACGCAAAT GCATCAACTG
GAAGGGGCGC GGCGTAACTT TTTTGCCAAC GTGAGCCATG AGTTACGTAC GCCATTGACC
GTGTTACAGG GTTACCTGGA GATGATGGAT GAGCAGCCGC TGGAAGGCGC GGTACGCGAA
AAAGCGTTGC ACACCATGCG CGAGCAGACC CAGCGGATGG AAGGGCTGGT GAAGCAGTTG
CTAACGCTGT CGAAAATTGA AGCCGCACCG ACGCAATTGC TCAATGAAAA GGTTGATGTG
CCGATGATGC TGCGCGTTGT TGAGCGCGAG GCTCAGACTC TGAGTCAGAA AAAACAGACA
TTTACCTTCG AGATAGATAA CGGCCTCAAG GTGTCTGGCA ACGAAGATCA GCTACGCAGT
GTGATTTCGA ACCTGGTCTA TAACGCCGTG AATCATACGC CGGAAGGCAC GCATATCACC
GTACGCTGGC AGCGAGTGCC GCACGGTGCC GAATTTAGCG TTGAAGATAA CGGACCGGGC
ATTGCACCGG AGCATATTCC GCGCCTGACC GAGCGTTTTT ATCGCGTTGA TAAAGCGCGT
TCCCGGCAAA CCGGCGGTAG CGGATTAGGG TTAGCGATCG TGAAACATGC GGTGAATCAT
CACGAAAGTC GCCTGAATAT TGAGAGTACA GTAGGAAAAG GAACACGTTT CAGTTTTGTT
ATCCCGGAAC GTTTAATTGT CAAAAACAGC GATTAA
 
Protein sequence
MLERLSWKRL VLELLLCCLP AFILGAFFGY LPWFLLASVT GLLIWHFWNL LRLSWWLWVD 
RSMTPPPGRG SWEPLLYGLH QMQLRNKKRR RELGNLIKRF RSGAESLPDA VVLTTEEGGI
FWCNGLAQQI LGLRWPEDNG QNILNLLRYP EFTQYLKTRD FSRPLNLVLN TGRHLEIRVM
PYTHKQLLMV ARDVTQMHQL EGARRNFFAN VSHELRTPLT VLQGYLEMMD EQPLEGAVRE
KALHTMREQT QRMEGLVKQL LTLSKIEAAP TQLLNEKVDV PMMLRVVERE AQTLSQKKQT
FTFEIDNGLK VSGNEDQLRS VISNLVYNAV NHTPEGTHIT VRWQRVPHGA EFSVEDNGPG
IAPEHIPRLT ERFYRVDKAR SRQTGGSGLG LAIVKHAVNH HESRLNIEST VGKGTRFSFV
IPERLIVKNS D
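
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. The following is a minimal sketch, not the tool that produced this record. It assumes translation table 11 (bacterial), in which the codon-to-amino-acid mapping matches the standard code and alternative start codons such as GTG are read as M at the first position. The names `translate` and `gc_fraction` are hypothetical helpers; only the first 30 bases of the gene sequence are used as input here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11; each is read as M at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is always translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
first_bases = "GTGCTGGAAC GGCTGTCGTG GAAAAGGCTG"
print(translate(first_bases))  # MLERLSWKRL
```

The output matches the first ten residues of the protein sequence above, including the GTG start codon being read as M. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked the same way.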