Gene EcolC_3233 details


Gene Information

Locus tag: EcolC_3233
Symbol: phoR
ID: 6066771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3540462
End bp: 3541757
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 53%
IMG OID: 641602648
Product: phosphate regulon sensor protein
Protein accession: YP_001726182
Protein GI: 170021228
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR02966] phosphate regulon sensor kinase PhoR
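
The coordinates and lengths listed above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 3540462
END_BP = 3541757
GENE_LENGTH_BP = 1296
PROTEIN_LENGTH_AA = 431


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1296
print(expected_protein_length(GENE_LENGTH_BP))  # 431
```

Both results agree with the Gene Length and Protein Length fields of the record.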


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.62554
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000115446
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
GTGCTGGAAC GGCTGTCGTG GAAAAGGCTG GTGCTGGAGC TGCTACTTTG CTGCCTCCCG 
GCTTTCATCC TGGGTGCATT TTTTGGTTAC CTGCCCTGGT TTTTGCTGGC ATCGATAACA
GGACTGCTTA TCTGGCATTT CTGGAATTTA TTGCGCCTTT CATGGTGGCT GTGGGTGGAT
CGCAGTATGA CCCCGCCACC GGGGCGTGGT AGCTGGGAAC CGCTACTATA CGGCTTACAC
CAGATGCAGC TGCGAAATAA AAAACGCCGC CGTGAACTGG GCAATCTGAT TAAACGCTTT
CGTAGCGGCG CGGAGTCGCT GCCCGACGCG GTGGTGCTGA CCACGGAAGA GGGCGGTATT
TTCTGGTGTA ACGGTCTGGC GCAACAAATT CTTGGTTTGC GCTGGCCGGA AGATAACGGG
CAGAACATCC TTAACCTACT GCGTTACCCG GAGTTTACGC AATATCTGAA AACGCGTGAT
TTTTCTCGCC CGCTCAATCT GGTGCTCAAC ACCGGGCGGC ATCTGGAAAT TCGCGTCATG
CCTTATACCC ACAAACAGTT GCTGATGGTG GCGCGTGATG TCACGCAAAT GCATCAACTG
GAAGGGGCGC GGCGCAACTT TTTCGCCAAC GTAAGCCATG AGTTACGCAC GCCATTGACC
GTGTTACAGG GTTACCTGGA GATGATGGAT GAGCAGCCGC TGGAAGGCGC GGTACGCGAA
AAAGCGTTGC ACACCATGCG CGAGCAGACC CAGCGGATGG AAGGGCTGGT GAAGCAATTG
CTGACGCTGT CGAAAATTGA AGCCGCGCCG ACGCATTTGC TCAATGAAAA GGTTGATGTG
CCGATGATGC TGCGCGTTGT TGAGCGCGAG GCTCAGACTC TGAGTCAGAA AAAACAGACA
TTTACCTTTG AGATAGATAA CGGCCTCAAG GTGTCTGGCA ATGAAGATCA GCTACGCAGT
GCGATTTCGA ACCTGGTCTA TAACGCCGTG AATCATACGC CGGAAGGCAC GCATATCACC
GTACGCTGGC AGCGAGTGCC GCACGGTGCC GAATTTAGCG TTGAAGATAA CGGACCGGGC
ATTGCACCGG AGCATATTCC GCGCCTGACC GAGCGTTTTT ATCGCGTTGA TAAAGCGCGT
TCCCGGCAAA CCGGCGGTAG CGGATTAGGG TTAGCGATCG TGAAACATGC GGTGAATCAT
CACGAAAGTC GCCTGAATAT TGAGAGTACA GTAGGAAAAG GAACACGTTT CAGTTTTGTT
ATCCCGGAAC GTTTAATTGC CAAAAACAGC GATTAA
 
Protein sequence
MLERLSWKRL VLELLLCCLP AFILGAFFGY LPWFLLASIT GLLIWHFWNL LRLSWWLWVD 
RSMTPPPGRG SWEPLLYGLH QMQLRNKKRR RELGNLIKRF RSGAESLPDA VVLTTEEGGI
FWCNGLAQQI LGLRWPEDNG QNILNLLRYP EFTQYLKTRD FSRPLNLVLN TGRHLEIRVM
PYTHKQLLMV ARDVTQMHQL EGARRNFFAN VSHELRTPLT VLQGYLEMMD EQPLEGAVRE
KALHTMREQT QRMEGLVKQL LTLSKIEAAP THLLNEKVDV PMMLRVVERE AQTLSQKKQT
FTFEIDNGLK VSGNEDQLRS AISNLVYNAV NHTPEGTHIT VRWQRVPHGA EFSVEDNGPG
IAPEHIPRLT ERFYRVDKAR SRQTGGSGLG LAIVKHAVNH HESRLNIEST VGKGTRFSFV
IPERLIAKNS D
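
The protein sequence begins with M although the gene sequence begins with GTG rather than ATG. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can act as an initiation codon and is then read as methionine. The following Python sketch illustrates this on the first 30 bases of the gene sequence. It is a minimal illustration, not the pipeline that produced this record: `translate_cds` and `START_CODONS` are hypothetical names, and `START_CODONS` lists only three commonly used bacterial initiation codons, which is an assumption rather than the full table 11 set.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code
# and differs in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of initiation codons, enough for this illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, reading an initiating start codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCTGGAAC GGCTGTCGTG GAAAAGGCTG"))  # MLERLSWKRL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should stop at the terminal TAA codon.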