Gene EcSMS35_0430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0430 
SymbolphoR 
ID6143504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp440164 
End bp441459 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content53% 
IMG OID641615326 
Productphosphate regulon sensor protein 
Protein accessionYP_001742533 
Protein GI170681341 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR02966] phosphate regulon sensor kinase PhoR 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.157336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGAAC GGCTGTCGTG GAAAAGGCTG GTGCTGGAGC TGCTACTTTG CTGCCTCCCG 
GCTTTCATCC TGGGTGCATT TTTTGGTTAC CTGCCCTGGT TTTTGCTGGC ATCGGTAACG
GGACTGCTTA TCTGGCATTT CTGGAATTTA TTGCGCCTTT CATGGTGGCT GTGGGTGGAT
CGTAGTATGA CCCCGCCACC GGGGCGTGGT AGCTGGGAAC CGCTGCTATA CGGATTACAC
CAGATGCAGT TGCGAAATAA AAAACGCCGC CGCGAACTGG GCAATCTGAT TAAACGCTTT
CGTAGCGGCG CGGAGTCGCT GCCTGATGCG GTGGTGCTGA CCACGGAAGA GGGCGGTATT
TTCTGGTGTA ATGGTCTGGC GCAACAAATT CTTGGTTTGC GCTGGCCGGA AGATAACGGG
CAGAACATCC TTAATCTGCT TCGTTATCCG GAGTTTACGC AATATCTGAA AACGCGTGAT
TTTTCTCGCC CGCTCAATCT GGTGCTCAAT ACCGGGCGGC ATCTGGAAAT TCGCGTCATG
CCTTATACCC ACAAACAGTT GCTGATGGTG GCGCGTGATG TGACGCAAAT GCATCAACTG
GAAGGGGCGC GGCGTAACTT TTTTGCCAAC GTGAGCCATG AGTTACGTAC GCCATTGACC
GTGTTACAGG GTTACCTGGA GATGATGGAT GAGCAACCGC TGGAAGGCGC GGTACGTGAA
AAAGCGTTGC ACACCATGCG CGAGCAGACA CAGCGGATGG AAGGGCTGGT GAAGCAATTG
CTGACGCTGT CGAAAATAGA AGCCGCACCG ACGCAGTTGC TCAATGAAAA GGTTGATGTG
CCGATGATGC TGCGCGTTGT TGAGCGCGAG GCTCAGACTC TGAGTCAGAA AAAACAGACA
TTTACCTTTG AGATAGATAA CGGCCTCAAG GTGTCAGGCA GCGAAGATCA GCTACGCAGT
GCGATTTCGA ACCTGGTCTA TAACGCCGTG AATCATACGC CGGAAGGCAC GCATATCACC
GTACGCTGGC AGCGAGTGCC GCATGGTGCC GAATTTAGCG TTGAAGATAA CGGACCGGGC
ATTGCACCGG AGCATATTCC GCGCCTGACC GAGCGTTTTT ATCGCGTTGA TAAAGCGCGT
TCCCGGCAAA CCGGTGGTAG CGGATTAGGG TTAGCGATCG TGAAACATGC GGTGAATCAT
CACGAAAGTC GCCTGAATAT TGAGAGTACA GTAGGAAAAG GTACACGTTT CAGTTTTGTT
ATCCCGGAAC GTTTAATTGC CAAAAACAGC GATTAA
 
Protein sequence
MLERLSWKRL VLELLLCCLP AFILGAFFGY LPWFLLASVT GLLIWHFWNL LRLSWWLWVD 
RSMTPPPGRG SWEPLLYGLH QMQLRNKKRR RELGNLIKRF RSGAESLPDA VVLTTEEGGI
FWCNGLAQQI LGLRWPEDNG QNILNLLRYP EFTQYLKTRD FSRPLNLVLN TGRHLEIRVM
PYTHKQLLMV ARDVTQMHQL EGARRNFFAN VSHELRTPLT VLQGYLEMMD EQPLEGAVRE
KALHTMREQT QRMEGLVKQL LTLSKIEAAP TQLLNEKVDV PMMLRVVERE AQTLSQKKQT
FTFEIDNGLK VSGSEDQLRS AISNLVYNAV NHTPEGTHIT VRWQRVPHGA EFSVEDNGPG
IAPEHIPRLT ERFYRVDKAR SRQTGGSGLG LAIVKHAVNH HESRLNIEST VGKGTRFSFV
IPERLIAKNS D