Gene ECH74115_5366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5366 
SymbolcpxA 
ID6972233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5006675 
End bp5008048 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content55% 
IMG OID643389020 
Producttwo-component sensor protein 
Protein accessionYP_002273429 
Protein GI209398760 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.135229 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGGCA GCTTAACCGC GCGCATCTTC GCCATCTTCT GGCTGACGCT GGCGCTGGTG 
TTGATGTTGG TTTTGATGTT ACCCAAGCTC GATTCACGCC AGATGACCGA GCTTCTGGAT
AGCGAACAGC GTCAGGGTCT GATGATTGAG CAGCATGTTG AAGCGGAGCT GGCGAACGAT
CCGCCCAACG ATTTAATGTG GTGGCGGCGT CTGTTCCGGG CGATTGATAA GTGGGCACCG
CCAGGACAGC GTTTGTTATT GGTGACCACC GAAGGCCGCG TGATCGGCGC TGAACGCAGC
GAAATGCAGA TCATTCGTAA CTTTATTGGT CAGGCCGATA ACGCCGATCA TCCGCAGAAG
AAAAAGTATG GCCGCGTGGA ACTGGTCGGT CCGTTCTCCG TGCGTGATGG CGAAGATAAT
TACCAACTTT ATCTGATTCG TCCGGCCAGC AGTTCTCAAT CCGATTTCAT TAACTTACTG
TTTGACCGCC CGCTATTACT GCTGATTGTC ACCATGTTGG TCAGTACGCC GCTGCTGTTG
TGGTTGGCCT GGAGTCTGGC AAAACCGGCG CGTAAGCTGA AAAACGCTGC CGATGAAGTT
GCCCAGGGAA ACTTACGCCA GCACCCGGAA CTGGAAGCGG GGCCACAGGA ATTCCTTGCC
GCAGGTGCCA GTTTTAACCA GATGGTCACC GCGCTGGAGC GCATGATGAC CTCTCAGCAG
CGTCTGCTTT CTGATATCTC TCACGAGCTG CGCACCCCGC TGACGCGTCT GCAACTGGGT
ACGGCGTTAC TGCGCCGTCG TAGTGGTGAA AGCAAGGAAC TGGAGCGTAT TGAAACCGAA
GCGCAACGTC TGGACAGCAT GATTAACGAC CTGTTGGTGA TGTCACGTAA TCAGCAAAAA
AACGCGCTGG TTAGCGAGAC CATCAAAGCC AATCAGTTGT GGAGTGAAGT GCTGGATAAC
GCGGCGTTCG AAGCCGAGCA AATGGGCAAG TCGTTGACAG TTAACTTCCC GCCTGGGCCG
TGGCCGCTGT ACGGCAACCC GAACGCCCTG GAGAGTGCGC TGGAAAACAT TGTTCGTAAT
GCCCTGCGTT ATTCCCATAC GAAGATTGAA GTGGGCTTTG CGGTAGATAA AGACGGTATC
ACCATTACGG TGGACGACGA TGGTCCTGGC GTTAGCCCGG AAGATCGCGA ACAGATTTTC
CGTCCGTTCT ATCGGACCGA TGAAGCGCGC GATCGTGAAT CTGGCGGTAC AGGTTTGGGA
CTGGCGATTG TTGAAACCGC CATTCAGCAG CATCGTGGCT GGGTGAAAGC AGAAGACAGC
CCGCTGGGCG GTTTACGGCT GGTGATTTGG TTGCCGCTGT ATAAGCGGAG TTAA
 
Protein sequence
MIGSLTARIF AIFWLTLALV LMLVLMLPKL DSRQMTELLD SEQRQGLMIE QHVEAELAND 
PPNDLMWWRR LFRAIDKWAP PGQRLLLVTT EGRVIGAERS EMQIIRNFIG QADNADHPQK
KKYGRVELVG PFSVRDGEDN YQLYLIRPAS SSQSDFINLL FDRPLLLLIV TMLVSTPLLL
WLAWSLAKPA RKLKNAADEV AQGNLRQHPE LEAGPQEFLA AGASFNQMVT ALERMMTSQQ
RLLSDISHEL RTPLTRLQLG TALLRRRSGE SKELERIETE AQRLDSMIND LLVMSRNQQK
NALVSETIKA NQLWSEVLDN AAFEAEQMGK SLTVNFPPGP WPLYGNPNAL ESALENIVRN
ALRYSHTKIE VGFAVDKDGI TITVDDDGPG VSPEDREQIF RPFYRTDEAR DRESGGTGLG
LAIVETAIQQ HRGWVKAEDS PLGGLRLVIW LPLYKRS