Gene EcSMS35_4304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4304 
SymbolcpxA 
ID6143490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4406029 
End bp4407402 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content54% 
IMG OID641619125 
Producttwo-component sensor protein 
Protein accessionYP_001746249 
Protein GI170683679 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.335985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00831139 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATAGGCA GCTTAACCGC GCGCATCTTC GCCATCTTCT GGCTGACGCT GGCGCTGGTG 
TTGATGTTGG TTTTGATGTT ACCCAAGCTC GATTCACGCC AGATGACCGA GCTTCTGGAT
AGCGAACAGC GTCAGGGTCT GATGATTGAG CAGCATGTCG AAGCGGAGCT GGCGAACGAT
CCGCCCAACG ATTTAATGTG GTGGCGGCGT CTGTTCCGGG CGATTGATAA GTGGGCACCG
CCAGGACAGC GTTTGTTATT GGTGACCACC GAAGGCCGCG TGATCGGCGC TGAACGCAGC
GAAATGCAGA TCATTCGTAA CTTTATTGGT CAGGCCGATA ACGCCGATCA TCCGCAGAAG
AAAAAGTATG GCCGCGTGGA ACTGGTCGGT CCGTTCTCCG TGCGTGATGG CGAAGATAAT
TACCAACTTT ATCTGATTCG TCCGGCCAGC AGTTCTCAAT CCGATTTCAT TAACTTACTG
TTTGACCGCC CGCTATTACT GCTGATTGTC ACCATGTTGG TCAGTACGCC GCTGCTGTTG
TGGTTGGCCT GGAGTCTGGC AAAACCGGCG CGTAAGCTGA AAAACGCTGC CGATGAAGTT
GCCCAGGGAA ACTTACGCCA GCACCCGGAA CTTGAAGCGG GGCCACAGGA ATTCCTTGCC
GCTGGTGCCA GTTTTAACCA GATGGTCACC GCGCTGGAGC GCATGATGAC CTCTCAGCAG
CGTCTGCTTT CTGATATCTC TCACGAGCTG CGCACCCCGC TGACGCGTCT GCAACTGGGT
ACGGCGTTAC TGCGCCGTCG TAGCGGTGAA AGCAAAGAAC TGGAGCGTAT TGAAACCGAA
GCGCAACGTC TGGACAGCAT GATTAACGAC CTGTTGGTGA TGTCACGTAA TCAGCAGAAA
AACGCACTGG TTAGCGAGAC CATCAAAGCC AATCAGTTGT GGAGTGAAGT GCTGGATAAC
GCGGCGTTCG AAGCCGAGCA AATGGGCAAG TCGTTGACAG TTAACTTCCC GCCTGGGCCG
TGGCCGCTGT ACGGCAATCC GAACGCCCTG GAAAGTGCGC TGGAAAACAT TGTTCGTAAT
GCCCTGCGTT ATTCCCATAC GAAGATTGAA GTGGGCTTTG CGGTAGATAA AGACGGTATC
ACCATTACGG TGGACGACGA TGGTCCTGGT GTTAGCCCGG AAGATCGCGA ACAGATTTTC
CGTCCGTTCT ATCGTACCGA TGAAGCACGC GATCGTGAAT CTGGCGGTAC AGGTTTGGGG
CTGGCGATTG TTGAAACCGC CATCCAGCAG CACCGTGGCT GGGTTAAAGC GGAAGACAGT
CCGCTTGGCG GTTTACGGCT GGTGATTTGG TTGCCGCTGT ATAAGCGGAG TTAA
 
Protein sequence
MIGSLTARIF AIFWLTLALV LMLVLMLPKL DSRQMTELLD SEQRQGLMIE QHVEAELAND 
PPNDLMWWRR LFRAIDKWAP PGQRLLLVTT EGRVIGAERS EMQIIRNFIG QADNADHPQK
KKYGRVELVG PFSVRDGEDN YQLYLIRPAS SSQSDFINLL FDRPLLLLIV TMLVSTPLLL
WLAWSLAKPA RKLKNAADEV AQGNLRQHPE LEAGPQEFLA AGASFNQMVT ALERMMTSQQ
RLLSDISHEL RTPLTRLQLG TALLRRRSGE SKELERIETE AQRLDSMIND LLVMSRNQQK
NALVSETIKA NQLWSEVLDN AAFEAEQMGK SLTVNFPPGP WPLYGNPNAL ESALENIVRN
ALRYSHTKIE VGFAVDKDGI TITVDDDGPG VSPEDREQIF RPFYRTDEAR DRESGGTGLG
LAIVETAIQQ HRGWVKAEDS PLGGLRLVIW LPLYKRS