Gene EcSMS35_2131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2131 
SymboltorS 
ID6144177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2140482 
End bp2143184 
Gene Length2703 bp 
Protein Length900 aa 
Translation table11 
GC content54% 
IMG OID641617007 
Producthybrid sensory histidine kinase TorS 
Protein accessionYP_001744182 
Protein GI170681062 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR02956] TMAO reductase sytem sensor TorS 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTTG CCCTGATGGC GCTGTTAACC CTGACCAGTA CCCTGGTGGG ATGGTACAAC 
CTGCGCTTTA TCAGCCAGGT GGAAAAAGAC AACACTCAGG CATTGATTCC TACCATGAAT
ATGGCGCGCC AGTTGAGCGA AGCCAGCGCC TGGGAACTTT TCGCCGCGCA GAACCTGACC
AGTGCCGATA ACGAAAAGAT GTGGCAGGCG CAGGGGCGAA TGCTCACCGC ACAAAGCCTG
AAGATTAATG CGTTGCTGCA AGCGTTACGG GAACAAGGTT TTGACACCAC CGCTATTGAA
CTACAGGAGC AGGAGATCTC CCGTTCGTTA CGTCAGCAAG GGGAACTGGT GGGACAGCGT
TTGCAACTTC GCCAGCAACA ACAGCAACTC AGTCAGCAGA TAGTCGCCGC CGCCGATGAA
ATCGCACGCC TGGCACAAGG TCAGGCGAAT AATGCGGCAA CCTCCGCCGG AGCGACCCAG
GCCGGGATTT ACGATTTGAT CGAACAAGAT CAGCGTCAGG CCGCAGAAAG TGCACTCGAT
CGGCTGATTG ATATCGATCT TGAGTATGTT AACCAGATGA ATGAACTGCG CCTTAGCGCC
CTGCGGGTGC AGCAAATGGT GATGAATCTG GGGCTGGAGC AGATCCAGAA AAATGCAGCA
ACGCTGGAAC ACCAGCTCAA TAATGCGGTG AAAATTCTGC AACGTCGGCA AATACGCATT
GAAGATCCGG GAGTTCGTGC TCAGGTCGCA ACAACGTTAA CTACCGTTAG CCAATATAGC
GATTTGCTGG CGCTGTATCA GCAGGACAGT GAAATCAGCA ATCGCCTGCA AACTCTCGCC
CAAAATAACA TCGCCCAGTT CGCGCAGTTT AGTAGCGAAG TCAGTCAGCT GGTCGACACC
ATTGAGCTGC GTAATCAGCA CGGACTGGCG CATCTGGAAA AAGCCAGTGC ACGCGGGCAA
TACAGCCTGT TATTGCTGGG GATGGTTTCA CTTTGCGCAC TGATTCTGAT CCTCTGGCGC
GTGGTTTATC GCTCAGTCAC GCGTCCACTT GCCGAACAAA CGCAGGCGCT GCAACGGCTG
CTGGACGGTG ATATCGACTC CCCTTTCCCG GAAACCGCTG GCGTACGGGA GCTGGATACC
ATCGGGCGGC TGATGGATGC GTTTCGCAGC AGTGTTCATG CACTGAATCG CCACCGTGAA
CAGCTGGCGG CGCAGGTCAA AGCGCGTACA GCTGAATTGC AGGAACTGGT GATAGAACAC
CGACAGGCAC GGGCGGAAGC AGAAAAAGCC AGCCAGGCAA AATCGGCGTT TCTGGCGGCG
ATGAGCCATG AGATCCGCAC ACCGCTGTAC GGTATTCTCG GCACCGCGCA ACTGCTGGCA
GATAACCCGG CACTTAACAC CCAGCGTGAT GATTTGCGGG CAATTACTGA TTCTGGCGAA
TCGTTGCTGA CCATCCTCAA CGATATTCTC GATTATTCGG CTATCGAAGC GGGTGGCAAG
AATGTTTCGG TCAGCGATGA GCCCTTTGAA CCGCGCCCGC TGCTGGAAAG TACCCTGCAA
TTAATGAGCG GACGGGTTAA AGGTCGCCCG ATTCACCTGG CAACAGCAAT TGCCGACGAT
GTACCGACCG CGTTAATGGG CGATCCGCGA CGTATTCGTC AGGTTATAAC CAACCTGTTG
AGCAACGCCC TGCGTTTTAC TGACGAAGGG CAGATCGTTT TACGTAGCCG CACTGATGGC
GAGCAATGGC TGGTCGAAGT GGAAGACAGC GGCTGCGGTA TTGATCCCGC GAAACTGGCA
GAAATCTTCC AGCCATTTGT GCAGGTAAGC GGCAAACGCG GCGGCACCGG GCTGGGGCTG
ACTATCAGTA GCCGTCTGGC CCAGGCGATG GGCGGCGAAC TGAGCGCCAC CAGCACGCCG
GAGGTTGGAA GCTGTTTTTG TTTACGCTTG CCGTTACGTG TTGCCACTGC TCCCGTGCCA
AAAACAGTCA ATCAGGCGGT GCGTCTGGAC GGTTTACGTT TGCTGTTAAT TGAAGATAAC
CCGCTAACCC AGCGAATTAC CGTTGAGATG CTGAACACCA GTGGTGCGCA GGTTGTTGCT
GTTGGCAATG CCGCGCAGGC TTTAGAGACA CTGCAAAATA GCGAACCGTT TGCTGCCGCA
CTGGTGGATT TTGATCTACC GGATATCGAC GGCATTACGC TTGCCCGACA ACTGGCACAG
CAATATCCGT CGCTGGTTTT GATTGGCTTT AGCGCCCATG TCATTGACGA AACACTGCGA
CAGCGTACCA GTTCGCTATT TCGCGGGATT ATCCCTAAAC CGGTGCCGCG TGAAGTGCTC
GGTCAATTAC TGGCGCACTA TCTCCAACTG CAAGTCAATA ACGATCAACC GCTGGATGTA
TCGCAACTCA ATGAAGATGC TCAGTTGATG GGGACGGAGA AGATCCACGA ATGGCTGATA
TTATTTAAAC AACATGCCCT GCCGCTTCTC GATGACATCG ACATTGCCCG CGCCAGCCAG
AACAGCGAAA AAATAAAGCG TGCCGCACAT CAGCTAAAAA GCAGTTGCTC AAGTCTGGGA
ATGCGTAGCG CCAGCCAGCT TTGCGCACAA CTGGAGCAGC AGCCATTATC TGCCCCCCTT
CCACACGAAG AAATTACACG CAGTGTTGCC GCTCTGGAAG CATGGTTAAT AAGAAAGACC
TGA
 
Protein sequence
MGFALMALLT LTSTLVGWYN LRFISQVEKD NTQALIPTMN MARQLSEASA WELFAAQNLT 
SADNEKMWQA QGRMLTAQSL KINALLQALR EQGFDTTAIE LQEQEISRSL RQQGELVGQR
LQLRQQQQQL SQQIVAAADE IARLAQGQAN NAATSAGATQ AGIYDLIEQD QRQAAESALD
RLIDIDLEYV NQMNELRLSA LRVQQMVMNL GLEQIQKNAA TLEHQLNNAV KILQRRQIRI
EDPGVRAQVA TTLTTVSQYS DLLALYQQDS EISNRLQTLA QNNIAQFAQF SSEVSQLVDT
IELRNQHGLA HLEKASARGQ YSLLLLGMVS LCALILILWR VVYRSVTRPL AEQTQALQRL
LDGDIDSPFP ETAGVRELDT IGRLMDAFRS SVHALNRHRE QLAAQVKART AELQELVIEH
RQARAEAEKA SQAKSAFLAA MSHEIRTPLY GILGTAQLLA DNPALNTQRD DLRAITDSGE
SLLTILNDIL DYSAIEAGGK NVSVSDEPFE PRPLLESTLQ LMSGRVKGRP IHLATAIADD
VPTALMGDPR RIRQVITNLL SNALRFTDEG QIVLRSRTDG EQWLVEVEDS GCGIDPAKLA
EIFQPFVQVS GKRGGTGLGL TISSRLAQAM GGELSATSTP EVGSCFCLRL PLRVATAPVP
KTVNQAVRLD GLRLLLIEDN PLTQRITVEM LNTSGAQVVA VGNAAQALET LQNSEPFAAA
LVDFDLPDID GITLARQLAQ QYPSLVLIGF SAHVIDETLR QRTSSLFRGI IPKPVPREVL
GQLLAHYLQL QVNNDQPLDV SQLNEDAQLM GTEKIHEWLI LFKQHALPLL DDIDIARASQ
NSEKIKRAAH QLKSSCSSLG MRSASQLCAQ LEQQPLSAPL PHEEITRSVA ALEAWLIRKT