Gene EcSMS35_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1842 
Symbol 
ID6145922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1866223 
End bp1868208 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content49% 
IMG OID641616718 
ProductRNase II stability modulator 
Protein accessionYP_001743896 
Protein GI170681077 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0112439 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000000244061 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACCG TTAGGGAGTC CACAACGTTG TACAACTTTC TCGGATCGCA CAATCCATAC 
TGGCGGTTAA CGGAAAGCAG CGATGTTTTG CGCTTTTCTA CCACCGAAAC CACAGAACCT
GAACGTACGT TGCAGTTATC TGCCGAACAG GCTGCTCGCA TCAGGGAAAT GACGGTCATC
ACCTCCAGCC TGATGATGAG TCTGACCGTC GATGAAAGCG ATCTTTCTGT GCATCTGGTA
GGACGAAAAA TCAATAAACG GGAATGGGCT GGCAACGCGT CTGCATGGCA TGACACACCA
GCAGTTGCTC GTGATTTATC ACACGGGCTT TCCTTTGCTG AGCAGGTAGT TTCTGAAGCA
CATTCCGCAA TAGTGATTCT CGACAGCCGG GGGAATATCC AACGCTTCAA TCGGTTATGT
GAAGATTACA CTGGGTTGAA AGAACACGAC GTCATTGGCC AAAGCGTGTT TAAACTGTTT
ATGAGCCGTC GTGAAGCTGC GGCATCCAGG CGCAATAACC GTGTATTTTT TCGAAGCGGC
AATGCATATG AAGTCGAACT GTGGATACCA ACACGTAAAG GCCAGCGGCT GTTTCTGTTT
CGCAATAAAT TTGTCCACAG CGGCAGTGGC AAAAACGAGA TTTTTTTAAT CTGTTCCGGC
ACCGACATTA CCGAAGAGCG CCGCGCTCAG GAGCGACTGC GTATTCTGGC AAATACCGAC
AGTATCACCG GACTGCCGAA TCGTAATGCA ATGCAGGAGT TAATCGATCA CGCTATTAAT
CAGGCAGATA ACAATAAAGT TGGGGTTGTG TATCTTGATT TGGATAATTT TAAAAAGGTC
AACGACGCCT ATGGGCATTT ATTTGGTGAC CAGTTATTAC GCGACGTGTC ATTGGCCATT
TTAAGCTGTC TCGAACATGA TCAGGTGTTG GCGCGTCCAG GTGGCGATGA GTTTCTGGTA
CTGGCATCCA ATACCTCACA AAGCGCGCTG GAAGCAATGG CATCACGAAT TTTGACCCGC
TTACGACTCC CCTTCCGCAT TGGTTTAATT GAAGTTTATA CCAGCTGTTC AGTAGGTATT
TCACTCTCTC CCGAACATGG TTCGGACAGC GCGGCTATTA TTCGTCACGC CGACACAGCA
ATGTACACAG CGAAGGAAGG CGGACGAGGA CAATTTTGCG TTTTTACCCC AGAAATGAAT
CAGAGGGTAT TTGAATATCT CTGGCTGGAT ACCAACTTGC GTAAAGCACT GGAAAACGAT
CAGTTGGTTA TTCACTATCA ACCGAAAATC ACCTGGCGTG GCGAAGTGCG CAGTCTGGAA
GCACTAGTAC GTTGGCAGTC ACCTGAACGT GGGTTGATTC CACCGTTGGA CTTCATTTCC
TACGCCGAAG AGTCAGGGCT AATTGTGCCT TTAGGCCGTT GGGTGATTCT CGATGTCGTA
CGCCAGGTGG CAAAGTGGCG GGATAAAGGT ATTAACCTGC GAGTGGCGGT AAACATTTCT
GCACGTCAGC TCGCCGATCA AACCATTTTC ACCGCCCTGA AACAGGTTCT CCAGGAACTC
AATTTTGAAT ACTGCCCTAT TGATGTTGAA CTGACGGAGA GCTGCCTGAT TGAGAATGAT
GAACTGGCAC TGTCTGTTAT TCAACAATTT AGCCAACTAG GTGCGCAAGT GCATCTGGAC
GATTTTGGTA CCGGCTACTC TTCACTTTCG CAACTGGCGC GCTTTCCGAT CGATGCCATC
AAACTTGACC AGGTTTTTGT TCGGGATATT CACAAACAGC CTGTCTCGCA GTCACTGGTC
CGGGCGATCG TCGCTGTGGC ACAGGCATTG AATCTTCAGG TGATCGCTGA AGGAGTGGAG
AGTAAAAAGG AAGATGCTTT TTTAACCAAG AACGGGATCA ATGAGCGGCA AGGATTTTTG
TTTGCCAAAC CGATGCCCGC CGTCGCCTTC GAACGCTGGT ATAAACGCTA TCTGAAGCGC
GCATAA
 
Protein sequence
MKTVRESTTL YNFLGSHNPY WRLTESSDVL RFSTTETTEP ERTLQLSAEQ AARIREMTVI 
TSSLMMSLTV DESDLSVHLV GRKINKREWA GNASAWHDTP AVARDLSHGL SFAEQVVSEA
HSAIVILDSR GNIQRFNRLC EDYTGLKEHD VIGQSVFKLF MSRREAAASR RNNRVFFRSG
NAYEVELWIP TRKGQRLFLF RNKFVHSGSG KNEIFLICSG TDITEERRAQ ERLRILANTD
SITGLPNRNA MQELIDHAIN QADNNKVGVV YLDLDNFKKV NDAYGHLFGD QLLRDVSLAI
LSCLEHDQVL ARPGGDEFLV LASNTSQSAL EAMASRILTR LRLPFRIGLI EVYTSCSVGI
SLSPEHGSDS AAIIRHADTA MYTAKEGGRG QFCVFTPEMN QRVFEYLWLD TNLRKALEND
QLVIHYQPKI TWRGEVRSLE ALVRWQSPER GLIPPLDFIS YAEESGLIVP LGRWVILDVV
RQVAKWRDKG INLRVAVNIS ARQLADQTIF TALKQVLQEL NFEYCPIDVE LTESCLIEND
ELALSVIQQF SQLGAQVHLD DFGTGYSSLS QLARFPIDAI KLDQVFVRDI HKQPVSQSLV
RAIVAVAQAL NLQVIAEGVE SKKEDAFLTK NGINERQGFL FAKPMPAVAF ERWYKRYLKR
A