Gene EcSMS35_0500 details


Gene Information

Locus tag: EcSMS35_0500
Symbol:
ID: 6146235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 505380
End bp: 506930
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 48%
IMG OID: 641615394
Product: cyclic diguanylate phosphodiesterase domain-containing protein
Protein accession: YP_001742601
Protein GI: 170682326
COG category: [T] Signal transduction mechanisms
COG ID: [COG4943] Predicted signal transduction protein containing sensor and EAL domains
TIGRFAM ID:
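
The start, end, and length fields above are related by simple arithmetic. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive (which is what the listed 1551 bp length implies) and that the final codon is a stop codon; the function name cds_lengths is hypothetical and not part of the record:

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One residue per codon; the stop codon contributes no residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp copied from the record.
print(cds_lengths(505380, 506930))  # (1551, 516)
```

Both numbers agree with the Gene Length and Protein Length fields of the record.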


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAACAC GACATCTGGT CGGCCTTATT TCGGGAGTAC TGATTCTTTC AGTATTGCTG 
CCTGTCGGCT TAAGCATCTG GCTGGCCCAT CAGCAGGTAG AAACATCATT TATTGAAGAG
CTGAATACGT ATTCCTCCCG CGTCGCTATT CGGGCCAATA AGGTGGCGAC ACAAGGGAAA
GATGCGCTGC AAGAGCTGGA AAGATGGCAA GGCGCTGCCT GTAGCGAAGC CCATCTCATG
GAAATGCGTC GGGTATCTTA CAGTTATCGC TATATTCAGG AAGTGGTTTA TATCGATAAC
AACGTTCCCC AGTGTTCGTC TCTGGAGCAT GAAAGCCCGC CCGATACCTT CCCCGAGCCA
GGTAAAATTT CGAAAGATGG TTATCGTGTC TGGTTAACAT CGCATAACGA TTTAGGCATT
ATCCGTTACA TGGTCGCCAT GGGAACGGCA CATTATGTCG TCATGATCGA CCCCGCTTCC
TTTATTGATG TCATTCCCTA TAGCTCATGG CAAATTGATG CCGCCATTAT TGGCAATGCC
CATAACGTTG TGATAACCAG CAGTGATGAA ATTGCTCAGG GAATTATTAC CAGACTACAA
AAAACACCCG GTGAGCATAT CGAAAATAAT GGAATCATTT ACGATATCCT GCCCTTACCG
GAGATGAATA TTTCGATCAT CACCTGGGCT TCAACGAAAA TGTTGCAGAA AGGCTGGCAT
CGGCAAGTCT TTATTTGGTT ACCGTTCGGG TTGGTGATTG GCCTGCTGGC AGCGATGTTT
GTGCTGCGTA TTTTGCGCCG TATCCAGTCA CCGCATCATC GGCTGCAGGA TGCTATCGAA
AATCGTGATA TTTGCGTGCA CTATCAGCCG ATTGTCTCCT TAGCCAATGG CAAAATTGTC
GGTGCTGAGG CACTGGCGCG CTGGCCGCAG ACAGACGGTA GTTGGTTGTC ACCAGATAGT
TTTATTCCGC TGGCACAGCA AACGGGGCTT TCTGAGCCGT TGACGCTACT GATTATAAAA
AGTGTCTTTG AAGATATGGG CGACTGGCTG CGTCAGCATC CGCAGCAGCA TATTTCGATC
AATCTTGAAT CCACCGTGCT CACCTCGGAA AAAATCCCGC AATTGCTGCG TGAAATGATC
AATCACTATC AGGTTAATCC CAGACAGATC GCGCTTGAAC TCACTGAACG CGAGTTTGCC
GATCCGAAAA CCAGCGCCCC GATAATTTCT CGCTACCGGG AGGCGGGCCA TGAAATTTAT
CTCGATGATT TTGGTACGGG GTATTCAAGT TTAAGCTATT TACAGGATCT GGATGTCGAC
ATTCTGAAGA TCGATAAATC TTTCGTTGAT GCGCTGGAAT ATAAAAATGT CACGCCACAT
ATCATCGAAA TGGCAAAAAC ACTGAAACTG AAAATGGTAG CGGAGGGAAT CGAAACCAGT
AAACAAGAAG AGTGGCTACG TCAGCATAGC GTGCACTACG GCCAGGGCTG GCTCTACAGC
AAGGCATTAC CGAAAGAGGA TTTCTTACGC TGGGCCGAGC AACATTTGTG A
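
The GC content field of the record can be recomputed from the gene sequence. The following is a rough sketch only; the helper name gc_percent is hypothetical, and the way the database rounds its 48% figure is not stated in the record. Whitespace between the 10-base blocks is ignored.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_percent("GTGAGAACAC"))  # 50.0
```

Passing the complete gene sequence as one string (line breaks and spaces included) gives the figure for the whole gene.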
 
Protein sequence
MRTRHLVGLI SGVLILSVLL PVGLSIWLAH QQVETSFIEE LNTYSSRVAI RANKVATQGK 
DALQELERWQ GAACSEAHLM EMRRVSYSYR YIQEVVYIDN NVPQCSSLEH ESPPDTFPEP
GKISKDGYRV WLTSHNDLGI IRYMVAMGTA HYVVMIDPAS FIDVIPYSSW QIDAAIIGNA
HNVVITSSDE IAQGIITRLQ KTPGEHIENN GIIYDILPLP EMNISIITWA STKMLQKGWH
RQVFIWLPFG LVIGLLAAMF VLRILRRIQS PHHRLQDAIE NRDICVHYQP IVSLANGKIV
GAEALARWPQ TDGSWLSPDS FIPLAQQTGL SEPLTLLIIK SVFEDMGDWL RQHPQQHISI
NLESTVLTSE KIPQLLREMI NHYQVNPRQI ALELTEREFA DPKTSAPIIS RYREAGHEIY
LDDFGTGYSS LSYLQDLDVD ILKIDKSFVD ALEYKNVTPH IIEMAKTLKL KMVAEGIETS
KQEEWLRQHS VHYGQGWLYS KALPKEDFLR WAEQHL
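
The protein sequence corresponds to the gene sequence translated with translation table 11, where an alternative start codon such as GTG is read as methionine when it begins the CDS. A minimal sketch under stated assumptions: the codon table and the start-codon set below are written out by hand from the NCBI bacterial genetic code (table 11) as commonly documented, and the function name translate_cds is hypothetical. This is not the database's own pipeline.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored. A recognised start codon at position 0 is
    translated as M. Codons with non-ACGT letters raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAGAACAC GACATCTGGT CGGCCTTATT"))  # MRTRHLVGLI
```

The output matches the first block of the protein sequence; the last three codons of the gene (CAT TTG TGA) likewise give the final residues HL followed by the stop.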