Gene EcSMS35_0858 details

Gene Information

Locus tag: EcSMS35_0858
Symbol:
ID: 6146660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 864657
End bp: 867005
Gene Length: 2349 bp
Protein Length: 782 aa
Translation table: 11
GC content: 42%
IMG OID: 641615746
Product: cyclic diguanylate phosphodiesterase domain-containing protein
Protein accession: YP_001742938
Protein GI: 170681940
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID:
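
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 864657
end_bp = 867005
gene_length_bp = 2349
protein_length_aa = 782


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)            # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the listed gene length (2349 bp) and protein length (782 aa) agree with the coordinates.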


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAGTT TATACGAAAA GATAAAGATA AGGCTGATAA TTTTATTTTT ATTGGCAGCA 
CTGTCATTTA TTGGTCTTTT TTTCATCATT AACTATCAAC TGGTATCGGA ACGCGCGGTA
AAACGTGCCG ATAGCCGCTT TGAACTTATT CAGAAAAACG TTGGCTATTT CTTTAAAGAT
ATTGAACGTT CGGCCCTGAC ATTAAAGGAC TCACTATATT TATTAAAAAA TACAGAGGAG
ATTCAACGCG CCGTAATTCT GAAAATGGAA ATGATGCCAT TTTTAGACTC GGTGGGACTG
GTACTTGATG ATAATAAATA TTATCTCTTT TCGCGGAGGA CGAATGATAA AATCGTTGTT
TATCATCAGG AACAAGTAAA TGGACCGCTT GTCGACGAGT CAGGGCGGGT TATTTTTGCC
GATTTTAACC CATCGAAACG ACCGTGGTCG GTGGCTTCAG ATGACTCTAA CAACAGCTGG
TATCCGGCAT ACAATTGCTT TGATCGTCCG GGTAAAAAAT GTATCTCTTT TACGCTACGC
ATCAACGGCA AAGATCACGA TTTGTTAGCG GTGGATAAAA TACATGTCGA TTTAAACTGG
CGATATCTGA ACGAGTATCT TGATCACATC AGCGCTAATG ATGAAGTTCT ATTTTTGAAA
CAAGGCCATG AGATCATTGC CAAGAATCAA CTCGCGCGTG AAAAACTGAT TATTTATAAT
AGCGAAGGTA ATTATAATAT TATTGATTCT GTCGATACTG AATATATCGA AAAAACATCA
GTGGTGCCAA ACAACGCATT ATTCGAAATC TATTTTTATT ATCCTGGCGG TAATTTATTG
AACGCATCAG ATAAACTTTT TTATCTGCCG TTTGCGTTCA TTATTATCGT ATTGTTGGTG
GTTTATTTAA TGACCACTCG TGTGTTCCGT CGGCAATTTT CTGAAATGAC CGAGCTGGTT
AATACGCTGG CGTTTTTGCC CGACTCAACG GATCAGATCG AGGCTCTGAA AATTCGCGAA
GGCGATGCGA AAGAGATTAT CAGCATCAAA AATTCGATCG CGGAAATGAA AGATGCCGAA
ATTGAACGGT CAAATAAATT GCTCTCACTG ATCTCTTACG ATCAGGAAAG CGGTTTTATT
AAAAATATGG CGATTATTGA GTCCAACAAT AATCAGTATC TGGCTGTGGG GATCATCAAA
CTGTGTGGTC TGGAAGCCGT GGAAGCGGTG TTTGGTGTTG ATGAACGCAA TAAAATCGTC
AGAAAATTGT GTCAGCGAAT TGCCGAGAAA TATGCGCAAT GCTGCGATAT CGTGACATTT
AATGCCGATC TCTATTTACT CCTGTGCCGG GAAAATGTAC AGACGTTTAC CCGTAAGATA
GCGACGGTAA ACGATTTTGA CAGCAGTTTT GGCTACCGCA ATCTGCGCAT CCATAAGTCT
GCTATTTGTG AACCTTTGCA GGGGGAAAAC GCCTGGAGTT ACGCAGAAAA GCTGAAACTG
GCGATTTCCA GTATCCGCAA CCATATGTTC TCAGAGTTTA TTTTCTGTGA TGATGCGAAA
CTCAACGAAA TAGAAGAGAA TATCTGGATT GCGCGTAATA TTCGCCATGC AATGGAAATT
GGCGAACTAT TCCTCGTCTA TCAACCGATC GTTGATATTA ACACCCGCGC CATTCTGGGC
GCGGAGGCGT TGTGCCGTTG GGTGTCTGCG GAGCGGGGGA TCATTTCACC GCTAAAGTTC
ATTACCATTG CTGAAGATAT CGGGTTTATC AATGAGCTGG GTTATCAGAT TATTAAAACC
GCGATGGGTG AATTCAGACA TTTTAGTCAG CGTGCGGTCC TGAAGGACGG TTTCTTACTG
CATATTAATG TTTCGCCCTG GCAGTTAAAC GAACCACACT TTCATGAGCG TTTTACCACC
ATCATGGAAG AAAATGGCCT GAAGGTGAAC AGCCTCTGTG TTGAGATCAC TGAAACCGTG
ATTGAGCGAA TTAATGAACA TTTTTATCTC AATATTGAAC AACTGCGTAA ACAAGGGGTA
CGGATATCGA TTGATGACTT TGGCACCGGT TTGTCAAACC TGAAACGTTT TTATGAAATT
AATCCAGATA GCATAAAAGT GGACTCACAA TTTACCGGCG ATATTTTCGG TACTGCGGGA
AAAATTGTGC GCATTATTTT CGATCTGGCA CGCTATAACC GGATCCCGGT GATTGCGGAA
GGCGTAGAGA GCGAAGACGT TGCGCGCGAA TTAATCAAAT TAGGATGTGT TCAGGCTCAG
GGGTATCTGT ACCAGAAACC CATGCCGTTC TCCGCCTGGG ATAAAAGTGG AAAATTAGTA
AAAGAGTAG
 
Protein sequence
MLSLYEKIKI RLIILFLLAA LSFIGLFFII NYQLVSERAV KRADSRFELI QKNVGYFFKD 
IERSALTLKD SLYLLKNTEE IQRAVILKME MMPFLDSVGL VLDDNKYYLF SRRTNDKIVV
YHQEQVNGPL VDESGRVIFA DFNPSKRPWS VASDDSNNSW YPAYNCFDRP GKKCISFTLR
INGKDHDLLA VDKIHVDLNW RYLNEYLDHI SANDEVLFLK QGHEIIAKNQ LAREKLIIYN
SEGNYNIIDS VDTEYIEKTS VVPNNALFEI YFYYPGGNLL NASDKLFYLP FAFIIIVLLV
VYLMTTRVFR RQFSEMTELV NTLAFLPDST DQIEALKIRE GDAKEIISIK NSIAEMKDAE
IERSNKLLSL ISYDQESGFI KNMAIIESNN NQYLAVGIIK LCGLEAVEAV FGVDERNKIV
RKLCQRIAEK YAQCCDIVTF NADLYLLLCR ENVQTFTRKI ATVNDFDSSF GYRNLRIHKS
AICEPLQGEN AWSYAEKLKL AISSIRNHMF SEFIFCDDAK LNEIEENIWI ARNIRHAMEI
GELFLVYQPI VDINTRAILG AEALCRWVSA ERGIISPLKF ITIAEDIGFI NELGYQIIKT
AMGEFRHFSQ RAVLKDGFLL HINVSPWQLN EPHFHERFTT IMEENGLKVN SLCVEITETV
IERINEHFYL NIEQLRKQGV RISIDDFGTG LSNLKRFYEI NPDSIKVDSQ FTGDIFGTAG
KIVRIIFDLA RYNRIPVIAE GVESEDVARE LIKLGCVQAQ GYLYQKPMPF SAWDKSGKLV
KE
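
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the tool used to generate this record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which codons may initiate translation), so alternative start codons are not handled here; this gene starts with ATG.

```python
# Codon table in the conventional T, C, A, G ordering of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence listing.
first_row = "ATGCTGAGTT TATACGAAAA GATAAAGATA AGGCTGATAA TTTTATTTTT ATTGGCAGCA"
print(translate(first_row))  # MLSLYEKIKIRLIILFLLAA
```

Passing the full gene sequence to `translate` should give the listed protein sequence once its spaces are removed, and `gc_percent` on the same input can be compared with the GC content field.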