Gene EcSMS35_3838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3838 
Symbol 
ID6147336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3908168 
End bp3910117 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content53% 
IMG OID641618664 
Productputative phosphodiesterase 
Protein accessionYP_001745804 
Protein GI170684297 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.261716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGCAG CCGTTGTCCT GGTGTTCGTT TTTATTTTTT GCACCGTTTT GCTGTTCCAT 
CTGGTCCAGC AGAATCGCTA TAACACGGCT ACGCAACTGG AAAGCATTGC TCGCTCTGTC
CGCGAACCCT TATCTTCTGC CATTTTGAAA GGCGATATTC CCGAAGCGGA AGCTATTCTT
GCCAGCATTA AACCGGCAGG CGTGGTCAGC CGTGCCGATG TGGTGCTGCC TAACCAGTTC
CAGGCGCTGC GTAAAAGTTT TATTCCAGAG CGTCCGGTGC CGGTAATGGT TACTCGCCTG
TTTGAGCTAC CGGTGCAAAT CTCGCTGGGC GTTTACTCGC TGGAACGTCC GGCAAACCCG
CAGCCAATTG CCTATCTGGT GCTACAGGCG GATTCCTTCC GCATGTATAA GTTCGTGATG
AGCACCCTCT CAACGTTAGT GACCATTTAC TTACTTTTGT CGTTAATATT GACGGTGGCG
ATTAGCTGGT GCATTAACCG CCTGATTTTG CATCCATTAC GCAATATTGC TCGCGAACTT
AACGCCATCC CAGCCCAGGA GCTTGTTGGT CACCAACTGG CATTACCGCG TCTGCATCAG
GACGATGAAA TCGGTATGTT GGTGCGCAGT TACAACCTCA ACCAGCAATT GCTGCAGCGC
CATTATGAAG AACAGAACGA AAATGCGATG CGCTTCCCGG TGTCGGATTT GCCGAACAAA
GCCTTGCTGA TGGAGATGCT GGAGCAGGTG GTGGCGCGTA AACAAACCAC CGCGCTGATG
ATCATCACCT GTGAAACCCT GCGTGATACT GCGGGTGTGC TGAAAGAGGC GCAACGAGAA
ATTCTGCTGC TGACGCTGGT GGAAAAACTC AAATCGGTAC TGTCGCCACG TATGATCCTC
GCGCAGATTA GCGGTTATGA CTTTGCTGTC ATTGCCAACG GTGTACAGGA ACCGTGGCAC
GCAATCACTT TGGGTCAGCA AGTGCTCACT ATCATGAGCG AGCGCCTGCC GATTGAACGT
ATTCAACTCC GTCCGCACTG TAGCATTGGC GTGGCGATGT TCTACGGCGA TCTCACCGCC
GAACAGCTTT ATAGCCGCGC TATTTCTGCG GCATTTACCG CTCGCCATAA AGGCAAGAAT
CAGATTCAGT TCTTTGATCC GCAGCAGATG GAAGCAGCCC AACAGCGGTT GACGGAAGAG
AGCGATATCC TTAATGCACT GGAAAATCAT CAGTTTGCGA TTTGGTTACA GCCACAGGTC
GAGATGACCA GCGGTAAACT GGTCAGTGCG GAAGTGTTAC TGCGTATCCA GCAACCGGAT
GGCAGTTGGG ATCTGCCGGA TGGCTTAATC GATCGCATTG AATGCTGTGG GCTGATGGTT
ACCGTTGGTC ACTGGGTGCT GGAAGAGTCC TGTCGACTGC TTGCTGCCTG GCAAGAGCGC
GGCATTATGC TGCCCTTGTC GGTAAACCTC TCAGCGCTGC AACTGATGCA CCCGAATATG
GTGGCGGATA TGCTGGAACT GTTAACCCGC TATCGCATTC AGCCGGGAAC ACTGATTCTG
GAAGTGACAG AAAGCCGACG TATTGACGAC CCTCATGCTG CGGTGGCAGT CCTCCGTCCG
CTGCGCAATG CCGGCGTTCG GGTGGCGCTG GATGATTTCG GCATGGGCTA TGCGGGGCTG
CGTCAGCTGC AGCATATGAA ATCGTTGCCA ATCGACGTAC TGAAAATCGA CAAAATGTTT
GTTGAAGGCT TGCCGGAAGA TAGCAGCATG ATTGCTGCAA TTATCATGCT GGCGCAGAGC
CTGAACTTAC AAATGATTGC CGAAGGCGTG GAGACTGAAG CACAACGCGA CTGGCTGGCA
AAAGCGGGCG TTGGTATTGC CCAGGGCTTC CTTTTTGCTC GCCCACTCCC TATTGAAATC
TTCGAAGAGA GTTACCTGGA AGAAAAGTAG
 
Protein sequence
MVAAVVLVFV FIFCTVLLFH LVQQNRYNTA TQLESIARSV REPLSSAILK GDIPEAEAIL 
ASIKPAGVVS RADVVLPNQF QALRKSFIPE RPVPVMVTRL FELPVQISLG VYSLERPANP
QPIAYLVLQA DSFRMYKFVM STLSTLVTIY LLLSLILTVA ISWCINRLIL HPLRNIAREL
NAIPAQELVG HQLALPRLHQ DDEIGMLVRS YNLNQQLLQR HYEEQNENAM RFPVSDLPNK
ALLMEMLEQV VARKQTTALM IITCETLRDT AGVLKEAQRE ILLLTLVEKL KSVLSPRMIL
AQISGYDFAV IANGVQEPWH AITLGQQVLT IMSERLPIER IQLRPHCSIG VAMFYGDLTA
EQLYSRAISA AFTARHKGKN QIQFFDPQQM EAAQQRLTEE SDILNALENH QFAIWLQPQV
EMTSGKLVSA EVLLRIQQPD GSWDLPDGLI DRIECCGLMV TVGHWVLEES CRLLAAWQER
GIMLPLSVNL SALQLMHPNM VADMLELLTR YRIQPGTLIL EVTESRRIDD PHAAVAVLRP
LRNAGVRVAL DDFGMGYAGL RQLQHMKSLP IDVLKIDKMF VEGLPEDSSM IAAIIMLAQS
LNLQMIAEGV ETEAQRDWLA KAGVGIAQGF LFARPLPIEI FEESYLEEK