Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3838 |
Symbol | |
ID | 6147336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3908168 |
End bp | 3910117 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618664 |
Product | putative phosphodiesterase |
Protein accession | YP_001745804 |
Protein GI | 170684297 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain [COG2200] FOG: EAL domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.261716 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCAG CCGTTGTCCT GGTGTTCGTT TTTATTTTTT GCACCGTTTT GCTGTTCCAT CTGGTCCAGC AGAATCGCTA TAACACGGCT ACGCAACTGG AAAGCATTGC TCGCTCTGTC CGCGAACCCT TATCTTCTGC CATTTTGAAA GGCGATATTC CCGAAGCGGA AGCTATTCTT GCCAGCATTA AACCGGCAGG CGTGGTCAGC CGTGCCGATG TGGTGCTGCC TAACCAGTTC CAGGCGCTGC GTAAAAGTTT TATTCCAGAG CGTCCGGTGC CGGTAATGGT TACTCGCCTG TTTGAGCTAC CGGTGCAAAT CTCGCTGGGC GTTTACTCGC TGGAACGTCC GGCAAACCCG CAGCCAATTG CCTATCTGGT GCTACAGGCG GATTCCTTCC GCATGTATAA GTTCGTGATG AGCACCCTCT CAACGTTAGT GACCATTTAC TTACTTTTGT CGTTAATATT GACGGTGGCG ATTAGCTGGT GCATTAACCG CCTGATTTTG CATCCATTAC GCAATATTGC TCGCGAACTT AACGCCATCC CAGCCCAGGA GCTTGTTGGT CACCAACTGG CATTACCGCG TCTGCATCAG GACGATGAAA TCGGTATGTT GGTGCGCAGT TACAACCTCA ACCAGCAATT GCTGCAGCGC CATTATGAAG AACAGAACGA AAATGCGATG CGCTTCCCGG TGTCGGATTT GCCGAACAAA GCCTTGCTGA TGGAGATGCT GGAGCAGGTG GTGGCGCGTA AACAAACCAC CGCGCTGATG ATCATCACCT GTGAAACCCT GCGTGATACT GCGGGTGTGC TGAAAGAGGC GCAACGAGAA ATTCTGCTGC TGACGCTGGT GGAAAAACTC AAATCGGTAC TGTCGCCACG TATGATCCTC GCGCAGATTA GCGGTTATGA CTTTGCTGTC ATTGCCAACG GTGTACAGGA ACCGTGGCAC GCAATCACTT TGGGTCAGCA AGTGCTCACT ATCATGAGCG AGCGCCTGCC GATTGAACGT ATTCAACTCC GTCCGCACTG TAGCATTGGC GTGGCGATGT TCTACGGCGA TCTCACCGCC GAACAGCTTT ATAGCCGCGC TATTTCTGCG GCATTTACCG CTCGCCATAA AGGCAAGAAT CAGATTCAGT TCTTTGATCC GCAGCAGATG GAAGCAGCCC AACAGCGGTT GACGGAAGAG AGCGATATCC TTAATGCACT GGAAAATCAT CAGTTTGCGA TTTGGTTACA GCCACAGGTC GAGATGACCA GCGGTAAACT GGTCAGTGCG GAAGTGTTAC TGCGTATCCA GCAACCGGAT GGCAGTTGGG ATCTGCCGGA TGGCTTAATC GATCGCATTG AATGCTGTGG GCTGATGGTT ACCGTTGGTC ACTGGGTGCT GGAAGAGTCC TGTCGACTGC TTGCTGCCTG GCAAGAGCGC GGCATTATGC TGCCCTTGTC GGTAAACCTC TCAGCGCTGC AACTGATGCA CCCGAATATG GTGGCGGATA TGCTGGAACT GTTAACCCGC TATCGCATTC AGCCGGGAAC ACTGATTCTG GAAGTGACAG AAAGCCGACG TATTGACGAC CCTCATGCTG CGGTGGCAGT CCTCCGTCCG CTGCGCAATG CCGGCGTTCG GGTGGCGCTG GATGATTTCG GCATGGGCTA TGCGGGGCTG CGTCAGCTGC AGCATATGAA ATCGTTGCCA ATCGACGTAC TGAAAATCGA CAAAATGTTT GTTGAAGGCT TGCCGGAAGA TAGCAGCATG ATTGCTGCAA TTATCATGCT GGCGCAGAGC CTGAACTTAC AAATGATTGC CGAAGGCGTG GAGACTGAAG CACAACGCGA CTGGCTGGCA AAAGCGGGCG TTGGTATTGC CCAGGGCTTC CTTTTTGCTC GCCCACTCCC TATTGAAATC TTCGAAGAGA GTTACCTGGA AGAAAAGTAG
|
Protein sequence | MVAAVVLVFV FIFCTVLLFH LVQQNRYNTA TQLESIARSV REPLSSAILK GDIPEAEAIL ASIKPAGVVS RADVVLPNQF QALRKSFIPE RPVPVMVTRL FELPVQISLG VYSLERPANP QPIAYLVLQA DSFRMYKFVM STLSTLVTIY LLLSLILTVA ISWCINRLIL HPLRNIAREL NAIPAQELVG HQLALPRLHQ DDEIGMLVRS YNLNQQLLQR HYEEQNENAM RFPVSDLPNK ALLMEMLEQV VARKQTTALM IITCETLRDT AGVLKEAQRE ILLLTLVEKL KSVLSPRMIL AQISGYDFAV IANGVQEPWH AITLGQQVLT IMSERLPIER IQLRPHCSIG VAMFYGDLTA EQLYSRAISA AFTARHKGKN QIQFFDPQQM EAAQQRLTEE SDILNALENH QFAIWLQPQV EMTSGKLVSA EVLLRIQQPD GSWDLPDGLI DRIECCGLMV TVGHWVLEES CRLLAAWQER GIMLPLSVNL SALQLMHPNM VADMLELLTR YRIQPGTLIL EVTESRRIDD PHAAVAVLRP LRNAGVRVAL DDFGMGYAGL RQLQHMKSLP IDVLKIDKMF VEGLPEDSSM IAAIIMLAQS LNLQMIAEGV ETEAQRDWLA KAGVGIAQGF LFARPLPIEI FEESYLEEK
|
| |