Gene Information |
Locus tag | EcSMS35_2545 |
Symbol | |
ID | 6142810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2605027 |
End bp | 2607216 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641617417 |
Product | diguanylate cyclase |
Protein accession | YP_001744588 |
Protein GI | 170684005 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2200] FOG: EAL domain |
TIGRFAM ID | |
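
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates for Start bp and End bp, and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2605027, 2607216)
    print(length, protein_length(length))
```

With the Start bp and End bp values of this record, the sketch gives 2190 bp and 729 aa, which agrees with the Gene Length and Protein Length fields.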
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00846864 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGTGG AGCATAACCT GATAAAAAAT ATCAAGATAT TCACACTAGC GTTTACGCTC ACCGTGGTAC TTATTCAGCT ATCCCGTTTT ATTTCGCCAC TTGCCATTAT CCATTCCAGT TATATCTTTC TGGCGTGGAT GCCACTGTGC GTAATGCTGT CAATCTTGTT TATCTTTGGC TGGCGCGGTG TCGTTCCCGT TTTATGCGGG ATGTTTTGCA CCAATCTGTG GAACTTTCAT CTCTCTTTTT TACAGACCGC GGTCATGCTC GGCAGCCAGA CGTTTGTTGT GTTGTGTGCC TGCGCAATAT TACGCTGGCA GCTGGGGACG CGTTGGCGTT ATGGATTGAC CAGCCGATAT GTCTGGCAAC GTCTGTTCTG GCTTGGTTTG GTGGCGCCGA TTGGCATCAA ATGCAGCATG TATCTTGTGG GAAATTTCTT TGATTTTCCG CTAAAGATAT CTACCTTTTT CGGCGATGCG GATGCCATTT TCACGGTCGT TGATTTGCTA AGCCTTTTCA CCGCAGTGCT GATTTACAAC ATGCTTTTTT ACTATCTCAC CCGCATGATT GTAAGTCCCC ACTTTGCACA GATATTGTGG CGTAGGGATA TCGCTCCGTC GTTGAGCAAA GAGAAACGCG CATTTACCTT AAGCTGGCTG GCAGCTCTTA GCGTGCTGCT ACTTCTGATG TGCACACCGT ATGAAAATGA CTTTATTGCC GGTTACCTGG TACCTGTTTT CTTTATCATC TTTACCCTCG GTGTCGGTAA GCTTCGCTAT CCGTTTTTAA ATCTCACCTG GGCTGTTTCA ACGCTTTGCC TTCTGAATTA CAACCAGAAC TTTTTGCAAG GGGTACTAAC CGAATATTCG CTGGCATTTA TTCTCGCAGT ACTGATTTCC TTTAGTGTTT GCCTGCTCTA TATGGTGCGT ATTTATCATC GTAGTGAATG GCTTAACCGC CGCTGGCATT TGCAGGCGCT GACCGATCCG TTAACGCTTC TACCCAACTT TCGTGCGTTG GAGCAAGCGC CGGAGCAAGA GGCGGGCAAG AGTTTTTGCT GCCTGCGCAT TGATAATCTT GAGTTTATGA GTCGTCATTA CGGCTTAATG ATGCGCGTTC ACTGTATCCG CTCAATTTAC CGTACGCTGC TGCCGTTGAT GCAGGAAAAC GAAAAGTTGT ATCAATTGCC GGGTAGTGAA CTGCTGATAG TGCTGAGCGG GCCGGAAACG GAAGGGCGAC TCCAGCATAT GGTTAACATC CTGAATAGTC GGCAAATTCA CTGGAACAAT ACCGGGCTGG ATATGGGCTA TGGTGCTGCC TGGGGGCGTT TTGATGGAAA TCAGGAAACC CTGCAACCCT TGTTGGGGCA GTTAAGCTGG CTGGCGGAAC AATCCTGCGC TCATCATCAT GTGCTGGCGC TGGATAGCAG AGAGGAGATG GTTTCCGGGC AGACCACTAA ACAGGTGCTA TTGCTGAATA CCATTCGTAC TGCATTAGAT CAGGGCGATT TGCTGCTCTA CGCCCAGCCA ATTCGCAACA AAGAGGGTGA AGGTTATGAT GAGATCCTCG CGCGACTGAA ATATGACGGC GGCATTATGA CCCCGGATAA GTTTCTGCCC CTTATTGCTC AGTTTAACCT TAGCGCGCGT TTTGATTTGC AAGTGCTGGA ATCCTTGTTG AAGTGGCTGG CAACACACCC TTGCGACAAA AAAGGTCCGC GCTTTTCAGT CAATTTAATG CCGCTCACGC TGCTGCAAAA AAATATCGCC GGGCGGATTA TTCGTCTGTT TAAGCGTTAT 
CACATCTCCC CGCAGGCGGT CATTCTTGAG ATCACCGAGG AGCAGGCGTT TTCTAACGCA GAAAGCAGCA TGTACAACAT CGAGCAACTG CATAAGTTTG GTTTCCGGAT TGCGATTGAT GACTTTGGCA CCGGCTATGC CAACTACGAA CGGTTAAAGC GTTTGCAGGC TGATATCATC AAAATTGATG GTGTCTTTGT GAAAGATATC GTCACGAATA CGCTGGATGC GATGATTGTG AGATCAATTA CCGATCTGGC GAAAGCGAAG TCATTGAGTG TGGTCGCGGA GTTTGTCGAG ACGCAACAGC AGCAGGCGCT ATTGCATAAG CTCGGGGTGC AATATCTGCA AGGGTATTTG ATTGGTCGCC CGCAGCCATT AGCTGATTAA
|
Protein sequence | MFVEHNLIKN IKIFTLAFTL TVVLIQLSRF ISPLAIIHSS YIFLAWMPLC VMLSILFIFG WRGVVPVLCG MFCTNLWNFH LSFLQTAVML GSQTFVVLCA CAILRWQLGT RWRYGLTSRY VWQRLFWLGL VAPIGIKCSM YLVGNFFDFP LKISTFFGDA DAIFTVVDLL SLFTAVLIYN MLFYYLTRMI VSPHFAQILW RRDIAPSLSK EKRAFTLSWL AALSVLLLLM CTPYENDFIA GYLVPVFFII FTLGVGKLRY PFLNLTWAVS TLCLLNYNQN FLQGVLTEYS LAFILAVLIS FSVCLLYMVR IYHRSEWLNR RWHLQALTDP LTLLPNFRAL EQAPEQEAGK SFCCLRIDNL EFMSRHYGLM MRVHCIRSIY RTLLPLMQEN EKLYQLPGSE LLIVLSGPET EGRLQHMVNI LNSRQIHWNN TGLDMGYGAA WGRFDGNQET LQPLLGQLSW LAEQSCAHHH VLALDSREEM VSGQTTKQVL LLNTIRTALD QGDLLLYAQP IRNKEGEGYD EILARLKYDG GIMTPDKFLP LIAQFNLSAR FDLQVLESLL KWLATHPCDK KGPRFSVNLM PLTLLQKNIA GRIIRLFKRY HISPQAVILE ITEEQAFSNA ESSMYNIEQL HKFGFRIAID DFGTGYANYE RLKRLQADII KIDGVFVKDI VTNTLDAMIV RSITDLAKAK SLSVVAEFVE TQQQQALLHK LGVQYLQGYL IGRPQPLAD
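
The sequences in this record are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned, checked for GC content, and translated is given below. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. Alternative start codons are not handled specially, and all function names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    fragment = "ATGTTTGTGG AGCATAACCT GATAAAAAAT"
    print(translate(fragment))
```

Applied to the first three blocks of the gene sequence, the sketch yields MFVEHNLIKN, the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be compared in the same way.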
|
| |