Gene Information |
Locus tag | EcSMS35_3547 |
Symbol | |
ID | 6143611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3628037 |
End bp | 3629977 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618376 |
Product | regulatory protein CsrD |
Protein accession | YP_001745523 |
Protein GI | 170679868 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2200] FOG: EAL domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
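
The length fields in this table can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3628037
end_bp = 3629977

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length, remainder)  # 1941 646 0
```

Under these assumptions the computed values agree with the reported Gene Length (1941 bp) and Protein Length (646 aa).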
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000352708 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTAA CGACGAAATT TTCGGCCTTT GTTACGCTGC TCACCGGGTT AACAATTTTT GTGACTTTGC TGGGCTGTTC GCTAAGTTTC TACAACGCCA TTCAGTATAA GTTTAGTCAT CGTGTTCAGG CGGTGGCGAC GGCGATCGAT ACCCACCTGG TGTCGAATGA CTTCAGCACA TTAAGGCCAC AAATTACCGA ATTAATGATG TCGGCAGATA TCGTTCGTGT AGACCTGCTC CATGGTGATA AGCAGGTTTA TACCCTGGCC AGAAATGGTA GTTATCGTCC GGTTGGCACC AACGATCTAT TTCGTGAACT GAGCGTTCCG TTGATAAAGC ATCCGGGGAT GTCGCTGCGT CTGGTTTATC AGGATCCGAT GGGCAACTAT TTCCATTCGT TGATGACCAC CGCGCCGCTC ACGGGGGCGA TTGGCTTTAT CATTCTTATG CTCTTCCTGG CGGTACGCTG GTTACAACGG CAACTTGCCG GGCAAGAATT GCTGGAAACC CGGGCTACTC GTATCTTAAA CGGTGAGCGT GGCTCTAATG TGTTGGGAAC CATCTATGAA TGGCCGCCCA GAACCAGCAG TGCGCTGGAT ACGCTGCTTC GTGAAATTCA GAACGCACGC GAACAACACA GCCGTCTTGA TACGCTGATC CGCTCTTATG CCGCCCAGGA CATGAAAACC GGCCTCAATA ACCGACTCTT CTTCGATAAT CAGTTAGCAA CGTTACTGGA AGATCAGGAG AAAGTAGGTA CCCACGGGAT CGTGATGATG ATTCGTCTGC CGGATTTCAA TATGTTGAGT GATACCTGGG GGCACAGCCA GGTTGAAGAA CAGTTCTTCT CTCTGACGAA TCTGCTGTCG ACATTTATGA TGCGCTACCC TGGCGCACTG CTGGCGCGTT ACCACCGCAG TGATTTTGCT GCGCTGTTAC CGCACCGAAC GTTAAAAGAG GCAGAGAGCA TCGCCAGTCA GTTAATCAAA GCCGTCGATA CCTTGCCGAA CAATAAAATG CTCGATCGCG ACGATATGAT CCACATTGGT ATCTGTGCCT GGCGTAGTGG TCAGGATACC GAGCAGGTAA TGGAACATGC AGAGTCTGCC ACGCGTAATG CGGGATTGCA GGGCGGCAAT AGCTGGGCTA TTTACGATGA CTCGTTGCCT GAAAAAGGAC GCGGTAATGT TCGCTGGCGT ACGCTTATCG AGCAAATGCT GAGTCGCGGC GGCCCGCGCC TTTATCAAAA ACCGGCGGTT ACTCGCGAAG GTCAGGTTCA TCATCGCGAA CTCATGTGCC GCATCTTCGA TGGTAATGAA GAGGTTAGCT CGGCGGAGTA TATGCCGATG GTCTTGCAGT TTGGCTTATC GGAAGAGTAT GACCGCCTGC AAATCAGCCG TCTGATTCCA CTATTGCGTT ACTGGCCGGA GGAAAATCTG GCGATTCAGG TTACCGTTGA GTCGCTGATT CGCCCGCGTT TTCAGCGTTG GCTGCGCGAT ACGTTAATGC AATGTGAAAG ATCGCAACGA AAACGCATAA TTATTGAACT TGCAGAGGCC GATGTAGGTC AACATATCAG TCGCTTACAA CCTGTTATTC GTTTAGTGAA TGCTTTAGGG GTACGGGTAG CCGTCAACCA GGCTGGTTTG ACGCTGGTAA GCACCAGTTG GATCAAAGAA CTTAATGTTG AGTTACTCAA GCTCCATCCG GGGCTGGTCA GAAACATTGA GAAGCGAACG GAGAACCAGC TGCTGGTTCA AAGCCTGGTG GAAGCCTGCT CCGGGACCAG CACCCAGGTT 
TACGCCACCG GCGTGCGTTC GCGAAGCGAG TGGCAGACCC TGATTCAGCG CGGTGTTACA GGTGGGCAAG GGGATTTTTT CGCGTCCTCA CAGCCACTTG ATACTAACGT GAAAAAATAT TCACAAAGAT ACTCGGTTTA A
|
Protein sequence | MRLTTKFSAF VTLLTGLTIF VTLLGCSLSF YNAIQYKFSH RVQAVATAID THLVSNDFST LRPQITELMM SADIVRVDLL HGDKQVYTLA RNGSYRPVGT NDLFRELSVP LIKHPGMSLR LVYQDPMGNY FHSLMTTAPL TGAIGFIILM LFLAVRWLQR QLAGQELLET RATRILNGER GSNVLGTIYE WPPRTSSALD TLLREIQNAR EQHSRLDTLI RSYAAQDMKT GLNNRLFFDN QLATLLEDQE KVGTHGIVMM IRLPDFNMLS DTWGHSQVEE QFFSLTNLLS TFMMRYPGAL LARYHRSDFA ALLPHRTLKE AESIASQLIK AVDTLPNNKM LDRDDMIHIG ICAWRSGQDT EQVMEHAESA TRNAGLQGGN SWAIYDDSLP EKGRGNVRWR TLIEQMLSRG GPRLYQKPAV TREGQVHHRE LMCRIFDGNE EVSSAEYMPM VLQFGLSEEY DRLQISRLIP LLRYWPEENL AIQVTVESLI RPRFQRWLRD TLMQCERSQR KRIIIELAEA DVGQHISRLQ PVIRLVNALG VRVAVNQAGL TLVSTSWIKE LNVELLKLHP GLVRNIEKRT ENQLLVQSLV EACSGTSTQV YATGVRSRSE WQTLIQRGVT GGQGDFFASS QPLDTNVKKY SQRYSV
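
The sequences above are printed in space-separated blocks of ten residues. The following is a minimal Python sketch for stripping that grouping and computing GC content. `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample holds only the first three blocks of the gene sequence, so its result is not expected to match the GC content reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, copied from the record.
sample = "ATGAGATTAA CGACGAAATT TTCGGCCTTT"

print(len(clean_sequence(sample)))
print(round(gc_percent(sample), 2))
```

Passing the full gene sequence in place of `sample` would give a value to compare against the GC content field.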
|
| |