Gene EcSMS35_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3547 
Symbol 
ID6143611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3628037 
End bp3629977 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content51% 
IMG OID641618376 
Productregulatory protein CsrD 
Protein accessionYP_001745523 
Protein GI170679868 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000352708 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTAA CGACGAAATT TTCGGCCTTT GTTACGCTGC TCACCGGGTT AACAATTTTT 
GTGACTTTGC TGGGCTGTTC GCTAAGTTTC TACAACGCCA TTCAGTATAA GTTTAGTCAT
CGTGTTCAGG CGGTGGCGAC GGCGATCGAT ACCCACCTGG TGTCGAATGA CTTCAGCACA
TTAAGGCCAC AAATTACCGA ATTAATGATG TCGGCAGATA TCGTTCGTGT AGACCTGCTC
CATGGTGATA AGCAGGTTTA TACCCTGGCC AGAAATGGTA GTTATCGTCC GGTTGGCACC
AACGATCTAT TTCGTGAACT GAGCGTTCCG TTGATAAAGC ATCCGGGGAT GTCGCTGCGT
CTGGTTTATC AGGATCCGAT GGGCAACTAT TTCCATTCGT TGATGACCAC CGCGCCGCTC
ACGGGGGCGA TTGGCTTTAT CATTCTTATG CTCTTCCTGG CGGTACGCTG GTTACAACGG
CAACTTGCCG GGCAAGAATT GCTGGAAACC CGGGCTACTC GTATCTTAAA CGGTGAGCGT
GGCTCTAATG TGTTGGGAAC CATCTATGAA TGGCCGCCCA GAACCAGCAG TGCGCTGGAT
ACGCTGCTTC GTGAAATTCA GAACGCACGC GAACAACACA GCCGTCTTGA TACGCTGATC
CGCTCTTATG CCGCCCAGGA CATGAAAACC GGCCTCAATA ACCGACTCTT CTTCGATAAT
CAGTTAGCAA CGTTACTGGA AGATCAGGAG AAAGTAGGTA CCCACGGGAT CGTGATGATG
ATTCGTCTGC CGGATTTCAA TATGTTGAGT GATACCTGGG GGCACAGCCA GGTTGAAGAA
CAGTTCTTCT CTCTGACGAA TCTGCTGTCG ACATTTATGA TGCGCTACCC TGGCGCACTG
CTGGCGCGTT ACCACCGCAG TGATTTTGCT GCGCTGTTAC CGCACCGAAC GTTAAAAGAG
GCAGAGAGCA TCGCCAGTCA GTTAATCAAA GCCGTCGATA CCTTGCCGAA CAATAAAATG
CTCGATCGCG ACGATATGAT CCACATTGGT ATCTGTGCCT GGCGTAGTGG TCAGGATACC
GAGCAGGTAA TGGAACATGC AGAGTCTGCC ACGCGTAATG CGGGATTGCA GGGCGGCAAT
AGCTGGGCTA TTTACGATGA CTCGTTGCCT GAAAAAGGAC GCGGTAATGT TCGCTGGCGT
ACGCTTATCG AGCAAATGCT GAGTCGCGGC GGCCCGCGCC TTTATCAAAA ACCGGCGGTT
ACTCGCGAAG GTCAGGTTCA TCATCGCGAA CTCATGTGCC GCATCTTCGA TGGTAATGAA
GAGGTTAGCT CGGCGGAGTA TATGCCGATG GTCTTGCAGT TTGGCTTATC GGAAGAGTAT
GACCGCCTGC AAATCAGCCG TCTGATTCCA CTATTGCGTT ACTGGCCGGA GGAAAATCTG
GCGATTCAGG TTACCGTTGA GTCGCTGATT CGCCCGCGTT TTCAGCGTTG GCTGCGCGAT
ACGTTAATGC AATGTGAAAG ATCGCAACGA AAACGCATAA TTATTGAACT TGCAGAGGCC
GATGTAGGTC AACATATCAG TCGCTTACAA CCTGTTATTC GTTTAGTGAA TGCTTTAGGG
GTACGGGTAG CCGTCAACCA GGCTGGTTTG ACGCTGGTAA GCACCAGTTG GATCAAAGAA
CTTAATGTTG AGTTACTCAA GCTCCATCCG GGGCTGGTCA GAAACATTGA GAAGCGAACG
GAGAACCAGC TGCTGGTTCA AAGCCTGGTG GAAGCCTGCT CCGGGACCAG CACCCAGGTT
TACGCCACCG GCGTGCGTTC GCGAAGCGAG TGGCAGACCC TGATTCAGCG CGGTGTTACA
GGTGGGCAAG GGGATTTTTT CGCGTCCTCA CAGCCACTTG ATACTAACGT GAAAAAATAT
TCACAAAGAT ACTCGGTTTA A
 
Protein sequence
MRLTTKFSAF VTLLTGLTIF VTLLGCSLSF YNAIQYKFSH RVQAVATAID THLVSNDFST 
LRPQITELMM SADIVRVDLL HGDKQVYTLA RNGSYRPVGT NDLFRELSVP LIKHPGMSLR
LVYQDPMGNY FHSLMTTAPL TGAIGFIILM LFLAVRWLQR QLAGQELLET RATRILNGER
GSNVLGTIYE WPPRTSSALD TLLREIQNAR EQHSRLDTLI RSYAAQDMKT GLNNRLFFDN
QLATLLEDQE KVGTHGIVMM IRLPDFNMLS DTWGHSQVEE QFFSLTNLLS TFMMRYPGAL
LARYHRSDFA ALLPHRTLKE AESIASQLIK AVDTLPNNKM LDRDDMIHIG ICAWRSGQDT
EQVMEHAESA TRNAGLQGGN SWAIYDDSLP EKGRGNVRWR TLIEQMLSRG GPRLYQKPAV
TREGQVHHRE LMCRIFDGNE EVSSAEYMPM VLQFGLSEEY DRLQISRLIP LLRYWPEENL
AIQVTVESLI RPRFQRWLRD TLMQCERSQR KRIIIELAEA DVGQHISRLQ PVIRLVNALG
VRVAVNQAGL TLVSTSWIKE LNVELLKLHP GLVRNIEKRT ENQLLVQSLV EACSGTSTQV
YATGVRSRSE WQTLIQRGVT GGQGDFFASS QPLDTNVKKY SQRYSV