Gene EcSMS35_4523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4523 
Symbol 
ID6147170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4623059 
End bp4624645 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content45% 
IMG OID641619339 
Productcyclic diguanylate phosphodiesterase domain-containing protein 
Protein accessionYP_001746451 
Protein GI170680947 
COG category[T] Signal transduction mechanisms 
COG ID[COG4943] Predicted signal transduction protein containing sensor and EAL domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCATC GTGCACGACA CCAATTACTG GCGTTGCCGG GCATTATCTT TTTAGTTCTC 
TTTCCCATCA TTCTGTCGCT ATGGATTGCC TTCTTTTGGG CAAAATCAGA AGTGAATAAT
CAGCTCCGAA CCTTTGCTCA GCTGGCGTTG GATAAATCTG AGCTGGTCAT TCGTCAGGCA
GATTTAGTGA GCGATGCAGC TGAACGCTAT CAGGGGCAAG TTTGCACCCC AGCCCATCAA
AAGCGAATGT TGAATATTAT TCGTGGCTAT CTTTATATTA ATGAATTGAT CTATGCCCGT
GATAACCATT TTTTATGCTC ATCGCTGATA GCGCCTGTAA ACGGCTATAC GATTGCACCG
GCCGATTATA AGCGTGAACC TAACGTTTCT ATCTATTATT ACCGCGATAC GCCTTTTTTC
TCTGGCTATA AAATGACCTA TATGCAGCGG GGAAATTATG TGGCGGTCAT CAACCCTCTC
TTCTGGAGTG AAGTGATGTC TGATGATCCG ACATTGCAAT GGGGTGTATA TGATACGGTA
ACGAAAACCT TTTTCTCGTT AAGCAACGAG GCCTCGGCAG CAACGTTTTC TCCACTGATT
CATTTGAAGG ATTTAACTGT ACAAAGAAAT GGCTATTTAT ATGCGACAGT TTATTCGACA
AAACGCCCAA TTGCGGCCAT TGTTGCGACT TCATATCAAC GTCTTATAGC GCATTTTTAT
AATCATCTTA TTTTTGCGTT ACCCGCCGGT ATTTTGGGGA GTCTTGTTCT GCTATTACTC
TGGCTACGTA TTCGACAAAA CTATTTGTCT CCCAAACGCA AATTGCAACG CGCCCTCGAA
AAACATCAAC TTTGCCTTTA TTACCAGCCA ATAATCGATA TCAAAACAGA AAAATGTATC
GGGGCTGAAG CATTGTTACG TTGGCCTGGT GAGCAGGGGC AAGTAATGAA TCCGGCAGAG
TTTATTCCGC TGGCAGAAAA GGAGGGGATG ATCGGACAGG TAACTGATTA TGTTATTGAT
AATGTCTTCC GCGATCTGGG CGCATACCTG GCAACACATG CCGATCGCTA TGTTTCTATT
AACCTGTCGG CCTCCGATTT TCATACGTCA CGGTTGATAG CGCGAACCAA TCAGAAAACA
GAGCAATACG CGGTGCGTCC ACAGCAAATT AAATTTGAAG TGACTGAACA TGCATTTCTT
GATGTCGACA AAATGACACC AATTATTCTG GCTTTCCGCC AGGCTGGTTA CGAAGTGGCA
ATTGATGATT TTGGTATTGG CTACTCTAAC TTGCATAACC TTAAATCATT GAATGTCGAT
ATTTTGAAAA TCGATAAATC GTTTGTTGAG ACGCTGACCA CCCATAAAAC CAGTCATTTG
ATTGCGGAAC ACATCATCGA GCTGGCGCAC AGTCTGGGGT TAAAAACGAT CGCTGAAGGC
GTCGAAACTG AGGAACAGGT TAACTGGCTG CGCAAACGCG GCGTGCGCTA TTGCCAGGGA
TGGTTCTTTG CGAAGGCGAT GCCGCCGCAG GTGTTTATGC AATGGATGGA GCAATTACCC
GCGCGGGAGT TAACGCGCGG GCAATAA
 
Protein sequence
MSHRARHQLL ALPGIIFLVL FPIILSLWIA FFWAKSEVNN QLRTFAQLAL DKSELVIRQA 
DLVSDAAERY QGQVCTPAHQ KRMLNIIRGY LYINELIYAR DNHFLCSSLI APVNGYTIAP
ADYKREPNVS IYYYRDTPFF SGYKMTYMQR GNYVAVINPL FWSEVMSDDP TLQWGVYDTV
TKTFFSLSNE ASAATFSPLI HLKDLTVQRN GYLYATVYST KRPIAAIVAT SYQRLIAHFY
NHLIFALPAG ILGSLVLLLL WLRIRQNYLS PKRKLQRALE KHQLCLYYQP IIDIKTEKCI
GAEALLRWPG EQGQVMNPAE FIPLAEKEGM IGQVTDYVID NVFRDLGAYL ATHADRYVSI
NLSASDFHTS RLIARTNQKT EQYAVRPQQI KFEVTEHAFL DVDKMTPIIL AFRQAGYEVA
IDDFGIGYSN LHNLKSLNVD ILKIDKSFVE TLTTHKTSHL IAEHIIELAH SLGLKTIAEG
VETEEQVNWL RKRGVRYCQG WFFAKAMPPQ VFMQWMEQLP ARELTRGQ