Gene EcSMS35_1405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1405 
Symbol 
ID6143368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1386779 
End bp1388269 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content46% 
IMG OID641616283 
Productdiguanylate cyclase 
Protein accessionYP_001743463 
Protein GI170683497 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.856707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGC ACCATAGAAT GCTCCGGCAT TTTATCGCCG CAAGTGTCAT TGTGCTGACA 
TCTTCCTTCC TTATTTTTGA ACTTGTCGCC AGTGACAGAG CAATGAGTGC CTATCTGCGC
TATATCGTGC AGAGAGCAGA CTCCTCCTTT CTTTATGATA AGTATCAGAA TCAGAGTATT
GCCGCGCATG TGATGCGCGC TCTCGCTGCT GAGCAGTCGG AAGTGTCGCC AGAACAGCGG
CGAGCCATCT GCGAGGCTTT TGAGTCTGCC AATAATACCC ATGGCTTAAA CCTGACTGCC
CATAAATACC CTGGCTTACG CGGCACACTA CAAACCGCAT CCACTGACTG CGACACAATT
GTGGAAGCTG CTGCACTATT ACCCGCTTTT GATCAGGCAG TGGAAGGCAA CCGCCACCAG
GATGATTACG GTTCAGGTCT TGGGATGGCC GAAGAGAAAT TTCACTATTA TCTCGATCTC
AATGACCGCT ATGTCTATTT TTATGAGCCG GTTAATGTTG AGTACTTTGC GATGAATAAC
TGGTCCTTCC TGCAGTCAGG AAGTATTGGC ATCGATCGCA AAGATATTGA AAAGGTATTT
ACCGGGCGTA CCGTATTATC GAGCATTTAC CAGGATCAGC GTACTAAACA GAACGTGATG
AGTTTGCTGA CGCCGGTATA TGTCGCAGGG CAGCTAAAAG GGATTGTGCT GCTGGATATT
AACAAAAACA ATCTGCGGAA TATCTTTTAC ACTCATGACC GCCCTCTCCT CTGGCGTTTT
CTCAATGTCA CGCTAACTGA TACCGATTCG GGGCGCGACA TTATCATCAA CCAGAGCGAA
GATAATCTGT TCCAGTATGT CAGTTACGTC CATGACTTAC CGGGCGGCAT TCGTGTCTCG
TTATCCATTG ATATTCTTTA CTTTATCACG TCTTCGTGGA AAAGCGTTCT GTTCTGGATT
TTGACGGCGT TAATTTTGCT GAATATGGTG CGGATGCACT TCCGTTTATA CCAAAATGTG
TCGCGAGAAA ATATTAGTGA TGCGATGACT GGACTCTATA ATCGCAAAAT TTTAACCCCT
GAACTGGAGC AGCGGTTGCA GAAACTGGTG CAATCCGGTT CTTCGGTGAT GTTTATTGCA
ATTGACATGG ACAAGTTAAA GCAAATAAAT GACACCCTTG GTCATCAGGA GGGGGATTTA
GCGATTACGT TATTAGCTCA GGCGATTAAA CAATCGATTC GTAAAAGTGA TTATGCCATC
CGACTCGGTG GCGATGAATT CTGCATCATT CTTGTCGATT CGACACCGCA AATTGCAGCA
CAACTGCCTG AACGTATCGA AAAACGTCTG CAACATATCG CGCCGCAGAA AGAGATCGGC
TTCTCTTCCG GTATTTACGC GATGAAAGAA AACGATACGT TACATGATGC GTATAAAGCT
TCCGATGAGC GTTTATATGT CAATAAGCAG AACAAAAACA GCCGTTCATG A
 
Protein sequence
MKLHHRMLRH FIAASVIVLT SSFLIFELVA SDRAMSAYLR YIVQRADSSF LYDKYQNQSI 
AAHVMRALAA EQSEVSPEQR RAICEAFESA NNTHGLNLTA HKYPGLRGTL QTASTDCDTI
VEAAALLPAF DQAVEGNRHQ DDYGSGLGMA EEKFHYYLDL NDRYVYFYEP VNVEYFAMNN
WSFLQSGSIG IDRKDIEKVF TGRTVLSSIY QDQRTKQNVM SLLTPVYVAG QLKGIVLLDI
NKNNLRNIFY THDRPLLWRF LNVTLTDTDS GRDIIINQSE DNLFQYVSYV HDLPGGIRVS
LSIDILYFIT SSWKSVLFWI LTALILLNMV RMHFRLYQNV SRENISDAMT GLYNRKILTP
ELEQRLQKLV QSGSSVMFIA IDMDKLKQIN DTLGHQEGDL AITLLAQAIK QSIRKSDYAI
RLGGDEFCII LVDSTPQIAA QLPERIEKRL QHIAPQKEIG FSSGIYAMKE NDTLHDAYKA
SDERLYVNKQ NKNSRS