Gene EcSMS35_1683 details

Gene Information

Locus tag: EcSMS35_1683
Symbol:
ID: 6142740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1685009
End bp: 1686391
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 45%
IMG OID: 641616559
Product: diguanylate cyclase
Protein accession: YP_001743737
Protein GI: 170683103
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
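
The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 1685009
end_bp = 1686391

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1383 460
```

Under these assumptions the result agrees with the listed Gene Length (1383 bp) and Protein Length (460 aa).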


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATGT ATTTTAAAAG AATGAAAGAT GAGTGGACCG GGCTTGTCGA ACAAGCAGAT 
CCGCTCATTC GTGCTAAAGC CGCGGAAATT GCCGTTGCGC ATGCCCATTA TCTGAGTATT
GAGTTTTACC GAATTGTCCG TATCGACCCG CATGCCGAAG AATTCTTGAG TAATGAACAA
GTTGAGCGGC AGTTGAAAAG TGCGATGGAA CGCTGGATTA TTAACGTGCT TTCTGCCCAG
GTTGACGATG TCGAAAGGCT AATACAAATC CAGCATACCG TCGCGGAAGT GCATGCCCGC
ATAGGAATTC CGGTAGAAAT TGTCGAGATG GGGTTTCGGG TGCTGAAAAA AATCCTCTAT
CCGGTCATCT TCTCTTCGGA TTATTCCGCC GCGGAAAAAC TTCAGGTCTA CCATTTCTCG
ATTAACAGTA TTGATATCGC TATGGAAGTG ATGACCCGCG CGTTTACCTT TAGTGACAGT
AGTGCCTCAA AGGAAGATGA AAACTATCGT ATCTTCTCGT TACTGGAAAA TGCCGAAGAA
GAAAAAGAAC GGCAAATAGC CTCAATACTT TCATGGGAAA TAGATATTAT CTATAAAGTC
CTGCTGGATT CTGATTTAGG CAGTAGTTTG CCTTTAAGCC AGGCTGATTT TGGCCTGTGG
TTTAACCATA AAGGTCGACA TTATTTTAGC GGCATAGCCG AAGTGGGCCA TATCTCCCGT
CTGATTCAGG ATTTCGACGG TATTTTCAAT CAAACCATGC GTAACACCAG GAATTTGAAT
AACAGAAGTT TACGGGTGAA ATTTTTATTA CAGATAAGAA ATACCGTATC GCAAATTATT
ACCTTGTTGC GTGAATTGTT TGAAGAAGTA TCGCGCCACG AAGTCGGTAT GGATGTACTG
ACGAAATTAC TTAACCGCCG TTTCCTGCCG ACGATCTTCA AACGCGAAAT TGCCCATGCC
AACCGGACCG GTACACCGCT GTCAGTGCTG ATTATTGACG TTGATAAATT CAAAGAGATT
AACGATACGT GGGGCCATAA CACTGGCGAT GAAATTCTGC GTAAAGTCTC TCAGGCCTTT
TATGACAACG TCCGCAGTAG TGATTATGTT TTCCGCTACG GGGGCGATGA ATTTATCATT
GTTTTGACTG AAGCTTCGGA AAACGAAACG TTACGTACCG CAGAACGTAT TCGCAGTCGG
GTGGAGAAAA CCAAACTGAA AGCCGCAAAC GGCGAGGATA TTGCCCTCTC ACTTTCCATC
GGTGCCGCCA TGTTTAATGG TCATCCTGAC TATGAGCGCC TCATTCAAAT AGCCGATGAA
GCTCTGTATA TCGCCAAAAG ACGAGGTAGA AACCGTGTTG AACTCTGGAA AGCCAGTCTT
TAG
 
Protein sequence
MEMYFKRMKD EWTGLVEQAD PLIRAKAAEI AVAHAHYLSI EFYRIVRIDP HAEEFLSNEQ 
VERQLKSAME RWIINVLSAQ VDDVERLIQI QHTVAEVHAR IGIPVEIVEM GFRVLKKILY
PVIFSSDYSA AEKLQVYHFS INSIDIAMEV MTRAFTFSDS SASKEDENYR IFSLLENAEE
EKERQIASIL SWEIDIIYKV LLDSDLGSSL PLSQADFGLW FNHKGRHYFS GIAEVGHISR
LIQDFDGIFN QTMRNTRNLN NRSLRVKFLL QIRNTVSQII TLLRELFEEV SRHEVGMDVL
TKLLNRRFLP TIFKREIAHA NRTGTPLSVL IIDVDKFKEI NDTWGHNTGD EILRKVSQAF
YDNVRSSDYV FRYGGDEFII VLTEASENET LRTAERIRSR VEKTKLKAAN GEDIALSLSI
GAAMFNGHPD YERLIQIADE ALYIAKRRGR NRVELWKASL
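
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: it builds the codon-to-amino-acid mapping used by translation table 11 (which assigns the same amino acids as the standard code) and translates only the first three 10-base blocks of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGAGATGT ATTTTAAAAG AATGAAAGAT"
print(translate(first_blocks))  # MEMYFKRMKD
```

The output matches the first block of the protein sequence (MEMYFKRMKD). Passing the full gene sequence to the same function would be the natural way to check the remaining residues.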