Gene EcSMS35_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1650 
Symbol 
ID6143487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1637306 
End bp1638724 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content49% 
IMG OID641616526 
Productdiguanylate cyclase 
Protein accessionYP_001743704 
Protein GI170683666 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTAC AGCCCATTTC AACTTTCCGT TTGTTCCAGG AAGGTCATCT GCTACGTAAT 
AGCATCGCTA TTTTTGTGCT AACCACGCTG TTCTATTTTA TTGGTGCAGA GTTACGGCTG
GTTCACGAAC TTTCTCTTTT CTGGCCGCTG AATGGCGTAA TGGCGGGGGT GTTTGCCCGC
TATGTCTGGC TTAATCGACT GCATTACTAT GCGATCAGTT ATGTGGCGAT GCTGGTTTAT
GATGCCATAA CCACCGAATG GGGGCTGGTT TCACTGGCTA TCAATTTCTC CAATATGATG
TTTATTGTTA CCGTCGCCTT ACTGGTCGCG CGGGATAAGC GTCTGGGGAA AAATAAGTAT
GAGCCAGTAA GTGCCTTACG GCTATTTAAT TACTGTCTGA TTGCCGCATT ATTATGCGCT
ATTGTCGGGG CGATTGGTTC GGTCAGTATT GATAGTCTGG ATTTCTGGCC TTTGCTTGCC
GACTGGTTCA GTGAGCAATT CTCAACGGGC GTGTTGATCG TGCCTTGTAT GCTGACGTTG
GCAATTCCTG GAGTACTGCC GCGCTTTAAA GCAGAGCAGA TGATGCCTGC TATCGCGCTT
ATTGTGTCGG TTATTGCCTC GGTAGTCATT GGCGGAGCGG GGAGTCTGGC GTTTCCGCTC
CCTGCATTAA TCTGGTGTGC AGTGCGCTAT ACGCCGCAGG TAACATGTCT GTTGACCTTT
GTCACCGGTG CGGTGGAAAT CGTACTGGTG GCAAATTCGG TGATTGATAT CTCGGTCGGT
TCGCCATTCT CCATTCCAGA AATGTTCTCC GCACGTCTCG GTATTGCCAC GATGGCGATA
TGCCCAATTA TGGTTTCTTT TAGCGTGGCA GCGATCAATT CGCTAATGAA GCAAGTTGCG
CTGCGAGCCG ACTTTGATTT TCTGACTCAG GTTTACTCAC GGTCCGGTCT TTATGAGGCG
CTGAAAAGTC CATCGCTGAA ACAGACACAA CATCTGACTG TCATGCTGCT TGATATCGAC
TATTTCAAAA GCATTAACGA TAACTATGGA CATGAATGTG GCGATAAAGT GTTAAGCGTG
TTTGCCCAGC ATATTCAGAA GATTGTCGGT GATAAGGGGC TGGTGGCGCG AATGGGCGGC
GAGGAATTTG CTGTTGCAGT GCCATCGGTG AATCCTGTAG ATGGTCTGCT AATGGCGGAA
AAAATCCGTA AAGGCGTTGA ACTGCAACCG TTCACCTGGC AACAAAAAAC GCTTTATCTC
ACGGTAAGTA TTGGCGTCGG TAGTGGTTGC GCATCGTACC GAACGCTGAC CGATGACTTT
AATAAATTGA TGGTCGAAGC CGATACATGT CTGTATCGCT CGAAGAAAGA TGGGCGCAAC
CGTACCAGCA CTATGCGTTA CGGTGAGGAA GTTGTCTGA
 
Protein sequence
MHVQPISTFR LFQEGHLLRN SIAIFVLTTL FYFIGAELRL VHELSLFWPL NGVMAGVFAR 
YVWLNRLHYY AISYVAMLVY DAITTEWGLV SLAINFSNMM FIVTVALLVA RDKRLGKNKY
EPVSALRLFN YCLIAALLCA IVGAIGSVSI DSLDFWPLLA DWFSEQFSTG VLIVPCMLTL
AIPGVLPRFK AEQMMPAIAL IVSVIASVVI GGAGSLAFPL PALIWCAVRY TPQVTCLLTF
VTGAVEIVLV ANSVIDISVG SPFSIPEMFS ARLGIATMAI CPIMVSFSVA AINSLMKQVA
LRADFDFLTQ VYSRSGLYEA LKSPSLKQTQ HLTVMLLDID YFKSINDNYG HECGDKVLSV
FAQHIQKIVG DKGLVARMGG EEFAVAVPSV NPVDGLLMAE KIRKGVELQP FTWQQKTLYL
TVSIGVGSGC ASYRTLTDDF NKLMVEADTC LYRSKKDGRN RTSTMRYGEE VV