Gene EcolC_1687 details


Gene Information

Locus tag: EcolC_1687
Symbol:
ID: 6065583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1889101
End bp: 1890795
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 52%
IMG OID: 641601101
Product: diguanylate cyclase
Protein accession: YP_001724666
Protein GI: 170019712
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
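
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1889101
END_BP = 1890795
GENE_LENGTH_BP = 1695
PROTEIN_LENGTH_AA = 564


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the span 1889101 to 1890795 covers 1695 bp, which is 565 codons: 564 residues plus the stop codon.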


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0356503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00372154
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCAGCACG AGACAAAAAT GGAAAACCAG AGCTGGTTGA AAAAACTCGC ACGCCGCCTG 
GGGCCTGGTC ATGTCGTTAA TCTCTGCTTT ATCGTGGTAT TGCTTTTTTC CACCTTGCTC
ACCTGGCGTG AAGTAGTGGT GCTGGAAGAT GCCTATATCT CCAGCCAGCG TAATCATCTG
GAAAACGTAG CCAACGCGCT CGATAAGCAT TTGCAGTATA ACGTCGACAA ACTGATCTTT
TTGCGTAATG GCATGCGCGA AGCTCTCGTA GCGCCACTGG ATTTCACTTC ACTGCGTAAT
GCTGTAACCG AGTTCGAACA GCATCGCGAC GAGCACGCCT GGCAAATTGA ACTCAACCGA
CGACGCACCC TGTCAGTCAA TGGCGTATCG GATGCATTAG TCAGCGAGGG GAATCTCCTG
TCTCGCGAAA ATGAAAGCCT CGACAATGAA ATTACCGCTG CACTGGAAGT TGGTTACTTG
CTGCGACTGG CGCACAACAC CTCGTCGATG GTTGAACAGG CGATGTATGT CTCGCGTGCC
GGATTTTACG TTTCGACGCA GCCGACCTTG TTTACGCGCA ATGTACCAAC GCGTTATTAC
GGCTATGTCA CCCAACCCTG GTTTATCGGC CATTCGCAAC GAGAAAATCG TCACCGCGCG
GTACGCTGGT TTACTTCGCA ACCGGAACAC GCCAGCAATA CTGAACCGCA GGTTACCGTC
AGTGTTCCGG TAGACAGTAA TAACTACTGG TATGGCGTGC TGGGGATGAG TATTCCCGTG
CGTACCATGC AGCAATTTTT AAGAAACGCC ATCGATAAAA ACCTCGATGG TGAGTATCAG
CTCTATGACA GTAAGCTGAG ATTTTTGACC TCTTCCAATC CTGATCATCC AACAGGGAAT
ATTTTTGATC CTCGTGAACT GGCCTTGCTG GCGCAGGCAA TGGAACATGA CACGCGGGGC
GGCATTCGTA TGGACAGTCG CTATGTTAGT TGGGAACGTC TGGACCATTT CGACGGTGTG
CTGGTGCGCG TCCATACGCT AAGCGAAGGC GTGCGCGGCG ATTTCGGCAG TATCAGCATT
GCATTAACCC TGCTGTGGGC GCTCTTTACC ACCATGTTAC TCATCTCCTG GTATGTGATT
CGCCGGATGG TCAGCAACAT GTATGTTCTG CAAAGCTCGT TGCAGTGGCA GGCGTGGCAC
GACACCTTAA CGCGTTTATA TAACCGTGGC GCACTGTTCG AAAAAGCCCG TCCGCTCGCG
AAAATGTGTC AGACGCACCA ACATCCTTTT TCTGTCATTC AGGTCGATCT TGACCATTTC
AAAGCGATTA ATGACCGCTT TGGTCATCAG GCGGGCGACC GTGTTCTTTC TCATGCTGCC
GGATTAATTA GCAGTTCCTT GCGTGCGCAG GACGTTGCCG GGCGGGTCGG TGGTGAGGAG
TTTTGTGTGA TTCTGCCAGG CGCGAGTCTG ACGGAGGCTG CGGAAGTCGC AGAACGTATT
CGCCTGAAGT TAAATGAAAA AGAGATGTTG ATTGCTAAGA GTACGACGAT ACGCATCAGT
GCCTCGTTGG GGGTAAGTAG CAGCGAGGAA ACCGGTGATT ATGATTTTGA ACAACTCCAG
TCACTGGCTG ACCGTCGGCT TTATCTCGCT AAACAGGCCG GGCGTAATCG GGTATGCGCG
AGCGATAACG CTTAA
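
The GC content field can in principle be recomputed from the gene sequence. A minimal sketch follows, assuming GC content means the percentage of G and C bases over all bases, with the spaces and line breaks of the listing ignored. The helper name `gc_content` is hypothetical, and the example call uses only the first two 10-base blocks of the listing, so its result is not the value for the whole gene.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, copied from the listing.
first_blocks = "GTGCAGCACG AGACAAAAAT"
print(gc_content(first_blocks))  # 45.0
```

Passing the full 1695-base listing to the same function would give the figure to compare with the reported 52%.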
 
Protein sequence
MQHETKMENQ SWLKKLARRL GPGHVVNLCF IVVLLFSTLL TWREVVVLED AYISSQRNHL 
ENVANALDKH LQYNVDKLIF LRNGMREALV APLDFTSLRN AVTEFEQHRD EHAWQIELNR
RRTLSVNGVS DALVSEGNLL SRENESLDNE ITAALEVGYL LRLAHNTSSM VEQAMYVSRA
GFYVSTQPTL FTRNVPTRYY GYVTQPWFIG HSQRENRHRA VRWFTSQPEH ASNTEPQVTV
SVPVDSNNYW YGVLGMSIPV RTMQQFLRNA IDKNLDGEYQ LYDSKLRFLT SSNPDHPTGN
IFDPRELALL AQAMEHDTRG GIRMDSRYVS WERLDHFDGV LVRVHTLSEG VRGDFGSISI
ALTLLWALFT TMLLISWYVI RRMVSNMYVL QSSLQWQAWH DTLTRLYNRG ALFEKARPLA
KMCQTHQHPF SVIQVDLDHF KAINDRFGHQ AGDRVLSHAA GLISSSLRAQ DVAGRVGGEE
FCVILPGASL TEAAEVAERI RLKLNEKEML IAKSTTIRIS ASLGVSSSEE TGDYDFEQLQ
SLADRRLYLA KQAGRNRVCA SDNA
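
The protein sequence relates to the gene sequence through translation table 11. The gene starts with GTG, which normally encodes valine but is read as methionine when it serves as the start codon. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record. It builds the codon table from the usual compact amino-acid string ordered by T, C, A, G, and treats only ATG, GTG and TTG as start codons, which is a simplification. The names `translate_cds` and `START_CODONS` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a simplified set of start codons read as methionine.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence):
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    residues = []
    for position in range(0, usable, 3):
        codon = bases[position:position + 3]
        residue = CODON_TABLE[codon]
        if position == 0 and codon in START_CODONS:
            residue = "M"  # start codon is read as methionine
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the listing.
print(translate_cds("GTGCAGCACG AGACAAAAAT GGAAAACCAG"))  # MQHETKMENQ
```

The output matches the first ten residues of the protein sequence, including the leading M that comes from the GTG start codon.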