Gene EcolC_1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1846 
Symbol 
ID6065158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2043157 
End bp2044647 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content46% 
IMG OID641601260 
Productdiguanylate cyclase 
Protein accessionYP_001724822 
Protein GI170019868 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0897436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGC ACCATAGAAT GCTCCGGCAT TTTATCGCCG CAAGTGTCAT TGTGCTGACA 
TCTTCCTTCC TTATTTTTGA ACTTGTCGCC AGCGACAGAG CAATGAGTGC CTATCTGCGC
TATATCGTGC AGAAAGCAGA CTCCTCCTTT CTTTATGATA AGTATCAGAA TCAGAGTATT
GCCGCGCATG TGATGCGCGC TCTCGCTGCT GAGCAGTCGG AAGTGTCGCC AGAACAGCGG
CGCGCCATCT GCGAGGCTTT TGAGTCTGCC AATAATACCC ATGGCTTAAA CCTGACTGCC
CATAAATACC CGGGCTTACG CGGCACACTA CAAACCGCAT CCACTGACTG CGACACAATT
GTGGAAGCTG CAGCACTATT ACCCGCTTTT GATCAGGCAG TGGAAGGCAA CCGCCACCAG
GATGATTACG GTTCAGGTCT TGGGATGGCC GAAGAGAAAT TTCACTATTA TCTCGATCTC
AATGACCGCT ATGTCTATTT TTATGAGCCG GTTAATGTTG AATACTTTGC GATGAATAAC
TGGTCCTTCC TGCAGTCAGG AAGTATTGGC ATCGATCGCA AAGATATTGA AAAGGTATTT
ACCGGGCGTA CCGTATTGTC GAGCATTTAC CAGGATCAGC GTACTAAACA GAACGTGATG
AGTTTGCTGA CGCCGGTATA TGTCGCAGGG CAGCTAAAAG GGATTGTGCT GCTGGATATT
AACAAAAACA ATTTGCGGAA TATCTTTTAC ACTCATGACC GCCCTCTCCT CTGGCGTTTT
CTCAATGTCA CGCTAACCGA TACCGATTCG GGGCGCGACA TTATCATCAA CCAGAGCGAA
GATAATCTGT TCCAGTATGT CAGTTACGTC CATGACTTAC CGGGCGGCAT TCGTGTCTCG
TTATCCATTG ATATTCTTTA CTTTATCACG TCTTCGTGGA AAAGCGTTCT GTTCTGGATT
TTGACGGCGT TAATTTTGCT GAATATGGTG CGGATGCACT TCCGTTTATA CCAAAATGTG
TCGCGAGAAA ATATTAGTGA TGCGATGACT GGACTGTATA ATCGCAAAAT TTTAACCCCT
GAACTGGAGC AGCGGTTGCA GAAACTGGTG CAAGCCGGTT CTTCGGTGAT GTTTATTGCA
ATTGACATGG ACAAGTTAAA GCAAATAAAT GACACCCTCG GTCATCAGGA GGGGGATTTA
GCGATTACGT TATTAGCTCA GGCGATTAAA CAATCGATTC GTAAAAGTGA TTATGCCATC
CGACTCGGTG GCGATGAATT CTGCATCATT CTTGTCGATT CGACGCCGCA AATTGCAGCA
CAACTGCCTG AACGTATCGA AAAACGTCTG CAACATATCG CGCCGCAGAA AGAGATTGGC
TTCTCTTCCG GTATTTACGC GATGAAAGAA AACGATACGT TACATGATGC GTATAAAGCT
TCCGATGAGC GTTTATATGT CAATAAGCAG AACAAAAACA GCCGTTCATG A
 
Protein sequence
MKLHHRMLRH FIAASVIVLT SSFLIFELVA SDRAMSAYLR YIVQKADSSF LYDKYQNQSI 
AAHVMRALAA EQSEVSPEQR RAICEAFESA NNTHGLNLTA HKYPGLRGTL QTASTDCDTI
VEAAALLPAF DQAVEGNRHQ DDYGSGLGMA EEKFHYYLDL NDRYVYFYEP VNVEYFAMNN
WSFLQSGSIG IDRKDIEKVF TGRTVLSSIY QDQRTKQNVM SLLTPVYVAG QLKGIVLLDI
NKNNLRNIFY THDRPLLWRF LNVTLTDTDS GRDIIINQSE DNLFQYVSYV HDLPGGIRVS
LSIDILYFIT SSWKSVLFWI LTALILLNMV RMHFRLYQNV SRENISDAMT GLYNRKILTP
ELEQRLQKLV QAGSSVMFIA IDMDKLKQIN DTLGHQEGDL AITLLAQAIK QSIRKSDYAI
RLGGDEFCII LVDSTPQIAA QLPERIEKRL QHIAPQKEIG FSSGIYAMKE NDTLHDAYKA
SDERLYVNKQ NKNSRS