Gene EcHS_A1872 details


Gene Information

Locus tag: EcHS_A1872
Symbol:
ID: 5591174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1890018
End bp: 1891508
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 46%
IMG OID: 640921014
Product: diguanylate cyclase
Protein accession: YP_001458566
Protein GI: 157161248
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
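
The length fields in this record agree with the coordinates. A minimal sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1890018
end_bp = 1891508

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the terminal stop codon excluded.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1491 496
```

Both results match the listed Gene Length (1491 bp) and Protein Length (496 aa).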


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value: 0.320559
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTGC ACCATAGAAT GCTCCGGCAT TTTATCGCCG CAAGTGTCAT TGTGCTGACA 
TCTTCCTTCC TTATTTTTGA ACTTGTCGCC AGCGACAGAG CAATGAGTGC CTATCTGCGC
TATATCGTGC AGAAAGCAGA CTCCTCCTTT CTTTATGATA AGTATCAGAA TCAGAGTATT
GCCGCGCATG TGATGCGCGC TCTCGCTGCT GAGCAGTCGG AAGTGTCGCC AGAACAGCGG
CGCGCCATCT GCGAGGCTTT TGAGTCTGCC AATAATACCC ATGGCTTAAA CCTGACTGCC
CATAAATACC CGGGCTTACG CGGCACACTA CAAACCGCAT CCACTGACTG CGACACAATT
GTGGAAGCTG CAGCACTATT ACCCGCTTTT GATCAGGCAG TGGAAGGCAA CCGCCACCAG
GATGATTACG GTTCAGGTCT TGGGATGGCC GAAGAGAAAT TTCACTATTA TCTCGATCTC
AATGACCGCT ATGTCTATTT TTATGAGCCG GTTAATGTTG AATACTTTGC GATGAATAAC
TGGTCCTTCC TGCAGTCAGG AAGTATTGGC ATCGATCGCA AAGATATTGA AAAGGTATTT
ACCGGGCGTA CCGTATTGTC GAGCATTTAC CAGGATCAGC GTACTAAACA GAACGTGATG
AGTTTGCTGA CGCCGGTATA TGTCGCAGGG CAGCTAAAAG GGATTGTGCT GCTGGATATT
AACAAAAACA ATTTGCGGAA TATCTTTTAC ACTCATGACC GCCCTCTCCT CTGGCGTTTT
CTCAATGTCA CGCTAACCGA TACCGATTCG GGGCGCGACA TTATCATCAA CCAGAGCGAA
GATAATCTGT TCCAGTATGT CAGTTACGTC CATGACTTAC CGGGCGGCAT TCGTGTCTCG
TTATCCATTG ATATTCTTTA CTTTATCACG TCTTCGTGGA AAAGCGTTCT GTTCTGGATT
TTGACGGCGT TAATTTTGCT GAATATGGTG CGGATGCACT TCCGTTTATA CCAAAATGTG
TCGCGAGAAA ATATTAGTGA TGCGATGACT GGACTGTATA ATCGCAAAAT TTTAACCCCT
GAACTGGAGC AGCGGTTGCA GAAACTGGTG CAAGCCGGTT CTTCGGTGAT GTTTATTGCA
ATTGACATGG ACAAGTTAAA GCAAATAAAT GACACCCTCG GTCATCAGGA GGGGGATTTA
GCGATTACGT TATTAGCTCA GGCGATTAAA CAATCGATTC GTAAAAGTGA TTATGCCATC
CGACTCGGTG GCGATGAATT CTGCATCATT CTTGTCGATT CGACGCCGCA AATTGCAGCA
CAACTGCCTG AACGTATCGA AAAACGTCTG CAACATATCG CGCCGCAGAA AGAGATTGGC
TTCTCTTCCG GTATTTACGC GATGAAAGAA AACGATACGT TACATGATGC GTATAAAGCT
TCCGATGAGC GTTTATATGT CAATAAGCAG AACAAAAACA GCCGTTCATG A
 
Protein sequence
MKLHHRMLRH FIAASVIVLT SSFLIFELVA SDRAMSAYLR YIVQKADSSF LYDKYQNQSI 
AAHVMRALAA EQSEVSPEQR RAICEAFESA NNTHGLNLTA HKYPGLRGTL QTASTDCDTI
VEAAALLPAF DQAVEGNRHQ DDYGSGLGMA EEKFHYYLDL NDRYVYFYEP VNVEYFAMNN
WSFLQSGSIG IDRKDIEKVF TGRTVLSSIY QDQRTKQNVM SLLTPVYVAG QLKGIVLLDI
NKNNLRNIFY THDRPLLWRF LNVTLTDTDS GRDIIINQSE DNLFQYVSYV HDLPGGIRVS
LSIDILYFIT SSWKSVLFWI LTALILLNMV RMHFRLYQNV SRENISDAMT GLYNRKILTP
ELEQRLQKLV QAGSSVMFIA IDMDKLKQIN DTLGHQEGDL AITLLAQAIK QSIRKSDYAI
RLGGDEFCII LVDSTPQIAA QLPERIEKRL QHIAPQKEIG FSSGIYAMKE NDTLHDAYKA
SDERLYVNKQ NKNSRS
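
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, not the database's own pipeline. It assumes the sequence is given in the coding orientation, as displayed above, and uses the codon-to-amino-acid mapping of translation table 11, which is the same as the standard code for internal codons. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence in this record.
first_line = "ATGAAATTGC ACCATAGAAT GCTCCGGCAT TTTATCGCCG CAAGTGTCAT TGTGCTGACA"
print(translate(first_line))  # MKLHHRMLRHFIAASVIVLT
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the whole protein and the listed GC content.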