Gene EcHS_A3732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3732 
Symbol 
ID5593874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3724942 
End bp3726897 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content53% 
IMG OID640922847 
Productputative phosphodiesterase 
Protein accessionYP_001460326 
Protein GI157163008 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATGG TGGCAGCCGT TGTCCTGGTG TTCGTTTTTA TTTTTTGCAC CGTTTTGCTG 
TTCCATCTGG TCCAGCAGAA TCGCTATAAC ACGGCTACGC AACTGGAAAG CATTGCTCGC
TCTGTCCGCG AACCCTTATC TTCAGCTATT TTGAAAGGCG ATATTCCCGA AGCGGAAGCT
ATTCTTGCCA GCATTAAACC GGCAGGCGTG GTCAGCCGTG CCGATGTAGT GCTGCCTAAC
CAGTTCCAGG CGCTGCGTAA AAGTTTTATT CCAGAGCGCC CGGTGCCGGT AATGGTTACT
CGCCTGTTTG AGCTACCGGT TCAAATCTCG CTGGGCGTTT ACTCGCTCGA ACGTCCGGCA
AACCCGCAGC CAATTGCCTA TCTGGTACTA CAGGCGGATT CCTTCCGTAT GTATAAGTTC
GTGATGAGCA CCCTCTCAAC GTTAGTGACC ATTTACTTAC TTTTGTCGCT TATCCTGACC
GTCGCCATCA GCTGGTGCAT TAACCGCCTG ATTTTGCATC CGTTACGCAA TATTGCTCGC
GAACTTAACG CCATCCCAGC CAAGGAGCTT GTTGGTCACC AACTGGCATT ACCGCGTCTG
CATCAGGACG ATGAAATCGG TATGTTGGTG CGCAGTTACA ACCTCAACCA GCAATTGCTG
CAGCGCCATT ATGAAGAACA GAACGAAAAT GCGATGCGCT TCCCGGTGTC GGATTTGCCG
AACAAAGCCT TGCTGATGGA GATGCTGGAG CAGGTTGTCG CGCGTAAACA AACCACCGCG
CTGATGATCA TCACCTGTGA AACCCTGCGT GATACTGCGG GCGTGCTGAA AGAGGCGCAA
CGAGAAATTC TGCTGCTGAC GCTGGTGGAA AAACTCAAAT CGGTACTGTC GCCACGTATG
ATCCTCGCGC AGATTAGCGG TTATGACTTT GCTGTCATTG CCAACGGTGT ACAGGAACCG
TGGCACGCAA TCACCTTAGG TCAGCAAGTG CTCACTATCA TGAGCGAGCG CCTGCCGATT
GAACGTATTC AACTCCGTCC GCACTGTAGC ATTGGCGTGG CGATGTTCTA CGGCGATCTC
ACCGCCGAAC AGCTTTACAG TCGCGCTATT TCTGCGGCAT TTACCGCTCG CCATAAAGGC
AAGAATCAGA TTCAGTTCTT TGATCCGCAG CAGATGGAAG CCGCCCAGAA GCGGTTGACG
GAAGAGAGCG ATATCCTTAA TGCACTGGAA AATCATCAGT TTGCTATTTG GTTACAGCCA
CAGGTCGAGA TGACCAGCGG TAAACTGGTC AGTGCGGAAG TGTTACTGCG TATCCAGCAA
CCGGATGGCA GTTGGGACCT GCCGGATGGC TTAATCGATC GCATTGAGTG CTGTGGGCTG
ATGGTTACCG TCGGTCACTG GGTGCTGGAA GAGTCCTGTC GATTGCTTGC AGCCTGGCAA
GAGCGCGGCA TTATGCTGCC CTTGTCGGTA AACCTCTCTG CGCTGCAACT GATGCACCCG
AATATGGTGG CGGATATGCT GGAACTGTTA ACCCGCTATC GCATTCAGCC GGGAACACTG
ATTCTGGAAG TGACAGAAAG CCGACGTATT GACGACCCTC ATGCTGCGGT GGCAATCCTC
CGTCCGCTGC GCAATGCCGG AGTTCGGGTG GCGCTGGATG ATTTCGGCAT GGGCTACGCA
GGGCTGCGTC AGCTGCAGCA TATGAAATCG TTGCCAATCG ACGTACTGAA AATCGACAAA
ATGTTTGTTG AAGGCTTGCC GGGAGATAGC AGCATGATTG CTGCAATTAT CATGCTGGCG
CAGAGCCTGA ACTTACAAAT GATTGCCGAA GGCGTGGAGA CTGAAGCACA ACGCGACTGG
CTGGCAAAAG CGGGCGTTGG TATTGCCCAG GGCTTCCTTT TTGCTCGCCC ACTCCCTATT
GAAATCTTCG AAGAGAGTTA CCTGGAAGAA AAGTAG
 
Protein sequence
MAMVAAVVLV FVFIFCTVLL FHLVQQNRYN TATQLESIAR SVREPLSSAI LKGDIPEAEA 
ILASIKPAGV VSRADVVLPN QFQALRKSFI PERPVPVMVT RLFELPVQIS LGVYSLERPA
NPQPIAYLVL QADSFRMYKF VMSTLSTLVT IYLLLSLILT VAISWCINRL ILHPLRNIAR
ELNAIPAKEL VGHQLALPRL HQDDEIGMLV RSYNLNQQLL QRHYEEQNEN AMRFPVSDLP
NKALLMEMLE QVVARKQTTA LMIITCETLR DTAGVLKEAQ REILLLTLVE KLKSVLSPRM
ILAQISGYDF AVIANGVQEP WHAITLGQQV LTIMSERLPI ERIQLRPHCS IGVAMFYGDL
TAEQLYSRAI SAAFTARHKG KNQIQFFDPQ QMEAAQKRLT EESDILNALE NHQFAIWLQP
QVEMTSGKLV SAEVLLRIQQ PDGSWDLPDG LIDRIECCGL MVTVGHWVLE ESCRLLAAWQ
ERGIMLPLSV NLSALQLMHP NMVADMLELL TRYRIQPGTL ILEVTESRRI DDPHAAVAIL
RPLRNAGVRV ALDDFGMGYA GLRQLQHMKS LPIDVLKIDK MFVEGLPGDS SMIAAIIMLA
QSLNLQMIAE GVETEAQRDW LAKAGVGIAQ GFLFARPLPI EIFEESYLEE K