Gene EcHS_A0891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0891 
Symbol 
ID5593030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp900861 
End bp903209 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content41% 
IMG OID640920063 
Productcyclic diguanylate phosphodiesterase domain-containing protein 
Protein accessionYP_001457630 
Protein GI157160312 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.941678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAGTT TATACGAAAA GATAAAGATA AGGCTGATAA TTTTATTTTT ATTGGCAGCA 
CTGTCATTTA TTGGTCTTTT TTTCATCATT AACTATCAAC TGGTATCGGA ACGCGCGGTA
AAACGTGCCG ATAGCCGCTT TGAACTTATT CAGAAAAACG TTGGCTATTT CTTTAAAGAT
ATTGAACGTT CGGCCCTGAC ATTAAAGGAC TCACTGTATT TATTAAAAAA TACAGAGGAG
ATTCAACGCG CCGTGATTCT GAAAATGGAA ATGATGCCAT TTTTAGACTC GGTGGGACTG
GTACTTGATG ATAATAAATA TTATCTCTTT TCACGGAGGG CGAATGATAA AATCGTTGTT
TATCATCAGG AACAAGTAAA TGGACCGCTT GTCGACGAGT CAGGGCGGGT TATTTTTGCC
GATTTTAACC CATCGAAACG ACCGTGGTCG GTGGCTTCAG ATGACTCTAA CAACAGCTGG
AATCCGGCAT ACAATTGCTT TGATCGTCCG GGTAAAAAAT GTATCTCTTT TACGCTACGT
ATCAACGGCA AAGATCACGA TTTGTTAGCG GTAGATAAAA TTCATGTCGA TTTAAACTGG
CGGTATCTGA ACGAGTATCT TGATCAAATC AGCGCTAATG ATGAAGTTCT ATTTTTGAAA
CAAGGTCATG AGATCATTGC CAAGAATCAA CTCGCTCGTG AAAAACTGAT TATTTATAAT
AGCGAAGGTA ATTATAATAT TATTGATTCT GTCGATACGG AGTATATTGC AAAGACATCA
GCAGTGCCAA ATAATGCATT GTTCGAAATC TATTTTTATT ATCCTGGCGG TAATTTATTG
AACGCCTCAG ATAAACTTTT TTATCTGCCG TTTGCGTTCA TTATTATCGT ATTGCTGGTG
GTTTATTTAA TGACCACTCG TGTGTTCCGT CGGCAATTTT CTGAAATGAC AGAGCTGGTT
AATACGCTGG CATTTTTGCC CGACTCAACG GATCAAATCG AGGCTCTGAA AATTCGCGAA
GGCGATGCGA AAGAGATTAT CAGCATTAAA AATTCGATCG CGGAAATGAA AGATGCCGAA
ATTGAACGGT CAAATAAATT GCTCTCACTG ATCTCTTACG ATCAGGAAAG CGGTTTTATT
AAAAATATGG CGATTATTGA GTCCAACAAT AATCAGTATC TGGCTGTGGG GATCATCAAA
CTGTGTGGTC TGGAAGCCGT GGAAGCGGTG TTTGGTGTTG ATGAACGCAA TAAAATCGTC
AGGAAATTGT GTCAGCGAAT TGCCGAAAAA TATGCGCAAT GCTGCGATAT CGTGACATTC
AATGCCGATC TCTATTTACT TCTGTGCCGG GAAAATGTAC AGACATTTAT CCGTAAAATA
GCGATGGTAA ACGATTTTGA CAGCAGCTTT GGCTACCGTA ATCTGCGCAT CCATAAGTCT
GCCATTTGTG AACCTTTGCA GGGGGAAAAC GCCTGGAGTT ACGCAGAAAA ACTGAAACTG
GCGATTTCCA GTATCCGTGA CCATATGTTC TCAGAGTTTA TTTTCTGTGA TGACGCGAAA
CTCAACGAAA TAGAAGAGAA TATCTGGATT GCGCGTAATA TTCGCCATGC AATGGAAATT
GGCGAACTAT TCCTGGTCTA TCAACCGATC GTTGATATTA ACACCCGCGC CATTCTGGGC
GCGGAGGCGT TGTGCCGTTG GGTGTCTGCG GAGCGGGGGA TCATTTCACC GCTGAAGTTC
ATTACCATTG CTGAAGATAT CGGGTTTATC AATGAGCTGG GTTATCAGAT TATTAAAACG
GCAATGGGTG AATTCAGACA TTTTAGTCAG CGTGCGTCGC TGAAGGATGA TTTCTTACTG
CATATTAATA TTTCGCCATG GCAGTTAAAC GAACCGCACT TTCATGAGCG TTTTACCACC
ATCATGAAAG AAAATGGCCT GAAGGCGAAC AGCCTCTGTG TTGAGATCAC TGAAACCGTG
ATCGAGCGAA TTAATGAACA TTTTTATCTC AATATTGAAC AACTGCGTAA ACAAGGGGTA
CGGATATCGA TTGATGACTT TGGCACCGGT TTGTCAAACC TGAAACGTTT TTATGAAATT
AATCCAGATA GCATAAAGGT GGACTCGCAA TTTACCGGCG ATATTTTCGG TACTGCGGGA
AAAATTGTGC GCATTATTTT CGACCTGGCA CGCTATAACC GGATCCCGGT GATTGCGGAA
GGCGTAGAGA GCGAAGACGT TGCGCGCGAA TTAATCAAAT TAGGATGTGT TCAGGCTCAG
GGGTATCTGT ACCAGAAACC CATGCCATTC TCCGCCTGGG ATAAAAGTGG AAAATTAGTA
AAAGAGTAG
 
Protein sequence
MLSLYEKIKI RLIILFLLAA LSFIGLFFII NYQLVSERAV KRADSRFELI QKNVGYFFKD 
IERSALTLKD SLYLLKNTEE IQRAVILKME MMPFLDSVGL VLDDNKYYLF SRRANDKIVV
YHQEQVNGPL VDESGRVIFA DFNPSKRPWS VASDDSNNSW NPAYNCFDRP GKKCISFTLR
INGKDHDLLA VDKIHVDLNW RYLNEYLDQI SANDEVLFLK QGHEIIAKNQ LAREKLIIYN
SEGNYNIIDS VDTEYIAKTS AVPNNALFEI YFYYPGGNLL NASDKLFYLP FAFIIIVLLV
VYLMTTRVFR RQFSEMTELV NTLAFLPDST DQIEALKIRE GDAKEIISIK NSIAEMKDAE
IERSNKLLSL ISYDQESGFI KNMAIIESNN NQYLAVGIIK LCGLEAVEAV FGVDERNKIV
RKLCQRIAEK YAQCCDIVTF NADLYLLLCR ENVQTFIRKI AMVNDFDSSF GYRNLRIHKS
AICEPLQGEN AWSYAEKLKL AISSIRDHMF SEFIFCDDAK LNEIEENIWI ARNIRHAMEI
GELFLVYQPI VDINTRAILG AEALCRWVSA ERGIISPLKF ITIAEDIGFI NELGYQIIKT
AMGEFRHFSQ RASLKDDFLL HINISPWQLN EPHFHERFTT IMKENGLKAN SLCVEITETV
IERINEHFYL NIEQLRKQGV RISIDDFGTG LSNLKRFYEI NPDSIKVDSQ FTGDIFGTAG
KIVRIIFDLA RYNRIPVIAE GVESEDVARE LIKLGCVQAQ GYLYQKPMPF SAWDKSGKLV
KE