Gene EcHS_A0892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0892 
Symbol 
ID5593031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp903217 
End bp904545 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content43% 
IMG OID640920064 
Productdiguanylate cyclase 
Protein accessionYP_001457631 
Protein GI157160313 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.701109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGAA TCAATAAGTT CGTACTTACA GTCAGTCTGC TGATTTTTAT CATGATTTCA 
GCAGTTGCCT GCGGGATCTA CACTCAAATG GTAAAGGAAC GGGTGTATGG CCTGAAACAG
TCCGTTATTG ATACTGCTTT TGCGGTGGCA AATATTGCTG AATATCGGCG TAGCGTGGCA
ATTGATCTTA TCAACACGCT AAATCCCACG GAGGAACAGC TGTTGGTTGG CTTACGCACA
GCTTACGCTG ACTCGGTTTC ACCCTCTTAC TTGTACGATG TCGGTCCTTA TCTGATTTCC
AGTGACGAAT GTATTCAGGT AAAGGAGTTC GAGAAAAATT ATTGTGCAGA TATTATGCAG
GTTGTGAAGT ATCGACATGT CAAAAATACA GGGTTTATCT CTTTTGACGG TAAAACCTTC
GTCTATTACC TCTATCCGGT AACTCACAAT CGTAGTCTGA TATTTTTGCT TGGTCTGGAG
CGTTTTTCTT TACTGTCAAA ATCTCTGGCG ATGGATAGCG AGAACCTGAT GTTCTCTCTG
TTTAAGAACG GTAAACCGGT GACCGGTGAT GAATTTAATG CTAAAAACAC CATTTTCACC
GTTTCGGAAG CGATGGAGCA CTTTGCCTAT TTGCCGACCG GATTGTACGT ATTTGCGTAT
AAAAAAGATG TTTATTTGCG GGTTTGTACA TTGATCATTT TCTTTGCCGC ATTGGTGGCA
GTGATATCGG GTGCCAGTTG CCTCTATCTG GTACGCAGAG TGATTAATCG TGGTATTGTG
GAGAAAGAAG CCATTATTAA TAACCATTTT GAACGCGTAC TGGATGGCGG GCTTTTCTTT
TCGGCTGCCG ATGTCAAAAA ACTCTACAGT ATGTATAACT CGGCGTTCCT GGACGACCTG
ACCAAAGCAA TGGGCAGAAA ATCCTTTGAC GAAGATTTAA AAGCGCTGCC GGAAAAAGGC
GGTTATTTGT GCCTGTTTGA CGTCGATAAA TTCAAAAATA TTAACGACAC CTTCGGTCAT
TTGCTGGGCG ATGAAGTGTT GATGAAAGTG GTGAAAATCC TTAAATCACA GATCCCGGTA
GATAAAGGTA AAGTCTACCG CTTCGGCGGT GACGAATTTG CGGTGATTTA TACTGGCGGC
ACGCTGGAAG AGTTGCTATC GATACTAAAA GAAATCGTTC ATTTCCAGGT GGGAAGCATT
AATTTAAGCA CCAGTATCGG TGTAGCGCAT TCAAATGAAT GTACTACCGT CGAACGCTTG
AAAATGCTGG CGGATGAGCG ACTGTATAAG AGTAAGAAAA ACGGCAGGGC TCAGATTAGC
TGGCAGTAG
 
Protein sequence
MSRINKFVLT VSLLIFIMIS AVACGIYTQM VKERVYGLKQ SVIDTAFAVA NIAEYRRSVA 
IDLINTLNPT EEQLLVGLRT AYADSVSPSY LYDVGPYLIS SDECIQVKEF EKNYCADIMQ
VVKYRHVKNT GFISFDGKTF VYYLYPVTHN RSLIFLLGLE RFSLLSKSLA MDSENLMFSL
FKNGKPVTGD EFNAKNTIFT VSEAMEHFAY LPTGLYVFAY KKDVYLRVCT LIIFFAALVA
VISGASCLYL VRRVINRGIV EKEAIINNHF ERVLDGGLFF SAADVKKLYS MYNSAFLDDL
TKAMGRKSFD EDLKALPEKG GYLCLFDVDK FKNINDTFGH LLGDEVLMKV VKILKSQIPV
DKGKVYRFGG DEFAVIYTGG TLEELLSILK EIVHFQVGSI NLSTSIGVAH SNECTTVERL
KMLADERLYK SKKNGRAQIS WQ