Gene EcHS_A1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1871 
Symbol 
ID5591173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1888362 
End bp1889837 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content38% 
IMG OID640921013 
Productdiguanylate cyclase 
Protein accessionYP_001458565 
Protein GI157161247 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.0774935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCAGT CAACACGTAT TTCCATGGGG TTATTCTTTA AATATTTTTT ATCGTTAACG 
AAAATTGATC TTGGTCAAAA CTATATATCT CTGCCATCAA TAAAATCCAG CACTCACATT
GCTCTCCTTT TTATGGTTTC TATGGGTACA CAAAAATTAA AAGCTCAAAG CTTTTTTATT
TTCAGTTTAT TGCTGACGTT AATTTTATTT TGCATTACTA CCTTATATAA CGAAAACACA
AATGTAAAAC TCATCCCACA GATGAATTAC CTGATGGTTG TTGTGGCTTT GTTTTTCCTT
AACGCCGTCA TTTTTCTTTT CATGTTAATG AAATATTTCA CTAACAAACA AATTTTACCA
ACACTCATTT TAAGCCTTGC ATTTTTAAGT AGCCTTATCT ATTTAGTTGA AACCATTGTA
ATTATCCATA AACCAATTAA CGGCAGTACA CTGATCCAGA CAAAGTCGAA TGATGTTTCT
ATTTTCTATA TTTTCCGCCA ACTCAGTTTT ATTTGTTTAA CCTCGCTGGC ACTCTTTTGT
TATGGAAAAG ACAACATCCT TGACAACAAT AAGAAAAAAA CGGGAATCCT GTTGCTGGCG
CTGATTCCTT TTTTAGTTTT TCCCCTTCTG GCACACAATC TGAGCAGTTA TAACGCTGAC
TATTCTTTGT ATGTCGTCGA CTACTGTCCT GACAACCATA CTGCGACCTG GGGAATCAAC
TATACAAAAA TATTGGTTTG TCTGTGGGCA TTTTTACTGT TCTTTATTAT CATGCGTACA
CGATTAGCCA GCGAACTATG GCCATTAATA GCATTATTAT GTCTGGCATC GCTATGCTGC
AACTTACTTC TACTGACTCT GGATGAGTAT AATTACACCA TCTGGTACAT CAGTCGCGGG
ATTGAAGTCT CCAGTAAACT GTTTGTTGTG TCTTTTCTGA TTTATAACAT TTTTCAGGAA
CTGCAACTCT CCAGCAAACT GGCAGTTCAT GATGTGCTGA CCAATATTTA TAATCGGCGC
TACTTTTTCA ACAGCGTAGA GTCATTATTG TCGCGACCTG TTGTTAAGGA CTTCTGTGTC
ATGCTGGTTG ATATTAATCA GTTCAAACGC ATCAATGCCC AATGGGGACA TCGTGTGGGT
GATAAAGTGC TGGTTTCAAT TGTCGATATT ATCCAGCAAA GCATCCGCCC CGATGATATT
TTAGCGCGAC TGGAGGGTGA GGTGTTTGGC TTGCTATTTA CCAAACTCAA TAGTGCCCAG
GCAAAAATCA TTGCGGAACG TATGCGTAAA AATGTCGAAC TCCTGACCGG CTTTAGTAAC
AGATATGATG TTCCTGAACA AATGACCATC AGTATTGGCA CGGTTTTTTC AACGGGTGAC
ACGCGTAATA TCTCGCTTGT CATGACGGAA GCAGATAAAG CCTTACGCGA AGCGAAAAGC
GAGGGGGGCA ACAAAGTGAT TATTCATCAT ATTTAA
 
Protein sequence
MIQSTRISMG LFFKYFLSLT KIDLGQNYIS LPSIKSSTHI ALLFMVSMGT QKLKAQSFFI 
FSLLLTLILF CITTLYNENT NVKLIPQMNY LMVVVALFFL NAVIFLFMLM KYFTNKQILP
TLILSLAFLS SLIYLVETIV IIHKPINGST LIQTKSNDVS IFYIFRQLSF ICLTSLALFC
YGKDNILDNN KKKTGILLLA LIPFLVFPLL AHNLSSYNAD YSLYVVDYCP DNHTATWGIN
YTKILVCLWA FLLFFIIMRT RLASELWPLI ALLCLASLCC NLLLLTLDEY NYTIWYISRG
IEVSSKLFVV SFLIYNIFQE LQLSSKLAVH DVLTNIYNRR YFFNSVESLL SRPVVKDFCV
MLVDINQFKR INAQWGHRVG DKVLVSIVDI IQQSIRPDDI LARLEGEVFG LLFTKLNSAQ
AKIIAERMRK NVELLTGFSN RYDVPEQMTI SIGTVFSTGD TRNISLVMTE ADKALREAKS
EGGNKVIIHH I