Gene EcolC_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1847 
Symbol 
ID6067436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2044828 
End bp2046303 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content38% 
IMG OID641601261 
Productdiguanylate cyclase 
Protein accessionYP_001724823 
Protein GI170019869 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.02251 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCAGT CAACACGTAT TTCCATGGGG TTATTCTTTA AATATTTTTT ATCGTTAACG 
AAAATTGATC TTGGTCAAAA CTATATATCT CTGCCATCAA TAAAATCCAG CACTCACATT
GCTCTCCTTT TTATGGTTTC TATGGGTACA CAAAAATTAA AAGCTCAAAG CTTTTTTATT
TTCAGTTTAT TGCTGACGTT AATTTTATTT TGCATTACTA CCTTATATAA CGAAAACACA
AATGTAAAAC TCATCCCACA GATGAATTAC CTGATGGTTG TTGTGGCTTT GTTTTTCCTT
AACGCCGTCA TTTTTCTTTT CATGTTAATG AAATATTTCA CTAACAAACA AATTTTACCA
ACACTCATTT TAAGCCTTGC ATTTTTAAGT GGCCTTATCT ATTTAGTTGA AACCATTGTA
ATTATCCATA AACCAATTAA CGGCAGTACA CTGATCCAGA CAAAGTCGAA TGATGTTTCT
ATTTTCTATA TTTTCCGCCA ACTCAGTTTT ATTTGTTTAA CCTCGCTGGC ACTCTTTTGT
TATGGAAAAG ACAACATCCT TGACAACAAT AAGAAAAAAA CGGGAATCCT GTTGCTGGCG
CTGATTCCTT TTTTAGTTTT TCCCCTTCTG GCACACAATC TGAGCAGTTA TAACGCTGAC
TATTCTTTGT ATGTCGTCGA CTACTGTCCT GACAACCATA CTGCGACCTG GGGAATCAAC
TATACAAAAA TATTGGTTTG TCTGTGGGCA TTTTTACTGT TCTTTATTAT CATGCGTACA
CGATTAGCCA GCGAACTATG GCCATTAATA GCATTATTAT GTCTGGCATC GCTATGCTGC
AACTTACTTC TACTGACTCT GGATGAGTAT AATTACACCA TCTGGTACAT CAGTCGCGGG
ATTGAAGTCT CCAGTAAACT GTTTGTTGTG TCTTTTCTGA TTTATAACAT TTTTCAGGAA
CTGCAACTCT CCAGCAAACT GGCAGTTCAT GATGTGCTGA CCAATATTTA TAATCGGCGC
TACTTTTTCA ACAGCGTAGA GTCATTATTG TCGCGACCTG TTGTTAAGGA CTTCTGTGTC
ATGCTGGTTG ATATTAATCA GTTCAAACGC ATCAATGCCC AATGGGGACA TCGTGTGGGT
GATAAAGTGC TGGTTTCAAT TGTCGATATT ATCCAGCAAA GCATCCGCCC CGATGATATT
TTAGCGCGAC TGGAGGGTGA GGTGTTTGGC TTGCTATTTA CCGAACTCAA TAGTGCCCAG
GCAAAAATCA TTGCGGAACG TATGCGTAAA AATGTCGAAC TCCTGACCGG CTTTAGTAAC
AGATATGATG TTCCTGAACA AATGACCATC AGTATTGGCA CGGTTTTTTC AACGGGTGAC
ACGCGTAATA TCTCGCTTGT CATGACGGAA GCAGATAAAG CCTTACGCGA AGCGAAAAGC
GAGGGGGGCA ACAAAGTGAT TATTCATCAT ATTTAA
 
Protein sequence
MIQSTRISMG LFFKYFLSLT KIDLGQNYIS LPSIKSSTHI ALLFMVSMGT QKLKAQSFFI 
FSLLLTLILF CITTLYNENT NVKLIPQMNY LMVVVALFFL NAVIFLFMLM KYFTNKQILP
TLILSLAFLS GLIYLVETIV IIHKPINGST LIQTKSNDVS IFYIFRQLSF ICLTSLALFC
YGKDNILDNN KKKTGILLLA LIPFLVFPLL AHNLSSYNAD YSLYVVDYCP DNHTATWGIN
YTKILVCLWA FLLFFIIMRT RLASELWPLI ALLCLASLCC NLLLLTLDEY NYTIWYISRG
IEVSSKLFVV SFLIYNIFQE LQLSSKLAVH DVLTNIYNRR YFFNSVESLL SRPVVKDFCV
MLVDINQFKR INAQWGHRVG DKVLVSIVDI IQQSIRPDDI LARLEGEVFG LLFTELNSAQ
AKIIAERMRK NVELLTGFSN RYDVPEQMTI SIGTVFSTGD TRNISLVMTE ADKALREAKS
EGGNKVIIHH I