Gene EcolC_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1839 
Symbol 
ID6066355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2037948 
End bp2038973 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content48% 
IMG OID641601253 
Productdiguanylate cyclase with GAF sensor 
Protein accessionYP_001724815 
Protein GI170019861 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.730013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.326474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGATC AGATTATCGC CCGCGTCTCG CAATCCCTTG CCAAAGAACA GTCACTGGAA 
AGTCTGGTCC GACAGCTTCT GGAGATGCTG GAAATGGTCA CTGATATGGA ATCAACCTAC
CTGACCAAAG TGGATGTCGA AGCGCGCCTG CAGCATATTA TGTTTGCCCG TAACAGCCAG
AAAATGCACA TCCCGGAGAA TTTTACCGTC TCGTGGGATT ACTCGTTATG CAAACGCGCC
ATTGATGAAA ACTGCTTTTT CAGCGATGAA GTCCCCGACC GTTGGGGTGA CTGTATTGCG
GCACGCAATC TTGGCATCAC CACATTTCTG AGCACGCCAA TTCACTTACC GGATGGATCA
TTCTATGGCA CGCTTTGCGC CGCCAGCAGT GAGAAGCGCC AGTGGAGTGA ACGCGCGGAA
CAGGTTTTAC AGTTATTCGC CGGACTGATT GCACAATATA TTCAAAAAGA GGCACTGGTT
GAACAGCTGC GCGAAGCCAA TGCTGCGCTG ATTGCGCAAT CGTATACCGA CTCGTTAACC
GGGCTACCGA ATCGGCGGGC GATTTTTGAA AATCTGACGA CACTGTTTTC CCTCGCCCGG
CATCTTAACC ATAAGATAAT GATCGCGTTT ATCGATCTGG ATAACTTCAA ATTAATCAAT
GATCGTTTTG GTCATAATAG TGGCGATCTG TTTCTCATTC AGGTTGGCGA GCGCCTTAAT
ACGCTCCAGC AAAATGGCGA AGTTATTGGT CGTCTCGGCG GTGATGAGTT TTTAGTTGTT
TCACTAAACA ACGAGAATGC GGATATTTCG TCGCTGCGAG AACGCATTCA GCAGCAAATA
CGTGGAGAAT ATCACTTAGG TGATGTTGAT TTGTATTATC CCGGTGCCAG TCTTGGCATA
GTAGAAGTCG ATCCTGAAAC AACCGATGCA GACAGTGCCC TGCATGCTGC CGATATTGCG
ATGTATCAGG AGAAAAAACA CAAACAGAAA ACACCTTTTG TCGCGCATCC AGCGCTACAT
TCCTGA
 
Protein sequence
MSDQIIARVS QSLAKEQSLE SLVRQLLEML EMVTDMESTY LTKVDVEARL QHIMFARNSQ 
KMHIPENFTV SWDYSLCKRA IDENCFFSDE VPDRWGDCIA ARNLGITTFL STPIHLPDGS
FYGTLCAASS EKRQWSERAE QVLQLFAGLI AQYIQKEALV EQLREANAAL IAQSYTDSLT
GLPNRRAIFE NLTTLFSLAR HLNHKIMIAF IDLDNFKLIN DRFGHNSGDL FLIQVGERLN
TLQQNGEVIG RLGGDEFLVV SLNNENADIS SLRERIQQQI RGEYHLGDVD LYYPGASLGI
VEVDPETTDA DSALHAADIA MYQEKKHKQK TPFVAHPALH S