Gene EcolC_1079 details

Gene Information

Locus tag: EcolC_1079
Symbol:
ID: 6065565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1169295
End bp: 1170521
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 49%
IMG OID: 641600491
Product: hypothetical protein
Protein accession: YP_001724073
Protein GI: 170019119
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
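
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the helper names span_length and expected_protein_length are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1169295
end_bp = 1170521
gene_length_bp = 1227
protein_length_aa = 408


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed present)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(start_bp, end_bp))            # 1227
print(expected_protein_length(gene_length_bp))  # 408
```

Both results agree with the Gene Length and Protein Length fields.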


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.217552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00135165
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGGATA ACGATAATTC TCTTAATAAG CGCCCCACGT TTAAAAGAGC ATTACGCAAC 
ATCAGTATCA CCAGCATATT TATCACTATG ATGCTGATCT GGTTGCTGCT TTCCGTGACC
TCGGTGCTGA CCCTGAAACA GTACGCGCAA AAAAACCTGG CACTGACAGC AGCAACAATG
ACTTACAGTC TGGAAGCAGC TGTCGTTTTT GCCGATGGCC CTGCAGCAAC TGAAACACTG
GCAGCGCTGG GCCAGCAAGG GCAATTTTCA ACTGCAGAAG TACGTGATAA GCAGCAAAAT
ATTCTGGCGT CCTGGCATTA CACCCGTAAG GATCCAGGCG ATACTTTCAG CAATTTCATA
AGCCACTGGC TCTTCCCCGC CCCCATCATT CAGCCGATTC GTCACAATGG TGAAACCATT
GGCGAAGTAC GCTTAACCGC TCGCGACAGT TCAATCAGCC ATTTTATCTG GTTTTCGCTC
GCCGTACTGA CCGGTTGTAT TCTGCTGGCA TCAGGCATCG CAATTACCCT CACCCGCCAT
TTGCACAATG GCCTGGTGGA AGCACTGAAA AATATCACCG ATGTCGTACA TGATGTGCGT
TCCAACCGCA ATTTTTCCCG ACGAGTTTCG GAAGAACGTA TCGCTGAGTT TCACCGCTTC
GCTCTCGACT TCAACAGTCT GCTGGATGAA ATGGAAGAGT GGCAGCTTCG TTTACAGGCT
AAAAATGCGC AGCTTCTACG TACCGCGCTA CATGACCCAT TAACCGGGCT GGCTAACCGC
GCAGCGTTTC GTAGCGGCAT CAACACGTTG ATGAACAATT CCGATGCCCG AAAAACGTCG
GCGTTACTAT TTCTTGATGG CGATAATTTC AAATACATCA ATGATACCTG GGGTCATGCG
ACGGGCGATA GAGTCTTGAT TGAAATCGCA AAACGGTTAG CTGAAGTTGG CGGGCTGCGA
CATAAAGCAT ACCGCCTGGG CGGCGATGAA TTCGCTATGG TGCTCTATGA TGTACAGTCA
GAATCTGAAG TGCAGCAGAT ATGCTCAGCA CTGACACAAA TCTTTAATCT CCCGTTTGAT
CTTCATAATG GTCATCAGAC CACCATGACA TTAAGCATTG GTTACGCGAT GACCATTGAG
CACGCCTCTG CGGAAAAATT ACAAGAGCTT GCCGATCACA ATATGTATCA GGCCAAACAC
CAGCGTGCCG AAAAGCTGGT GAGATAA
 
Protein sequence
MMDNDNSLNK RPTFKRALRN ISITSIFITM MLIWLLLSVT SVLTLKQYAQ KNLALTAATM 
TYSLEAAVVF ADGPAATETL AALGQQGQFS TAEVRDKQQN ILASWHYTRK DPGDTFSNFI
SHWLFPAPII QPIRHNGETI GEVRLTARDS SISHFIWFSL AVLTGCILLA SGIAITLTRH
LHNGLVEALK NITDVVHDVR SNRNFSRRVS EERIAEFHRF ALDFNSLLDE MEEWQLRLQA
KNAQLLRTAL HDPLTGLANR AAFRSGINTL MNNSDARKTS ALLFLDGDNF KYINDTWGHA
TGDRVLIEIA KRLAEVGGLR HKAYRLGGDE FAMVLYDVQS ESEVQQICSA LTQIFNLPFD
LHNGHQTTMT LSIGYAMTIE HASAEKLQEL ADHNMYQAKH QRAEKLVR
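
The relationship between the gene sequence and the protein sequence above can be checked with a short sketch. It assumes the sequences are pasted in the 10-character grouped layout shown here, and it builds the codon table by hand instead of relying on an external library. The amino acid assignments in translation table 11 are the same as in the standard code; the table differs in which codons may act as start codons, which does not matter here because the gene starts with ATG. The function names clean, gc_percent, and translate are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence in this record.
print(translate("ATGATGGATA ACGATAATTC TCTTAATAAG"))  # MMDNDNSLNK
print(translate("CAGCGTGCCG AAAAGCTGGT GAGATAA"))     # QRAEKLVR
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to gc_percent gives a value that can be compared with the GC content field.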