Gene EcolC_3657 details


Gene Information

Locus tag: EcolC_3657
Symbol:
ID: 6066129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4005830
End bp: 4007254
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 54%
IMG OID: 641603072
Product: sensory histidine kinase CreC
Protein accession: YP_001726595
Protein GI: 170021641
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
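
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions consistent with the values listed, not statements taken from the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4005830, 4007254)
print(length)                     # 1425
print(protein_length_aa(length))  # 474
```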


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.958648
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTATCG GCATGCGGCT GCTGCTGGGC TATTTTTTAC TGGTGGCGGT GGCGGCCTGG 
TTCGTACTGG CTATTTTTGT CAAAGAAGTT AAACCGGGCG TGCGAAGAGC AACGGAGGGG
ACGTTAATCG ATACCGCAAC GTTGCTGGCG GAGCTGGCGC GTCCCGATTT GCTCTCTGGG
GACCCAACGC ATGGGCAACT GGCGCAGGCG TTTAATCAGC TACAACATCG CCCGTTTCGC
GCCAATATCG GTGGCATTAA CAAAGTGCGC AATGAATATC ATGTCTATAT GACCGATTCG
CAGGGTAAAG TATTGTTCGA TTCGGCAAAT AAAGCCGTTG GGCAGGATTA TTCGCGCTGG
AATGACGTCT GGCTAACGTT GCGTGGTCAG TATGGTGCGC GCAGCACGTT GCAAAATCCT
GCCGATCCCG AAAGTTCGGT GATGTATGTT GCCGCGCCGA TTATGAACGG CTCGCGGCTT
ATTGGCGTTT TGAGCGTAGG CAAACCGAAC GCGGCGATGG CTCCGGTCAT TAAGCGTAGC
GAGCGGCGAA TTTTATGGGC CAGCGCCATT TTGCTGGGGA TTGCACTGGT GATTGGCGCA
GGCATGGTTT GGTGGATCAA CCGCTCCATT GCCCGGCTCA CTCGCTATGC CGATTCCGTC
ACTGACAATA AGCCCGTTCC TCTTCCTGAA CTCGGCAGTA GCGAGTTGCG TAAGCTTGCG
CAGGCGCTGG AAAGTATGCG CGTGAAGCTG GAAGGGAAAA ACTATATTGA GCAGTATGTT
TATGCGTTAA CCCATGAGCT AAAAAGCCCA CTGGCGGCGA TTCGTAGCGC GGCGGAAATT
TTACGCGAAG GTCCGCCACC GGAAGTGGTG GCTCGTTTTA CCGACAACAT TCTGACGCAA
AATGCGCGTA TGCAGGCATT GGTAGAAACG TTACTACGCC AGGCAAGACT GGAGAATCGT
CAGGAAGTCG TTCTGACTGT TGTTGATGTG GCGGCATTAT TTCGCCGCGT CAGCGAAGCG
CGCACCGTGC AGTTGGCAGA AAAAAACATC ACTCTACATG TTATGCCCAC TGAGGTTAAT
GTTGCTGCTG AACCGGCGTT ACTGGAGCAG GCGCTGGGGA ATTTACTGGA TAACGCCATC
GATTTTACCC CCGAGAGCGG TCGCATAACG CTAAGCGCCG AAGTGGATCA GGAACACGTC
GCCCTTAAGG TGCTGGATAC CGGTAGTGGT ATTCCTGACT ACGCGCTGTC ACGTATTTTT
GAACGCTTTT ACTCTTTGCC GCGTGCAAAT GGGCAAAAAA GCAGCGGTCT GGGGTTGGCG
TTCGTCAGTG AGGTCGCCCG TTTGTTTAAC GGCGAAGTCA CGCTGCGCAA CGTGCAGGAA
GGTGGCGTGC TGGCCTCGCT TCGACTTCAC CGTCACTTCA CATAG
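
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length and that the sequence is pasted with the spaces and line breaks shown above; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence: 5 of 10 bases are G or C.
print(gc_percent("ATGCGTATCG"))  # 50.0
```

Passing the whole gene sequence gives a value that can be compared with the rounded figure in the GC content field.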
 
Protein sequence
MRIGMRLLLG YFLLVAVAAW FVLAIFVKEV KPGVRRATEG TLIDTATLLA ELARPDLLSG 
DPTHGQLAQA FNQLQHRPFR ANIGGINKVR NEYHVYMTDS QGKVLFDSAN KAVGQDYSRW
NDVWLTLRGQ YGARSTLQNP ADPESSVMYV AAPIMNGSRL IGVLSVGKPN AAMAPVIKRS
ERRILWASAI LLGIALVIGA GMVWWINRSI ARLTRYADSV TDNKPVPLPE LGSSELRKLA
QALESMRVKL EGKNYIEQYV YALTHELKSP LAAIRSAAEI LREGPPPEVV ARFTDNILTQ
NARMQALVET LLRQARLENR QEVVLTVVDV AALFRRVSEA RTVQLAEKNI TLHVMPTEVN
VAAEPALLEQ ALGNLLDNAI DFTPESGRIT LSAEVDQEHV ALKVLDTGSG IPDYALSRIF
ERFYSLPRAN GQKSSGLGLA FVSEVARLFN GEVTLRNVQE GGVLASLRLH RHFT
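
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration rather than the database's own method. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon.

    Whitespace is ignored; a KeyError is raised for codons with non-ACGT letters.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGCGTATCG GCATGCGGCT GCTGCTGGGC"))  # MRIGMRLLLG
```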