Gene EcSMS35_4950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4950 
SymbolcreC 
ID6143407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5063433 
End bp5064857 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content54% 
IMG OID641619753 
Productsensory histidine kinase CreC 
Protein accessionYP_001746857 
Protein GI170683413 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.861306 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCG GCATGCGGTT GTTGCTGGGC TATTTTTTAC TGGTGGCGGT GGCAGCCTGG 
TTCGTACTGG CCATTTTTGT CAAAGAAGTT AAACCGGGCG TGCGAAGAGC AACGGAGGGG
ACGTTGATCG ACACCGCAAC GTTGCTGGCG GAGCTGGCGC GTCCCGATTT GCTCTCTGGG
GACCCAACGC ATGGGCAACT GGCGCAGGCG TTTAATCAGC TACAACATCG CCCGTTTCGC
GCCAATATCG GTGGCATTAA CAAAGTGCGC AATGAATATC ATGTCTATAT GACCGATGCG
CAGGGCAAAG TATTGTTCGA TTCGGCAAAT AAAGCCGTTG GACAGGATTA TTCGCGCTGG
AATGACGTCT GGCTAACGTT GCGTGGTCAG TATGGTGCGC GCAGCACGTT GCAAAATCCT
GCCGATCCCG AAAGTTCTGT GATGTATGTT GCCGCACCGA TTATGGACGG CTCGCGGCTT
ATTGGCGTTT TGAGCGTAGG CAAACCGAAC GCGGCGATGG CTCCGGTCAT TAAGCGTAGC
GAGCGGCGAA TTTTATGGGC CAGCGCCATT TTGTTGGGGA TTGCACTGGT GATTGGCGCA
GGCATGGTTT GGTGGATCAA CCGCTCTATT GCCCGGCTCA CTCGCTATGC TGATTCCGTC
ACTGACAATA AGCCCGTTCC TCTCCCCGAT CTCGGTAGTA GCGAGTTGCG TAAACTCGCG
CAGGCGCTGG AAAGTATGCG CGTGAAGCTG GAAGGGAAAA ACTATATTGA GCAGTATGTT
TATGCGTTAA CCCATGAGCT AAAAAGCCCA CTGGCGGCGA TTCGTGGCGC GGCGGAAATT
TTACGCGAAG GTCCGCCGCC GGAAGTGGTG GCTCGTTTTA CTGACAACAT TCTGACGCAA
AATGCGCGTA TGCAGGCACT GGTGGAAACG TTACTACGCC AGGCAAGACT GGAGAATCGT
CAGGAAGTCG TTCTGACTGT TGTTGATGTG GCGGCATTAT TCCGCCGCGT CAGCGAAGCG
CGCACCGTGC AGTTGGCAGA AAAAAAAATC ACTCTGCATG TTATGCCCAC CGAGGTTAAC
GTTGCTGCTG AACCGACGTT ACTGGAGCAG GCGCTGGGGA ATTTACTGGA TAACGCCATC
GATTTTACCC CCGAGAGCGG TCGTATAACG CTAAGCGCCG AAGTGGAGCA GGAACACGTC
ACGCTTAAGG TGCTGGATAC CGGTAGTGGT ATTCCTGACT ACGCGCTTTC ACGTATTTTT
GAACGCTTTT ACTCTTTGCC GCGTGCAAAT GGGCAAAAAA GCAGCGGTCT GGGGTTGGCG
TTCGTCAGTG AGGTCGCCCG TTTGTTTAAC GGCGAAGTCA CGCTGCACAA CGTGCAGGAA
GGTGGCGTGC TGGCCTCGCT TCGACTTCAC CGTCACTTCA CATAG
 
Protein sequence
MRIGMRLLLG YFLLVAVAAW FVLAIFVKEV KPGVRRATEG TLIDTATLLA ELARPDLLSG 
DPTHGQLAQA FNQLQHRPFR ANIGGINKVR NEYHVYMTDA QGKVLFDSAN KAVGQDYSRW
NDVWLTLRGQ YGARSTLQNP ADPESSVMYV AAPIMDGSRL IGVLSVGKPN AAMAPVIKRS
ERRILWASAI LLGIALVIGA GMVWWINRSI ARLTRYADSV TDNKPVPLPD LGSSELRKLA
QALESMRVKL EGKNYIEQYV YALTHELKSP LAAIRGAAEI LREGPPPEVV ARFTDNILTQ
NARMQALVET LLRQARLENR QEVVLTVVDV AALFRRVSEA RTVQLAEKKI TLHVMPTEVN
VAAEPTLLEQ ALGNLLDNAI DFTPESGRIT LSAEVEQEHV TLKVLDTGSG IPDYALSRIF
ERFYSLPRAN GQKSSGLGLA FVSEVARLFN GEVTLHNVQE GGVLASLRLH RHFT