Gene EcolC_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1849 
Symbol 
ID6065589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2047846 
End bp2049780 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content50% 
IMG OID641601263 
Productputative serine protein kinase, PrkA 
Protein accessionYP_001724825 
Protein GI170019871 
COG category[T] Signal transduction mechanisms 
COG ID[COG2766] Putative Ser protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.159984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0123748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATAT TCGATCACTA TCGCCAGCGA TATGAAGCTG CCAAGGACGA AGAGTTCACA 
CTGCAGGAGT TTCTTACCAC TTGTCGGCAA GATCGCAGTG CTTATGCCAA CGCGGCTGAG
CGGCTATTGA TGGCTATCGG TGAGCCTGTC ATGGTCGATA CAGCCCAGGA ACCCAGACTT
TCTCGACTCT TTTCTAACCG GGTCATTGCA CGTTATCCGG CGTTTGAAGA GTTTTACGGC
ATGGAAGACG CGATTGAACA GATTGTCTCT TATCTGAAAC ACGCGGCTCA GGGGCTGGAA
GAGAAGAAAC AAATCCTGTA TCTGCTGGGG CCTGTGGGTG GGGGTAAATC ATCGCTTGCT
GAGCGACTGA AATCATTAAT GCAGCTCGTA CCGATTTATG TATTGAGCGC GAACGGTGAG
CGTAGCCCGG TCAACGATCA TCCGTTCTGT CTTTTCAATC CGCAGGAAGA TGCGCAGATT
CTGGAAAAAG AGTATGGCAT TCCTCGCCGT TATCTCGGCA CCATCATGTC GCCGTGGGCG
GCAAAACGCC TGCATGAATT TGGTGGCGAT ATCACTAAGT TCCGGGTAGT GAAGGTCTGG
CCGTCAATTC TGCAACAAAT TGCTATCGCC AAAACGGAAC CCGGCGATGA GAATAACCAG
GACATCTCCG CGCTGGTTGG GAAAGTCGAT ATTCGTAAAC TCGAACACTA CGCGCAGAAT
GACCCGGACG CCTACGGCTA TTCCGGTGCG CTGTGCCGCG CCAATCAGGG GATCATGGAA
TTCGTTGAGA TGTTTAAAGC ACCGATTAAA GTGCTGCATC CCTTGTTAAC CGCCACTCAG
GAAGGTAACT ACAACGGGAC GGAAGGTATC TCCGCCCTGC CGTTCAACGG GATTATTCTC
GCACACTCGA ACGAGTCCGA ATGGGTCACT TTCCGTAATA ACAAAAACAA CGAAGCCTTC
CTCGATCGTG TTTACATCGT GAAGGTGCCG TATTGCTTGC GCATTTCCGA AGAGATCAAA
ATCTACGAGA AATTGCTTAA TCACAGTGAA TTGACTCACG CCCCATGCGC CCCTGGCACG
CTCGAAACAC TGTCACGTTT TTCCATTCTT TCGCGCCTGA AAGAGCCAGA AAACTCCAGC
ATTTATTCAA AGATGCGGGT TTATGATGGC GAAAGTCTGA AAGACACCGA TCCCAAAGCC
AAGTCGTATC AGGAATATCG TGACTACGCC GGTGTCGATG AAGGGATGAA CGGTCTGTCG
ACGCGTTTTG CGTTTAAGAT CCTCTCCCGC GTGTTCAACT TCGATCATGT AGAAGTGGCA
GCAAACCCGG TCCATCTGTT CTACGTCCTG GAACAGCAGA TTGAGCGCGA GCAGTTCCCA
CAAGAGCAGG CAGAACGCTA TCTGGAGTTC CTGAAAGGTT ATCTGATCCC GAAATATGCC
GAGTTTATCG GCAAAGAGAT CCAGACGGCC TACCTTGAAT CCTATTCCGA ATATGGGCAA
AACATTTTCG ACCGTTATGT TACCTACGCG GATTTCTGGA TTCAGGATCA GGAGTATCGC
GATCCGGATA CCGGGCAGCT GTTTGACCGC GAGTCTCTTA ACGCCGAGCT GGAGAAAATC
GAGAAACCGG CGGGGATCAG TAATCCAAAA GATTTCCGCA ACGAGATTGT TAACTTCGTA
CTGCGCGCCA GAGCGAATAA CAGCGGACGC AATCCGAACT GGACCAGCTA TGAAAAACTG
CGCACGGTCA TCGAGAAGAA AATGTTCTCC AATACCGAGG AGCTGTTGCC GGTTATCTCG
TTTAACGCCA AAACGTCAAC CGACGAGCAG AAGAAACACG ACGACTTTGT CGACCGTATG
ATGGAAAAAG GCTACACCCG TAAACAGGTG CGTTTACTGT GCGAATGGTA TTTGCGCGTA
CGTAAATCGT CTTAA
 
Protein sequence
MNIFDHYRQR YEAAKDEEFT LQEFLTTCRQ DRSAYANAAE RLLMAIGEPV MVDTAQEPRL 
SRLFSNRVIA RYPAFEEFYG MEDAIEQIVS YLKHAAQGLE EKKQILYLLG PVGGGKSSLA
ERLKSLMQLV PIYVLSANGE RSPVNDHPFC LFNPQEDAQI LEKEYGIPRR YLGTIMSPWA
AKRLHEFGGD ITKFRVVKVW PSILQQIAIA KTEPGDENNQ DISALVGKVD IRKLEHYAQN
DPDAYGYSGA LCRANQGIME FVEMFKAPIK VLHPLLTATQ EGNYNGTEGI SALPFNGIIL
AHSNESEWVT FRNNKNNEAF LDRVYIVKVP YCLRISEEIK IYEKLLNHSE LTHAPCAPGT
LETLSRFSIL SRLKEPENSS IYSKMRVYDG ESLKDTDPKA KSYQEYRDYA GVDEGMNGLS
TRFAFKILSR VFNFDHVEVA ANPVHLFYVL EQQIEREQFP QEQAERYLEF LKGYLIPKYA
EFIGKEIQTA YLESYSEYGQ NIFDRYVTYA DFWIQDQEYR DPDTGQLFDR ESLNAELEKI
EKPAGISNPK DFRNEIVNFV LRARANNSGR NPNWTSYEKL RTVIEKKMFS NTEELLPVIS
FNAKTSTDEQ KKHDDFVDRM MEKGYTRKQV RLLCEWYLRV RKSS