Gene ECH74115_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1217 
Symboletk1 
ID6968611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1226194 
End bp1228374 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content51% 
IMG OID643385212 
Productcryptic autophosphorylating protein tyrosine kinase Etk 
Protein accessionYP_002269707 
Protein GI209400850 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0489] ATPases involved in chromosome partitioning
[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACTA AAAATATGAA TACGCCACCA GGCAGCACTC AGGAAAATGA GATCGATCTG 
CTTCGTCTGG TCGGCGAGTT ATGGGATCAC CGTAAGTTTA TTATCAGCGT GACCGCGTTA
TTCACGCTGA TCGCTGTCGC TTACTCGCTG TTAAGCACAC CAATTTATCA GGCAGATACT
CTGGTCCAGG TTGAGCAAAA ACAGGGCAAC GCCATTCTCA GCGGCCTGAG TGATATGATC
CCTAACTCAT CGCCCGAGTC TGCACCGGAG ATCCAACTGC TGCAATCGCG CATGATTCTC
GGTAAAACCA TTGCTGAACT GAATCTGCGC GACATGGTTG AGCAGAAGTA TTTTCCGATT
GTGGGTCGCG GCTGGGCGAG ATTAACCAAA GAGAAACCAG GTGAGCTGGC GATCAGCTGG
ATGCATATTC CACAACTGAA TGGTCAGGAT CAGCAACTGA CACTCACGGT TGGGGAAAAC
GGCCACTATA CGCTGGAAGG TGAAGAGTTC ACCGTCAATG GTATGGTCGG CCAGCGTCTG
GAAAAAGATG GCGTTGCGCT GACTATCGCG GATATTAAGG CCAAACCAGG AACACAGTTT
GTCCTGAGCC AACGTACCGA ACTGGAAGCG ATTAACGCGT TGCAGGAAAC CTTTACCGTT
AGCGAACGCA GTAAAGAAAG CGGGATGCTG GAACTTACCA TGACTGGTGA TGATCCCCAG
TTGATTACTC GTATTCTGAA CAGCATCGCT AACAACTATT TGCAACAGAA TATCGCTCGC
CAGGCGGCGC AGGATTCACA AAGCCTTGAA TTCTTACAGC GCCAGTTGCC AGAAGTGCGC
AGCGAGCTGG ACCAGGCGGA AGAAAAACTC AACGTTTATC GCCAGCAGCG CGATTCGGTT
GACCTTAACC TGGAAGCCAA AGCCGTTCTT GAGCAGATTG TGAACGTTGA TAATCAACTC
AATGAGCTGA CCTTCCGCGA GGCAGAGATC TCCCAGCTGT ATAAGAAAGA TCACCCAACT
TATCGTGCGC TGCTGGAAAA ACGCCAGACG CTGGAGCAAG AACGCAAACG CCTGAATAAG
CGGGTATCGG CTATGCCTTC CACCCAACAG GAAGTGTTGC GTTTAAGTCG TGACGTAGAA
GCGGGCCGTG CGGTATATCT GCAATTACTT AACCGCCAGC AGGAGTTGAG TATTTCGAAA
TCCAGTGCCA TTGGTAACGT GCGGATTATC GACCCGGCAG TCACTCAGCC GCAGCCAGTG
AAACCGAAAA AAGCGTTGAA TGTGGTGCTT GGTTTTATTC TTGGCCTGTT TATTTCGGTG
GGTGCCGTGC TGGCGCGTGC GATGTTGCGT CGTGGTGTAG AAGCCCCGGA ACAACTGGAA
GAGCACGGCA TCAGCGTTTA TGCCACCATC CCGATGTCCG AGTGGCTGGA TAAACGTACC
CGTCTGCGTA AGAAAAATTT ATTTTCTAAT CAGCAGCGCC ATCGTACTAA AAATATCCCC
TTCCTGGCGG TGGATAACCC GGCGGATTCT GCTGTGGAAG CCGTACGTGC GCTACGAACC
AGTCTGCATT TCGCTATGAT GGAGACTGAG AATAACATTC TGATGATCAC CGGTGCGACG
CCAGACAGTG GTAAAACGTT TGTCAGTTCA ACTCTGGCAG CGGTGATCGC CCAGTCCGAT
CAAAAAGTGT TATTTATTGA TGCCGACTTA CGCCGTGGTT ATTCGCATAA CCTGTTTACC
GTGAGTAATG AACATGGCTT GTCGGAATAT CTGGCAGGTA AAGATGAGCT CAACAAAGTG
ATCCAGCATT TTGGCAAAGG AGGCTTTGAT GTGATTACTC GCGGTCAGGT GCCACCTAAC
CCATCTGAAC TGCTGATGCG CGATCGGATG CGTCAATTAC TGGAATGGGC GAACGACCAT
TACGATCTGG TGATTGTTGA TACGCCGCCG ATGCTGGCGG TGAGCGATGC TGCGGTCGTG
GGGCGCTCTG TCGGCACCAG CCTGCTGGTT GCGCGTTTTG GCTTGAACAC CGCCAAAGAG
GTGAGCTTGT CAATGCAGCG TCTGGAACAG GCTGGCGTTA ATATTAAAGG CGCAATCCTC
AATGGTGTGA TTAAACGCGC CAGCACCGCT TACAGTTACG GCTATAACTA TTACGGTTAT
AGTTACTCCG AGAAAGAGTA A
 
Protein sequence
MTTKNMNTPP GSTQENEIDL LRLVGELWDH RKFIISVTAL FTLIAVAYSL LSTPIYQADT 
LVQVEQKQGN AILSGLSDMI PNSSPESAPE IQLLQSRMIL GKTIAELNLR DMVEQKYFPI
VGRGWARLTK EKPGELAISW MHIPQLNGQD QQLTLTVGEN GHYTLEGEEF TVNGMVGQRL
EKDGVALTIA DIKAKPGTQF VLSQRTELEA INALQETFTV SERSKESGML ELTMTGDDPQ
LITRILNSIA NNYLQQNIAR QAAQDSQSLE FLQRQLPEVR SELDQAEEKL NVYRQQRDSV
DLNLEAKAVL EQIVNVDNQL NELTFREAEI SQLYKKDHPT YRALLEKRQT LEQERKRLNK
RVSAMPSTQQ EVLRLSRDVE AGRAVYLQLL NRQQELSISK SSAIGNVRII DPAVTQPQPV
KPKKALNVVL GFILGLFISV GAVLARAMLR RGVEAPEQLE EHGISVYATI PMSEWLDKRT
RLRKKNLFSN QQRHRTKNIP FLAVDNPADS AVEAVRALRT SLHFAMMETE NNILMITGAT
PDSGKTFVSS TLAAVIAQSD QKVLFIDADL RRGYSHNLFT VSNEHGLSEY LAGKDELNKV
IQHFGKGGFD VITRGQVPPN PSELLMRDRM RQLLEWANDH YDLVIVDTPP MLAVSDAAVV
GRSVGTSLLV ARFGLNTAKE VSLSMQRLEQ AGVNIKGAIL NGVIKRASTA YSYGYNYYGY
SYSEKE