Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1217 |
Symbol | etk1 |
ID | 6968611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1226194 |
End bp | 1228374 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385212 |
Product | cryptic autophosphorylating protein tyrosine kinase Etk |
Protein accession | YP_002269707 |
Protein GI | 209400850 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACTA AAAATATGAA TACGCCACCA GGCAGCACTC AGGAAAATGA GATCGATCTG CTTCGTCTGG TCGGCGAGTT ATGGGATCAC CGTAAGTTTA TTATCAGCGT GACCGCGTTA TTCACGCTGA TCGCTGTCGC TTACTCGCTG TTAAGCACAC CAATTTATCA GGCAGATACT CTGGTCCAGG TTGAGCAAAA ACAGGGCAAC GCCATTCTCA GCGGCCTGAG TGATATGATC CCTAACTCAT CGCCCGAGTC TGCACCGGAG ATCCAACTGC TGCAATCGCG CATGATTCTC GGTAAAACCA TTGCTGAACT GAATCTGCGC GACATGGTTG AGCAGAAGTA TTTTCCGATT GTGGGTCGCG GCTGGGCGAG ATTAACCAAA GAGAAACCAG GTGAGCTGGC GATCAGCTGG ATGCATATTC CACAACTGAA TGGTCAGGAT CAGCAACTGA CACTCACGGT TGGGGAAAAC GGCCACTATA CGCTGGAAGG TGAAGAGTTC ACCGTCAATG GTATGGTCGG CCAGCGTCTG GAAAAAGATG GCGTTGCGCT GACTATCGCG GATATTAAGG CCAAACCAGG AACACAGTTT GTCCTGAGCC AACGTACCGA ACTGGAAGCG ATTAACGCGT TGCAGGAAAC CTTTACCGTT AGCGAACGCA GTAAAGAAAG CGGGATGCTG GAACTTACCA TGACTGGTGA TGATCCCCAG TTGATTACTC GTATTCTGAA CAGCATCGCT AACAACTATT TGCAACAGAA TATCGCTCGC CAGGCGGCGC AGGATTCACA AAGCCTTGAA TTCTTACAGC GCCAGTTGCC AGAAGTGCGC AGCGAGCTGG ACCAGGCGGA AGAAAAACTC AACGTTTATC GCCAGCAGCG CGATTCGGTT GACCTTAACC TGGAAGCCAA AGCCGTTCTT GAGCAGATTG TGAACGTTGA TAATCAACTC AATGAGCTGA CCTTCCGCGA GGCAGAGATC TCCCAGCTGT ATAAGAAAGA TCACCCAACT TATCGTGCGC TGCTGGAAAA ACGCCAGACG CTGGAGCAAG AACGCAAACG CCTGAATAAG CGGGTATCGG CTATGCCTTC CACCCAACAG GAAGTGTTGC GTTTAAGTCG TGACGTAGAA GCGGGCCGTG CGGTATATCT GCAATTACTT AACCGCCAGC AGGAGTTGAG TATTTCGAAA TCCAGTGCCA TTGGTAACGT GCGGATTATC GACCCGGCAG TCACTCAGCC GCAGCCAGTG AAACCGAAAA AAGCGTTGAA TGTGGTGCTT GGTTTTATTC TTGGCCTGTT TATTTCGGTG GGTGCCGTGC TGGCGCGTGC GATGTTGCGT CGTGGTGTAG AAGCCCCGGA ACAACTGGAA GAGCACGGCA TCAGCGTTTA TGCCACCATC CCGATGTCCG AGTGGCTGGA TAAACGTACC CGTCTGCGTA AGAAAAATTT ATTTTCTAAT CAGCAGCGCC ATCGTACTAA AAATATCCCC TTCCTGGCGG TGGATAACCC GGCGGATTCT GCTGTGGAAG CCGTACGTGC GCTACGAACC AGTCTGCATT TCGCTATGAT GGAGACTGAG AATAACATTC TGATGATCAC CGGTGCGACG CCAGACAGTG GTAAAACGTT TGTCAGTTCA ACTCTGGCAG CGGTGATCGC CCAGTCCGAT CAAAAAGTGT TATTTATTGA TGCCGACTTA CGCCGTGGTT ATTCGCATAA CCTGTTTACC GTGAGTAATG AACATGGCTT GTCGGAATAT CTGGCAGGTA AAGATGAGCT CAACAAAGTG ATCCAGCATT TTGGCAAAGG AGGCTTTGAT GTGATTACTC GCGGTCAGGT GCCACCTAAC CCATCTGAAC TGCTGATGCG CGATCGGATG CGTCAATTAC TGGAATGGGC GAACGACCAT TACGATCTGG TGATTGTTGA TACGCCGCCG ATGCTGGCGG TGAGCGATGC TGCGGTCGTG GGGCGCTCTG TCGGCACCAG CCTGCTGGTT GCGCGTTTTG GCTTGAACAC CGCCAAAGAG GTGAGCTTGT CAATGCAGCG TCTGGAACAG GCTGGCGTTA ATATTAAAGG CGCAATCCTC AATGGTGTGA TTAAACGCGC CAGCACCGCT TACAGTTACG GCTATAACTA TTACGGTTAT AGTTACTCCG AGAAAGAGTA A
|
Protein sequence | MTTKNMNTPP GSTQENEIDL LRLVGELWDH RKFIISVTAL FTLIAVAYSL LSTPIYQADT LVQVEQKQGN AILSGLSDMI PNSSPESAPE IQLLQSRMIL GKTIAELNLR DMVEQKYFPI VGRGWARLTK EKPGELAISW MHIPQLNGQD QQLTLTVGEN GHYTLEGEEF TVNGMVGQRL EKDGVALTIA DIKAKPGTQF VLSQRTELEA INALQETFTV SERSKESGML ELTMTGDDPQ LITRILNSIA NNYLQQNIAR QAAQDSQSLE FLQRQLPEVR SELDQAEEKL NVYRQQRDSV DLNLEAKAVL EQIVNVDNQL NELTFREAEI SQLYKKDHPT YRALLEKRQT LEQERKRLNK RVSAMPSTQQ EVLRLSRDVE AGRAVYLQLL NRQQELSISK SSAIGNVRII DPAVTQPQPV KPKKALNVVL GFILGLFISV GAVLARAMLR RGVEAPEQLE EHGISVYATI PMSEWLDKRT RLRKKNLFSN QQRHRTKNIP FLAVDNPADS AVEAVRALRT SLHFAMMETE NNILMITGAT PDSGKTFVSS TLAAVIAQSD QKVLFIDADL RRGYSHNLFT VSNEHGLSEY LAGKDELNKV IQHFGKGGFD VITRGQVPPN PSELLMRDRM RQLLEWANDH YDLVIVDTPP MLAVSDAAVV GRSVGTSLLV ARFGLNTAKE VSLSMQRLEQ AGVNIKGAIL NGVIKRASTA YSYGYNYYGY SYSEKE
|
| |