Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1097 |
Symbol | etk |
ID | 5589119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1120691 |
End bp | 1122871 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640924800 |
Product | cryptic autophosphorylating protein tyrosine kinase Etk |
Protein accession | YP_001462213 |
Protein GI | 157154890 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0653674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACTA AAAATATGAA TACGCCACCA GGCAGCACTC AGGAAAATGA GATCGATCTG CTTCGTCTGG TCGGCGAGTT ATGGGATCAC CGTAAGTTTA TTATCAGCGT GACCGCGTTA TTCACGCTGA TCGCTGTCGC TTACTCGCTG TTAAGTACAC CAATTTATCA GGCAGATACT CTGGTCCAGG TTGAGCAAAA ACAGGGCAAC GCCATTCTCA GCGGCCTGAG CGATATGATC CCTAACTCAT CGCCCGAGTC TGCACCGGAG ATCCAACTGC TGCAATCGCG CATGATTCTC GGTAAAACCA TTGCTGAACT GAATCTGCGC GACATAGTTG AGCAGAAGTA TTTTCCGATT GTGGGTCGCG GCTGGGCGAG ATTAACCAAA GAAAAACCAG GTGAGCTGGC GATCAGCTGG ATGCATATTC CACAACTGAA TGGTCAGGAT CAGCAACTGA CACTCACGGT TGGGGAAAAC GGCCACTATA CACTGGAAGG TGAAGAGTTC ACCGTCAATG GTATGGTCGG CCAGCGTCTG GAAAAAGATG GCGTTGCGCT GACTATCGCG GACATTAAGG CCAAACCAGG AACACAGTTT GTCCTGAGCC AGCGTACCGA ACTGGAAGCG ATTAACGCAT TGCAGGAAAC CTTTACCGTT AGCGAACGCA GTAAAGAAAG CGGGATGCTG GAACTTACCA TGACTGGTGA TGATCCCCAG TTGATTACTC GTATTCTGAA CAGCATCGCT AACAACTATT TGCAACAGAA TATCGCTCGC CAGGCGGCGC AGGATTCACA AAGCCTTGAA TTCTTACAGC GCCAGTTACC TGAAGTGCGC AGCGAGCTGG ACCAGGCGGA AGAAAAACTC AACGTTTATC GCCAGCAGCG CGATTCGGTT GACCTTAACC TGGAAGCCAA AGCCGTTCTT GAGCAGATTG TGAACGTTGA TAATCAACTC AATGAGCTGA CCTTCCGCGA GGCAGAGATC TCCCAGCTGT ATAAGAAAGA TCACCCAACT TATCGTGCGC TGCTGGAAAA ACGCCAGACG CTGGAGCAAG AACGCAAACG CCTGAATAAG CGGGTATCGG CAATGCCTTC CACCCAACAG GAAGTGTTGC GTTTAAGTCG TGACGTAGAA GCGGGCCGTG CGGTATATCT GCAATTACTT AACCGCCAGC AGGAGTTGAG TATTTCGAAA TCCAGTGCCA TTGGTAACGT GCGGATTATC GACCCGGCAG TCACTCAGCC GCAGCCAGTG AAACCGAAAA AAGCGTTGAA TGTGGTGCTT GGTTTTATTC TTGGCCTGTT TATTTCTGTG GGTGCCGTGC TGGCGCGTGC GATGTTGCGT CGTGGTGTAG AAGCCCCGGA ACAACTGGAA GAGCACGGCA TCAGCGTTTA TGCCACTATC CCAATGTCCG AGTGGCTGGA TAAACGCACC CGTCTGCGTA AGAAAAATTT ATTTTCTAAT CAGCAGCGCC ATCGTACTAA AAATATCCCC TTCCTGGCGG TGGATAACCC GGCGGATTCT GCTGTGGAAG CCGTACGTGC GCTACGAACC AGTCTGCATT TCGCTATGAT GGAGACGGAG AATAACATTC TGATGATCAC CGGTGCGACG CCAGACAGTG GTAAAACGTT TGTCAGTTCA ACTCTGGCAG CGGTGATCGC CCAGTCCGAT CAAAAAGTGT TATTTATTGA TGCCGACTTA CGCCGTGGTT ATTCGCATAA CCTGTTTACC GTGAGTAATG AACATGGCTT GTCGGAATAT CTGGCAGGTA AAGATGAGCT CAACAAAGTG ATCCAGCATT TTGGCAAAGG AGGCTTTGAT GTGATTACTC GCGGTCAGGT GCCACCTAAC CCGTCTGAAC TGCTGATGCG CGATCGGATG CGTCAATTAC TGGAATGGGC GAACGACCAT TACGATCTGG TGATTGTCGA TACGCCGCCG ATGCTGGCGG TGAGTGATGC CGCGGTCGTG GGGCGTTCTG TTGGCACCAG CCTGCTGGTT GCGCGTTTTG GCTTGAACAC CGCCAAAGAG GTGAGTTTGT CAATGCAGCG TCTGGAACAG GCAGGCGTCA ATATTAAAGG CGCTATCCTC AATGGTGTGA TTAAACGCGC CAGCACCGCT TACAGTTACG GCTATAACTA TTACGGTTAT AGTTACTCCG AGAAAGAGTA A
|
Protein sequence | MTTKNMNTPP GSTQENEIDL LRLVGELWDH RKFIISVTAL FTLIAVAYSL LSTPIYQADT LVQVEQKQGN AILSGLSDMI PNSSPESAPE IQLLQSRMIL GKTIAELNLR DIVEQKYFPI VGRGWARLTK EKPGELAISW MHIPQLNGQD QQLTLTVGEN GHYTLEGEEF TVNGMVGQRL EKDGVALTIA DIKAKPGTQF VLSQRTELEA INALQETFTV SERSKESGML ELTMTGDDPQ LITRILNSIA NNYLQQNIAR QAAQDSQSLE FLQRQLPEVR SELDQAEEKL NVYRQQRDSV DLNLEAKAVL EQIVNVDNQL NELTFREAEI SQLYKKDHPT YRALLEKRQT LEQERKRLNK RVSAMPSTQQ EVLRLSRDVE AGRAVYLQLL NRQQELSISK SSAIGNVRII DPAVTQPQPV KPKKALNVVL GFILGLFISV GAVLARAMLR RGVEAPEQLE EHGISVYATI PMSEWLDKRT RLRKKNLFSN QQRHRTKNIP FLAVDNPADS AVEAVRALRT SLHFAMMETE NNILMITGAT PDSGKTFVSS TLAAVIAQSD QKVLFIDADL RRGYSHNLFT VSNEHGLSEY LAGKDELNKV IQHFGKGGFD VITRGQVPPN PSELLMRDRM RQLLEWANDH YDLVIVDTPP MLAVSDAAVV GRSVGTSLLV ARFGLNTAKE VSLSMQRLEQ AGVNIKGAIL NGVIKRASTA YSYGYNYYGY SYSEKE
|
| |