Gene Information |
Locus tag | EcHS_A2191 |
Symbol | etk2 |
ID | 5594126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2172599 |
End bp | 2174767 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640921324 |
Product | tyrosine-protein kinase etk |
Protein accession | YP_001458863 |
Protein GI | 157161545 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
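The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the CDS length includes the stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(2172599, 2174767)
print(length)                  # 2169
print(protein_length(length))  # 722
```

Both values agree with the Gene Length (2169 bp) and Protein Length (722 aa) rows of the table.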
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0000000174737 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACCAG TTATTAATAA ACAAGCATCT CAGGAATCTG ATGAAATCGA TTTTGGTCGC TTAATCGGTG AATTGATTGA TCATCGCAAA TTGATAATAT CAGTAACTTC TCTATTTACC GTTATTGCAT TGGTTTATTC CATTTTTTCA ACACCAGTGT ATCAGGCGGA TGCGTTGATT CAAGTGGAGC AAAAACAAGG TAATAGCATT TTAAACAATC TTAGCCAAAT ACTTCCGGAT AGTCAGCCGC AATCTGCACC TGAAATAGCA TTATTACAAT CTAGGATGAT TTTGGGGAAA ACGGTAGATG ATCTAAACTT ACAGGCAGTT ATTAAGCAAG ATTACTTCCC TGTTTTTGGT AGGGGGTTGG CTAGGTTGCT AGGGGACAAA CCAGGCAATT TAACTGTCGG GCGGTTATAT ATAACAGGTA GCCATGATGA AATCCCGAAA ATTAAAATAA CAACTAATGA TAAGAATAGC TATACGGTAG AATATAATGA TGAAAAGTTC AATGCTAGCA TTGGAAAGTT AGTAAATAAA AATGATGTCA CAATTAAAAT TGATAAAATA GACGCTGAGC CAGGTACTAG CTTTACTGTA TATTATACTT CTGAATTGGA TGCAATAAGT GCGTTACAGA AGAAATTCAA TATTTCAGAA AAAGGCAAGG ATACGGGTAT TTTAAACTTG ACTCTTACAG GAGAAAATCC AAAACAAATT AAAGAAATCA TTAATAGTAT TAGTGAGAAT TACCTTGCAC AAAATATCGC AAGGCAAGCA GCGCAAGATG CAAAGAGTTT AGCATTCTTG AATGAGCAAT TACCAATTGT TAGAAGTGAT TTAGATAACG CTGAAAGCAA ACTTAATCAG TATCGGAGGC AGAATGATTC GGTTGATTTA TCCTTAGAAG CCAAATCTGT ACTTGACCAA ATAGTAAATG TAGACAACCA ATTAAATCAA CTGACTTTTA GAGAGTCAGA AATATCACAA CTTTATACCA AAGAACATCC TACTTATAAG GCCTTACTTG AAAAAAGAAA AACCTTACAA GATGAAAAGT TAAAGCTTAA TAAAAAAGTG TCAGCCATGC CGGCAACACA GCAGGAAATA TTACGTCTAA ATCGTGATGT CGAATCTGGG CGTGCTGTCT TCATGATGTT ACTTAACAGG CAACAAGAAC TGAATATAGC GAAATCAAGT GCAATAGGTA ATGTTAGAAT TATTGATAAT GCAGTAACAC AACCTGAACC CGTAAAACCA AAAAAAATTC TGGTTGTCAT AGTTGGTTTT GTTCTTGGCT TACTGATATC AATTGGTGTT GTCTTATTAC GTGTATTTTT GCGAAGAGGA ATAGAAACCC CTGAGCAACT CGAAGAGTTG GGTATAAATG TTTATGCAAG CATTCCTATT TCTGAGTTAC TTACCCAGAA AGCTACTAAA TTAGAGGGTT TGAGAAGGAA AGAACAATCA GAGCCTCAAA CTTTTCTTGC GATTGAAAAT CCTGCTGATA TTGCAATTGA AGCGATTAGA GGATTACGCA CAAGCCTGCA CTTTGCAATG ATGGAGGCTA GAAACAATAT TTTAATGATA TCAGGTGCAA GCCCTAATGC AGGTAAGACT TTTGTAAGTT CAAACTTATC AGCAGTAATA GCACAAACTG GAAAATCTGT TCTATATATT GATACAGATA TGAGGAAAGG ATATGCACAT AAGCTCTTTG AGTTAGATAA CAATAACGGC CTTTCTGAAA TATTGTCAGG GAAAGTTGAA GTATCCCAAG CAGTGAAAAA AGTACATAGC 
GCTGGTTTCG ATTTCATTTC TCGTGGACAG GTTCCACCTA ACCCGGCCGA ATTACTTATG CACAGACGTT TTGGAGAACT TTTAGCATGG GCAGAAAAAA AATATGATAT TGTGATTCTT GATACTCCAC CTATTTTGGC TGTCACTGAT CCGGCAATTA TTGGGCATTA TGCAGGGACA ACCTTGCTTG TCGCAAGGTT TGAAAAAAAC ACAGCCAAAG AAATTGAAAT TAGTGCGAAA CGTTTTGAAA ATAGTGGTGT AATAGTAAAA GGTTGCATCT TAAACGGCGT GGTGAAGAAA GCTAGTAGTT ATTATGGTTA TGGCTATAAT AATTATGGTT ACTCATATAA TGACAAGGAT AAGCACTAA
|
Protein sequence | MSPVINKQAS QESDEIDFGR LIGELIDHRK LIISVTSLFT VIALVYSIFS TPVYQADALI QVEQKQGNSI LNNLSQILPD SQPQSAPEIA LLQSRMILGK TVDDLNLQAV IKQDYFPVFG RGLARLLGDK PGNLTVGRLY ITGSHDEIPK IKITTNDKNS YTVEYNDEKF NASIGKLVNK NDVTIKIDKI DAEPGTSFTV YYTSELDAIS ALQKKFNISE KGKDTGILNL TLTGENPKQI KEIINSISEN YLAQNIARQA AQDAKSLAFL NEQLPIVRSD LDNAESKLNQ YRRQNDSVDL SLEAKSVLDQ IVNVDNQLNQ LTFRESEISQ LYTKEHPTYK ALLEKRKTLQ DEKLKLNKKV SAMPATQQEI LRLNRDVESG RAVFMMLLNR QQELNIAKSS AIGNVRIIDN AVTQPEPVKP KKILVVIVGF VLGLLISIGV VLLRVFLRRG IETPEQLEEL GINVYASIPI SELLTQKATK LEGLRRKEQS EPQTFLAIEN PADIAIEAIR GLRTSLHFAM MEARNNILMI SGASPNAGKT FVSSNLSAVI AQTGKSVLYI DTDMRKGYAH KLFELDNNNG LSEILSGKVE VSQAVKKVHS AGFDFISRGQ VPPNPAELLM HRRFGELLAW AEKKYDIVIL DTPPILAVTD PAIIGHYAGT TLLVARFEKN TAKEIEISAK RFENSGVIVK GCILNGVVKK ASSYYGYGYN NYGYSYNDKD KH
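The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is an illustrative sketch only, not the pipeline the database used. It assumes the listed gene sequence is already the coding strand (the gene lies on the minus strand of the replicon), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The name `translate` is a hypothetical helper.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; amino acids follow the same order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCACCAG TTATTAATAA ACAAGCATCT"))  # MSPVINKQAS
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GACAAGGAT AAGCACTAA"))               # DKDKH
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above. A sequence that begins with an alternative start codon such as GTG or TTG would need its first residue set to M separately, which this sketch does not do.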