Gene EcHS_A2191 details


Gene Information

Locus tag: EcHS_A2191
Symbol: etk2
ID: 5594126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2172599
End bp: 2174767
Gene Length: 2169 bp
Protein Length: 722 aa
Translation table: 11
GC content: 36%
IMG OID: 640921324
Product: tyrosine-protein kinase etk
Protein accession: YP_001458863
Protein GI: 157161545
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01007] capsular exopolysaccharide family
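
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2172599
end_bp = 2174767
gene_length_bp = 2169
protein_length_aa = 722


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for the values listed in this record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.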


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000174737
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACCAG TTATTAATAA ACAAGCATCT CAGGAATCTG ATGAAATCGA TTTTGGTCGC 
TTAATCGGTG AATTGATTGA TCATCGCAAA TTGATAATAT CAGTAACTTC TCTATTTACC
GTTATTGCAT TGGTTTATTC CATTTTTTCA ACACCAGTGT ATCAGGCGGA TGCGTTGATT
CAAGTGGAGC AAAAACAAGG TAATAGCATT TTAAACAATC TTAGCCAAAT ACTTCCGGAT
AGTCAGCCGC AATCTGCACC TGAAATAGCA TTATTACAAT CTAGGATGAT TTTGGGGAAA
ACGGTAGATG ATCTAAACTT ACAGGCAGTT ATTAAGCAAG ATTACTTCCC TGTTTTTGGT
AGGGGGTTGG CTAGGTTGCT AGGGGACAAA CCAGGCAATT TAACTGTCGG GCGGTTATAT
ATAACAGGTA GCCATGATGA AATCCCGAAA ATTAAAATAA CAACTAATGA TAAGAATAGC
TATACGGTAG AATATAATGA TGAAAAGTTC AATGCTAGCA TTGGAAAGTT AGTAAATAAA
AATGATGTCA CAATTAAAAT TGATAAAATA GACGCTGAGC CAGGTACTAG CTTTACTGTA
TATTATACTT CTGAATTGGA TGCAATAAGT GCGTTACAGA AGAAATTCAA TATTTCAGAA
AAAGGCAAGG ATACGGGTAT TTTAAACTTG ACTCTTACAG GAGAAAATCC AAAACAAATT
AAAGAAATCA TTAATAGTAT TAGTGAGAAT TACCTTGCAC AAAATATCGC AAGGCAAGCA
GCGCAAGATG CAAAGAGTTT AGCATTCTTG AATGAGCAAT TACCAATTGT TAGAAGTGAT
TTAGATAACG CTGAAAGCAA ACTTAATCAG TATCGGAGGC AGAATGATTC GGTTGATTTA
TCCTTAGAAG CCAAATCTGT ACTTGACCAA ATAGTAAATG TAGACAACCA ATTAAATCAA
CTGACTTTTA GAGAGTCAGA AATATCACAA CTTTATACCA AAGAACATCC TACTTATAAG
GCCTTACTTG AAAAAAGAAA AACCTTACAA GATGAAAAGT TAAAGCTTAA TAAAAAAGTG
TCAGCCATGC CGGCAACACA GCAGGAAATA TTACGTCTAA ATCGTGATGT CGAATCTGGG
CGTGCTGTCT TCATGATGTT ACTTAACAGG CAACAAGAAC TGAATATAGC GAAATCAAGT
GCAATAGGTA ATGTTAGAAT TATTGATAAT GCAGTAACAC AACCTGAACC CGTAAAACCA
AAAAAAATTC TGGTTGTCAT AGTTGGTTTT GTTCTTGGCT TACTGATATC AATTGGTGTT
GTCTTATTAC GTGTATTTTT GCGAAGAGGA ATAGAAACCC CTGAGCAACT CGAAGAGTTG
GGTATAAATG TTTATGCAAG CATTCCTATT TCTGAGTTAC TTACCCAGAA AGCTACTAAA
TTAGAGGGTT TGAGAAGGAA AGAACAATCA GAGCCTCAAA CTTTTCTTGC GATTGAAAAT
CCTGCTGATA TTGCAATTGA AGCGATTAGA GGATTACGCA CAAGCCTGCA CTTTGCAATG
ATGGAGGCTA GAAACAATAT TTTAATGATA TCAGGTGCAA GCCCTAATGC AGGTAAGACT
TTTGTAAGTT CAAACTTATC AGCAGTAATA GCACAAACTG GAAAATCTGT TCTATATATT
GATACAGATA TGAGGAAAGG ATATGCACAT AAGCTCTTTG AGTTAGATAA CAATAACGGC
CTTTCTGAAA TATTGTCAGG GAAAGTTGAA GTATCCCAAG CAGTGAAAAA AGTACATAGC
GCTGGTTTCG ATTTCATTTC TCGTGGACAG GTTCCACCTA ACCCGGCCGA ATTACTTATG
CACAGACGTT TTGGAGAACT TTTAGCATGG GCAGAAAAAA AATATGATAT TGTGATTCTT
GATACTCCAC CTATTTTGGC TGTCACTGAT CCGGCAATTA TTGGGCATTA TGCAGGGACA
ACCTTGCTTG TCGCAAGGTT TGAAAAAAAC ACAGCCAAAG AAATTGAAAT TAGTGCGAAA
CGTTTTGAAA ATAGTGGTGT AATAGTAAAA GGTTGCATCT TAAACGGCGT GGTGAAGAAA
GCTAGTAGTT ATTATGGTTA TGGCTATAAT AATTATGGTT ACTCATATAA TGACAAGGAT
AAGCACTAA
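
The GC content field can be recomputed from the gene sequence. The sketch below is a hedged illustration: it assumes GC content means the fraction of G and C bases among all bases, and it strips the spaces that separate the 10-base groups in the listing. Only the first line of the sequence is embedded here to keep the example short; pasting the full sequence into `gene_fragment` gives the value for the whole gene. The names `gc_content` and `gene_fragment` are illustrative.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, in the same grouped layout as the listing.
gene_fragment = (
    "ATGTCACCAG TTATTAATAA ACAAGCATCT CAGGAATCTG ATGAAATCGA TTTTGGTCGC"
)

fragment_gc = gc_content(gene_fragment)
print(f"GC content of fragment: {fragment_gc:.1%}")
```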
 
Protein sequence
MSPVINKQAS QESDEIDFGR LIGELIDHRK LIISVTSLFT VIALVYSIFS TPVYQADALI 
QVEQKQGNSI LNNLSQILPD SQPQSAPEIA LLQSRMILGK TVDDLNLQAV IKQDYFPVFG
RGLARLLGDK PGNLTVGRLY ITGSHDEIPK IKITTNDKNS YTVEYNDEKF NASIGKLVNK
NDVTIKIDKI DAEPGTSFTV YYTSELDAIS ALQKKFNISE KGKDTGILNL TLTGENPKQI
KEIINSISEN YLAQNIARQA AQDAKSLAFL NEQLPIVRSD LDNAESKLNQ YRRQNDSVDL
SLEAKSVLDQ IVNVDNQLNQ LTFRESEISQ LYTKEHPTYK ALLEKRKTLQ DEKLKLNKKV
SAMPATQQEI LRLNRDVESG RAVFMMLLNR QQELNIAKSS AIGNVRIIDN AVTQPEPVKP
KKILVVIVGF VLGLLISIGV VLLRVFLRRG IETPEQLEEL GINVYASIPI SELLTQKATK
LEGLRRKEQS EPQTFLAIEN PADIAIEAIR GLRTSLHFAM MEARNNILMI SGASPNAGKT
FVSSNLSAVI AQTGKSVLYI DTDMRKGYAH KLFELDNNNG LSEILSGKVE VSQAVKKVHS
AGFDFISRGQ VPPNPAELLM HRRFGELLAW AEKKYDIVIL DTPPILAVTD PAIIGHYAGT
TLLVARFEKN TAKEIEISAK RFENSGVIVK GCILNGVVKK ASSYYGYGYN NYGYSYNDKD
KH
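
The protein sequence can be checked against the gene sequence by translating codons. This is a minimal sketch that does not depend on any external library: it builds the codon table from the usual TCAG ordering, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model). Only the first line and the last nine bases of the gene sequence are embedded; `translate`, `first_line`, and `last_codons` are illustrative names.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1; '*' marks a stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First line and final nine bases of the gene sequence listed above.
first_line = "ATGTCACCAG TTATTAATAA ACAAGCATCT CAGGAATCTG ATGAAATCGA TTTTGGTCGC"
last_codons = "AAGCACTAA"

print(translate(first_line))   # compare with the start of the protein sequence
print(translate(last_codons))  # compare with the end of the protein sequence
```

The first 20 residues produced this way match the opening of the listed protein sequence, and the final codons give the closing residues followed by a stop.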