Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2995 |
Symbol | wzc |
ID | 6966601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2773700 |
End bp | 2775862 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643386835 |
Product | tyrosine kinase |
Protein accession | YP_002271303 |
Protein GI | 209396474 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.00970433 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGAAA AAGTAAAACA ACATGCCGCT CCGGTAACGG GCAGTGATGA AATCGATATT GGTCGCCTGG TCGGCACCGT CATTGAAGCG CGCTGGTGGG TGATTGGCAT CACCGCTGTA TTCGCCCTTT GTGCCGTGGT TTACACCTTC TTCGCCACGC CGATTTATAG TGCCGACGCA CTGGTACAAA TCGAGCAAAG CAGCGGCAAT TCGTTAGTGC AGGATATCGG ATCGGCGTTA GCCAACAAAC CGCCTGCATC GGACGCCGAG ATCCAGTTGA TTCGTTCGCG CCTGGTGCTT GGTAAAACGG TGGATGATCT CGACCTCGAT ATTGCGGTGA GCAAAAACAC GTTCCCGATT TTCGGTGCGG GCTGGGATCG CCTGATGGGA CGCCAGAACG AGACGGTGAA AGTGACTACC TTTAACCGTC CGAAAGAGAT GGAGGATCAG GTGTTTACGC TTAATGTGCT GGACAACAAA AACTACACCC TGAGCAGCGA TGGCGGCTTT AGCGCCCGTG GGCAAGCGGG CCAGATACTG AAAAAAGAAG GCGTCACGCT GATGGTTGAA GCCATTCACG CCCGCCCGGG CAGTGAGTTT ACCGTCACCA AATACTCCAC GCTGGGGATG ATCAATCAAC TGCAAAACAG CCTGACGGTA ACGGAGAACG GCAAAGACGC AGGCGTACTG AGCCTGACTT ATACCGGTGA AGATCGCGAA CAGATCCGCG ACATTCTTAA CAGCATCGCC CGTAACTATC AGGAACAAAA TATTGAGCGC AAATCGGCGG AAGCGTCGAA AAGCCTCGCT TTCCTCGCGC AACAGTTACC GGAAGTACGT AGCCGCCTTG ATGTTGCCGA AAACAAACTG AATGCCTTCC GTCAGGATAA AGATTCTGTT GATCTGCCGC TGGAAGCGAA AGCGGTGCTC GATTCGATGG TGAACATCGA CGCCCAGTTG AACGAACTGA CCTTTAAAGA GGCGGAAATC TCCAAGCTGT ACACCAAAGT TCACCCCGCG TACCGCACGC TGCTGGAGAA ACGTCAGGCG CTGGAAGACG AAAAAGCCAA ACTTAATGGT CGCGTAACGG CGATGCCGAA AACCCAGCAG GAAATTGTCC GTCTGACCCG CGATGTCGAG TCTGGTCAGC AGGTCTATAT GCAACTGCTG AATAAAGAGC AGGAGCTGAA AATCACCGAG GCCAGCACCG TCGGCGATGT GCGCATTGTT GACCCGGCAA TCACTCAGCC TGGTGTGCTA AAACCGAAGA AAGGGCTGAT TATCCTTGGG GCGATTATCC TTGGCCTGAT GCTCTCTATC GTGGGGGTGC TGCTGCGCTC GTTGTTTAAT CGCGGCATCG AAAGCCCGCA GGTGCTGGAA GAACACGGTA TCAGCGTCTA TGCCAGCATC CCGCTGTCGG AGTGGCAGAA AGCGCGCGAT AGCGTCAAAA CCATCAAAGG GATTAAACGC TATAAACAGA GCCAGCTACT GGCGGTGGGG AATCCAACCG ATCTGGCGAT TGAAGCCATC CGCAGCCTTC GTACCAGTTT GCACTTCGCG ATGATGCAGG CGCAGAACAA TGTGTTGATG ATGACCGGGG TTAGCCCGTC AATCGGTAAA ACCTTTGTCT GCGCCAACCT GGCGGCGGTG ATCAGCCAGA CCAATAAACG CGTGTTGTTG ATCGACTGCG ATATGCGCAA AGGCTACACC CATGAGCTGT TGGGCACTAA TAACGTTAAT GGCCTGTCGG AAATTCTGAT TGGTCAGGGC GATATTACTA CAGCTGCTAA ACCGACCTCT ATTGCCAAAT TTGACCTGAT CCCGCGCGGT CAGGTACCGC CAAATCCTTC TGAACTGTTG ATGAGCGAAC GCTTTGCCGA ACTGGTGAAC TGGGCGAGTA AAAACTACGA CCTGGTGTTG ATTGATACGC CGCCGATTCT GGCAGTGACC GATGCGGCAA TTGTTGGTCG TCATGTCGGA ACCACGTTAA TGGTGGCGCG TTATGCGGTC AACACATTGA AAGAAGTGGA AACCAGTCTG AGCCGCTTTG AGCAAAACGG TATTCCGGTG AAAGGGGTGA TTCTGAACTC CATCTTCCGC CGCGCCAGCG CGTATCAGGA TTATGGCTAT TACGAATACG AATATAAGTC GGATGCGAAA TAA
|
Protein sequence | MTEKVKQHAA PVTGSDEIDI GRLVGTVIEA RWWVIGITAV FALCAVVYTF FATPIYSADA LVQIEQSSGN SLVQDIGSAL ANKPPASDAE IQLIRSRLVL GKTVDDLDLD IAVSKNTFPI FGAGWDRLMG RQNETVKVTT FNRPKEMEDQ VFTLNVLDNK NYTLSSDGGF SARGQAGQIL KKEGVTLMVE AIHARPGSEF TVTKYSTLGM INQLQNSLTV TENGKDAGVL SLTYTGEDRE QIRDILNSIA RNYQEQNIER KSAEASKSLA FLAQQLPEVR SRLDVAENKL NAFRQDKDSV DLPLEAKAVL DSMVNIDAQL NELTFKEAEI SKLYTKVHPA YRTLLEKRQA LEDEKAKLNG RVTAMPKTQQ EIVRLTRDVE SGQQVYMQLL NKEQELKITE ASTVGDVRIV DPAITQPGVL KPKKGLIILG AIILGLMLSI VGVLLRSLFN RGIESPQVLE EHGISVYASI PLSEWQKARD SVKTIKGIKR YKQSQLLAVG NPTDLAIEAI RSLRTSLHFA MMQAQNNVLM MTGVSPSIGK TFVCANLAAV ISQTNKRVLL IDCDMRKGYT HELLGTNNVN GLSEILIGQG DITTAAKPTS IAKFDLIPRG QVPPNPSELL MSERFAELVN WASKNYDLVL IDTPPILAVT DAAIVGRHVG TTLMVARYAV NTLKEVETSL SRFEQNGIPV KGVILNSIFR RASAYQDYGY YEYEYKSDAK
|
| |