Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1002 |
Symbol | wzc |
ID | 6147421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1019915 |
End bp | 1022077 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615889 |
Product | tyrosine kinase |
Protein accession | YP_001743081 |
Protein GI | 170681107 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAAA AAGTAAAACA ACATGCCGCT CCGGTAACGG GCAGTGATGA AATCGATATT GGTCGCCTGG TCGGCACCGT CATTGAAGCG CGCTGGTGGG TAATTGGCAT CACCGCCGTA TTTGCCCTCT GTGCCGTGGT TTACACCTTC TTCGCCACGC CGATTTATAG TGCCGACGCA CTGGTACAAA TCGAGCAAAA CAGCGGCAAT TCGTTAGTGC AGGACATTGG TTCGGCATTA GCCAACAAAC CGCCAGCATC GGACGCCGAG ATCCAGTTGA TTCGTTCGCG TCTGGTGCTT GGTAAAACGG TGGACGATCT CGACCTCGAT ATTGCAGTGA GCAAAAACAC GTTCCCTATT TTCGGTGCGG GCTGGGATCG CCTGATGGGA CGCCAGAACG AGACGGTAAA AGTGACCACC TTTAACCGCC CGAAAGAGAT GGCGGATCAG GTGTTTACGC TTAATGTGCT GGACGATAAA AACTACACTC TGAGCAGCGA TGGCGGCTTT AGCGCCCGTG GGCAAGCGGG CCAGATGCTG AAAAAAGAAG GCGTCACGCT GATGGTTGAA GCCATTCACT CCAGCCCGGG CAGTGAGTTT ACCGTCACCA AATACTCCAC GCTGGGGATG ATCAACCAAC TGCAAAACAG CCTGACGGTA ACGGAGAACG GCAAAGACGC AGGCGTACTG AGCCTGACTT ATACCGGTGA AGATCGCGAA CAGATCCGCG ACATTCTTAA CAGCATCGCC CGTAACTATC AGGAACAAAA TATTGAGCGC AAATCCGCGG AAGCGTCGAA AAGCCTCGCC TTCCTCGCGC AACAGTTACC GGAAGTACGT AGCCGCCTTG ATGTTGCCGA AAATAAATTG AATGCCTTCC GTCAGGATAA AGATTCTGTT GATCTGCCGC TGGAAGCAAA AGCGGTGCTC GATTCGATGG TGAACATCGA CGCACAATTG AACGAACTGA CCTTTAAAGA AGCGGAAATC TCCAAGCTGT ACACCAAAGT TCACCCGGCG TATCGCACGC TGCTGGAGAA ACGGCAGGCG CTGGAAGACG AAAAAGCCAA ACTGAACGGT CGCGTAACGG CGATGCCGAA AACCCAGCAG GAGATTGTCC GTCTGACCCG CGATGTCGAG TCTGGGCAGC AGGTCTATAT GCAGCTGCTG AATAAAGAGC AGGAGCTGAA AATCACCGAA GCCAGCACCG TCGGCGATGT GCGCATTGTT GACCCGGCAA TCACCCAGCC AGGTGTGCTA AAACCGAAGA AAGGGCTGAT TATCCTTGGG GCGATTATCC TTGGCCTGAT GCTCTCTATC GTGGCTGTGC TGCTGCGCTC GTTGTTTAAT CGCGGCATTG AAAGCCCGCA AGTGCTGGAA GAACATGGCA TTAGCGTCTA TGCCAGCATC CCGCTGTCGG AGTGGCAGAA AGCGCGCGAT AGCGTCAAAA CCATCAAAGG GATTAAACGC TATAAACAGA GCCAGCTACT GGCGGTGGGG AATCCAACCG ATCTGGCGAT TGAAGCCATC CGTAGTCTGC GTACCAGTCT GCACTTCGCG ATGATGCAGG CGCAGAACAA TGTGTTGATG ATGACCGGGG TTAGCCCGTC AATCGGTAAA ACCTTTGTCT GCGCCAACCT GGCGGCGGTG ATTAGCCAGA CCAATAAACG GGTGTTGTTG ATCGACTGCG ATATGCGCAA AGGCTACACC CACGAGCTGT TGGGCACTAA TAACGTTAAT GGCCTGTCGG AAATTCTGAT TGGACAGGGC GATATTACTA CTGCTGCTAA ACCGACCTCT ATTGCCAAAT TTGACCTGAT CCCGCGCGGT CAGGTGCCGC CAAATCCTTC TGAACTGTTA ATGAGCGAAC GCTTTGCCGA ACTGGTGAAG TGGGCGAGTA AAAACTACGA CCTGGTGTTG ATTGATACGC CGCCGATTCT GGCAGTGACC GATGCGGCAA TTGTTGGTCG TCATGTCGGA ACCACGTTAA TGGTGGCGCG TTATGCGGTC AACACATTGA AAGAAGTGGA AACCAGTCTG AGCCGCTTTG AGCAAAATGG TATTCCGGTG AAAGGGGTGA TTCTGAACTC CATCTTCCGC CGCGCCAGCG CGTATCAGGA TTATGGCTAT TACGAATACG AATATAAGTC GGATGCGAAA TAA
|
Protein sequence | MTEKVKQHAA PVTGSDEIDI GRLVGTVIEA RWWVIGITAV FALCAVVYTF FATPIYSADA LVQIEQNSGN SLVQDIGSAL ANKPPASDAE IQLIRSRLVL GKTVDDLDLD IAVSKNTFPI FGAGWDRLMG RQNETVKVTT FNRPKEMADQ VFTLNVLDDK NYTLSSDGGF SARGQAGQML KKEGVTLMVE AIHSSPGSEF TVTKYSTLGM INQLQNSLTV TENGKDAGVL SLTYTGEDRE QIRDILNSIA RNYQEQNIER KSAEASKSLA FLAQQLPEVR SRLDVAENKL NAFRQDKDSV DLPLEAKAVL DSMVNIDAQL NELTFKEAEI SKLYTKVHPA YRTLLEKRQA LEDEKAKLNG RVTAMPKTQQ EIVRLTRDVE SGQQVYMQLL NKEQELKITE ASTVGDVRIV DPAITQPGVL KPKKGLIILG AIILGLMLSI VAVLLRSLFN RGIESPQVLE EHGISVYASI PLSEWQKARD SVKTIKGIKR YKQSQLLAVG NPTDLAIEAI RSLRTSLHFA MMQAQNNVLM MTGVSPSIGK TFVCANLAAV ISQTNKRVLL IDCDMRKGYT HELLGTNNVN GLSEILIGQG DITTAAKPTS IAKFDLIPRG QVPPNPSELL MSERFAELVK WASKNYDLVL IDTPPILAVT DAAIVGRHVG TTLMVARYAV NTLKEVETSL SRFEQNGIPV KGVILNSIFR RASAYQDYGY YEYEYKSDAK
|
| |