Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3251 |
Symbol | |
ID | 6144240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3322667 |
End bp | 3327223 |
Gene Length | 4557 bp |
Protein Length | 1518 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618081 |
Product | hypothetical protein |
Protein accession | YP_001745231 |
Protein GI | 170683116 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAGA AATTTAAATA TAAGAAATCG CTTTTAGCGG CTATTTTGAG CGCAACCCTG TTAGCCGGTT GTGATGGCGG TGGTTCCGGA TCTTCCTCCG ATACGCCGCC TGTAGATTCT GGAACAGGGT CTTTGCCGGA AGTGAAACCT GATCCAACAC CAAACCCGGA GCCGACGCCT GAGCCAACGC CGGACCCAGA GCCTACGCCA GAACCGATAC CTGATCCTGA ACCAACACCA GAACCGGAGC CAGAACCTGT TCCTACGAAA ACGGGTTATC TGACCCTGGG CGGAAGCCAG CGGGTAACTG GTGCTACCTG TAATGGTGAA TCCAGCGATG GCTTTACATT TAAACCTGGC GAGGACGTTA CTTGCGTGGC GGGTAACACG ACAATTGCCA CCTTCAACAC TCAGTCAGAA GCTGCGCGTA GCTTGCGTGC GGTTGAAAAA GTGTCGTTTA GCCTTGAGGA CGCGCAAGAA CTGGCGGGCT CCGATGACAA GAAAAGCAAT GCGGTTTCGC TGGTAACGTC CAGTAACAGC TGTCCGGCGA ATACAGAACA GGTTTGTCTG ACGTTCTCCT CGGTGATCGA GAGTAAACGC TTCGACTCGC TGTATAAGCA AATCGATCTG GCACCGGAAG AGTTCAAAAA GCTGGTCAAT GAAGAGGTGG AAAACAATGC TGCGACCGAT AAAGCGCCAT CCACTCATAC TTCACCGGTC GTGCCCGTCA CCACGCCGGG AACAAAACCG GATCTGAACG CTTCCTTCGT GTCGGCTAAC GCGGAACAGT TTTATCAGTA TCAACCCACT GAAATCATTC TCTCTGAAGG TCGACTGGTC GATAGCCAGG GATATGGTGT TGCTGGCGTC AACTACTACA CCAATTCAGG CCGTGGCGTG ACAGGGGAAA ATGGTGAATT TTCCTTTAGC TGGGGCGAAA CCATCTCCTT TGGTATCGAT ACCTTTGAAC TGGGTTCAGT GCGCGGCAAT AAGTCGACCA TTGCGCTGAC TGAACTGGGT GATGAAGTTC GCGGGGCGAA TATTGATCAG CTTATTCATC GCTATTCGAC GACCGGGCAA AATAATACCC GTGTTGTTCC GGAGGATGTA CGCAAGGTCT TTGCCGAATA TCCCAACGTG ATCAACGAGA TTATCAATCT CTCGTTATCC AACGGTGCGA CGCTGGGGGA AGGTGAGCAA GTCGTTAATC TGCCTAACGA ATTTATTGAG CAGTTTAATA CGGGTCAGGC CAAAGAGATC GATACCGCGA TTTGTGCGAA AACCGATGGT TGTAACGAGG CTCGCTGGTT CTCGCTGACG ACGCGCAATG TTAATGACGG CCAGATTCAG GGCGTTATCA ACAAGCTGTG GGGCGTGGAT ACGAACTACA AATCTGTCAG CAAGTTCCAT GTATTCCATG ACTCCACCAA CTTCTATGGC AGCACGGGTA ATGCGCGCGG TCAGGCGGTG GTGAATATCT CCAACGCGGC CTTCCCGATT CTGATGGCGC GTAATGATAA AAACTACTGG CTGGCCTTCG GCGAGAAACG GGCCTGGGAT AAAAATGAGC TGGCGTACAT TACTGAAGCG CCTTCCATTG TGCGACCAGA GAACGTGACA CGCGAAACAG CCACCTTCAA CCTGCCGTTT ATTTCGCTGG GGCAAGTGGG CGATGGCAAG CTGATGGTTA TCGGTAACCC ACACTACAAC AGCATCCTGC GTTGCCCGAA CGGTTACAGC TGGAACGGGG GCGTTAATAA AGATGGGCAG TGTACGCTCA ACAGCGACCC GGATGACATG AAGAACTTCA TGGAGAACGT GCTGCGCTAT CTGTCAAATG ATCGCTGGTT GCCGGATGCA AAATCCAATA TGACCGTGGG TACTAACCTG GACACGGTGT ATTTCAAAAA ACACGGGCAG GTTACAGGAA ATAGTGCTGC GTTCGGCTTT CATCCGGATT TTGCGGGTAT CTCTGTTGAG CATTTAAGTA GCTATGGCGA TCTCGACCCG CAGGAAATGC CGCTGCTGAT CCTCAACGGC TTTGAGTATG TGACTCAGGT TGGTAACGAT CCTTATGCAA TCCCGCTGCG TGCAGATACC AGCAAACCGA AGCTGACCCA GCAGGATGTG ACCGATTTGA TCGCCTATAT GAACAAAGGT GGATCGGTGC TGATCATGGA AAACGTGATG AGCAATCTTA AGGAAGAGAG CGCATCTGGC TTTGTACGTC TGCTTGATGC CGCAGGTTTG TCGATGGCGC TTAACAAGTC GGTAGTAAAT AACGATCCGC AAGGCTACCC GGACCGCGTT CGTCAACGAC GTTCAACGCC AATTTGGGTC TATGAGCGTT ATCCGGCTGT CGATGGTAAA CCACCGTATA CCATTGATGA CACCACGAAA GAAGTTATCT GGAAATATCA GCAAGAAAAC AAACCTGATG ACAAACCGAA GCTGGAAGTT GCCAGCTGGC AGGAAGAAGT TGAGGGTAAA CAGGTAACTC AATTCGCCTT TATCGATGAA GCCGACCACA AAACGCCTGA GTCACTGGCT GCGGCGAAGA AGAGAATTCT GGACGCGTTC CCAGGGCTGG AAGAGTGTAA GGATTCTGAC TACCACTATG AGGTCAACTG TCTGGAATAT CGTCCTGGCA CGGGGGTTCC GGTTACTGGT GGCATGTATG TTCCACAGTA TACGCAACTA AGCCTTAACG CCGACACTGC GAAAGCGATG GTGCAGGCTG CGGATTTAGG CACCAACATT CAGCGTCTGT ATCAGCATGA GCTTTACTTC CGTACCAATG GTCGCAAAGG TGAGCGTCTG AGCAGCGTCG ATCTGGAACG TCTGTACCAG AACATGTCGG TCTGGCTGTG GAATAAAATT GAATATCGCT ATGAAAACGA CAAGGATGAC GAGCTGGGCT TTAAAACGTT CACCGAGTTC CTGAACTGTT ACGCCAACAA TGCTTATGAT GGTGGCACGC AGTGCTCCGC AGAGCTGAAA CAATCGCTGA TCGATAACAA GATGATCTAC GGTGAAGGCA GCAAAGCGGG CATGATGAAC CCGAGCTATC CGCTTAACTA TATGGAAAAA CCGCTGACGC GCCTGATGCT GGGGCGTTCC TGGTGGGATC TGAACATCAA GGTTGATGTC GAGAAGTATC CGGGGGCGGT ATCGGCTGAA GGTGAGGAGG TTACTGAAAC CATCAACCTG TACTCGAATC CGACCAAATG GTTTGCGGGT AACATGCAGT CTACTGGCCT GTGGGCTCCG GCTCAGCAGG AAGTCAGCAT TAAGTCCAAT GCGAAAGTCC CTGTGACTGT TACCGTGGCG CTGGCTGACG ACCTGACCGG GCGTGAGAAG CATGAGGTTG CGCTGAACCG TCCGCCAAGA GTGACTAAAA CATACTCTCT GGATGCTAGC GGCACGGTGA AGTTCAAGGT TCCTTACGGT GGTCTGATTT ATATCAAGAG CGACAGTAAA GAGGAGAAAT CAGCCAACTT CACCTTTACT GGCGTGGTAA AAGCGCCGTT CTATAAAGAC GGTAAATGGA AAAACGACCT GAAATCCCCT GCGCCGTTGG GTGAGCTGGA GTCTGCGTCG TTCGTCTATA CCACGCCGAA GAAGAACCTT GAGGCCAGCA ATTACAAGGG CGGTCTGAAA CAATTCGCTG AGGATCTGGA TACCTTTGCC AGCTCGATGA ATGACTTCTA CGGTCGTGAT GGCGAAAGCG GTAAGCACCG GATGTTTACC TATGAAGCAT TGACGGGGCA CAAACATCGT TTCACCAACG ATGTGCAGAT CTCCATCGGT GATGCGCACT CTGGTTATCC GGTGATGAAC AGCAGCTTCT CGCCGAACAG CACCACGCTG CCGACGACGC CGCTGAACGA CTGGCTGATC TGGCACGAAG TAGGGCACAA CGCTGCAGAA ACGCCGCTGA CTGTACCGGG CGCAACTGAA GTGGCGAACA ACGTGCTGGC GCTGTACATG CAGGATCGTT ATCTCGGCAA GATGAACCGT GTCGCTGACG ATATTACCGT TGCGCCGGAA TATCTGGAGG AGAGCAACGG TCAGGCATGG GCGCGTGGCG GTGCGGGTGA CCGTCTGCTG ATGTACGCGC AGCTGAAGGA ATGGGCAGAG AAAAACTTTG ATATCAAACA GTGGTATCCA GAAGGCTCTC TGCCAGCGTT CTACAGCGAG CGTGAAGGGA TGAAAGGCTG GAACCTGTTC CAGTTGATGC ACCGTAAAGC ACGCGGCGAT GATGTTGGCA ATGACAAATT TGGCAACAGA AACTACTGTG CCGAATCCAA CGGTAACGCT GCCGACACGC TGATGCTGTG TGCATCCTGG GTCGCTCAGA CGGACCTTTC CGCATTCTTT AAGAAATGGA ATCCGGGCGC GAATGCTTAC CAGTTGCCGG GAGCGACAGA GATGAGCTTC GAGGGCGGTG TGAGCCAGTC GGCTTACAAC ACGCTCGCGT CACTCGATCT GCCGAAACCG GAACAGGGAC CGGAAACCAT TAATCAGGTT ACCGAGCATA AGATGTCTGC CGAGTAA
|
Protein sequence | MNKKFKYKKS LLAAILSATL LAGCDGGGSG SSSDTPPVDS GTGSLPEVKP DPTPNPEPTP EPTPDPEPTP EPIPDPEPTP EPEPEPVPTK TGYLTLGGSQ RVTGATCNGE SSDGFTFKPG EDVTCVAGNT TIATFNTQSE AARSLRAVEK VSFSLEDAQE LAGSDDKKSN AVSLVTSSNS CPANTEQVCL TFSSVIESKR FDSLYKQIDL APEEFKKLVN EEVENNAATD KAPSTHTSPV VPVTTPGTKP DLNASFVSAN AEQFYQYQPT EIILSEGRLV DSQGYGVAGV NYYTNSGRGV TGENGEFSFS WGETISFGID TFELGSVRGN KSTIALTELG DEVRGANIDQ LIHRYSTTGQ NNTRVVPEDV RKVFAEYPNV INEIINLSLS NGATLGEGEQ VVNLPNEFIE QFNTGQAKEI DTAICAKTDG CNEARWFSLT TRNVNDGQIQ GVINKLWGVD TNYKSVSKFH VFHDSTNFYG STGNARGQAV VNISNAAFPI LMARNDKNYW LAFGEKRAWD KNELAYITEA PSIVRPENVT RETATFNLPF ISLGQVGDGK LMVIGNPHYN SILRCPNGYS WNGGVNKDGQ CTLNSDPDDM KNFMENVLRY LSNDRWLPDA KSNMTVGTNL DTVYFKKHGQ VTGNSAAFGF HPDFAGISVE HLSSYGDLDP QEMPLLILNG FEYVTQVGND PYAIPLRADT SKPKLTQQDV TDLIAYMNKG GSVLIMENVM SNLKEESASG FVRLLDAAGL SMALNKSVVN NDPQGYPDRV RQRRSTPIWV YERYPAVDGK PPYTIDDTTK EVIWKYQQEN KPDDKPKLEV ASWQEEVEGK QVTQFAFIDE ADHKTPESLA AAKKRILDAF PGLEECKDSD YHYEVNCLEY RPGTGVPVTG GMYVPQYTQL SLNADTAKAM VQAADLGTNI QRLYQHELYF RTNGRKGERL SSVDLERLYQ NMSVWLWNKI EYRYENDKDD ELGFKTFTEF LNCYANNAYD GGTQCSAELK QSLIDNKMIY GEGSKAGMMN PSYPLNYMEK PLTRLMLGRS WWDLNIKVDV EKYPGAVSAE GEEVTETINL YSNPTKWFAG NMQSTGLWAP AQQEVSIKSN AKVPVTVTVA LADDLTGREK HEVALNRPPR VTKTYSLDAS GTVKFKVPYG GLIYIKSDSK EEKSANFTFT GVVKAPFYKD GKWKNDLKSP APLGELESAS FVYTTPKKNL EASNYKGGLK QFAEDLDTFA SSMNDFYGRD GESGKHRMFT YEALTGHKHR FTNDVQISIG DAHSGYPVMN SSFSPNSTTL PTTPLNDWLI WHEVGHNAAE TPLTVPGATE VANNVLALYM QDRYLGKMNR VADDITVAPE YLEESNGQAW ARGGAGDRLL MYAQLKEWAE KNFDIKQWYP EGSLPAFYSE REGMKGWNLF QLMHRKARGD DVGNDKFGNR NYCAESNGNA ADTLMLCASW VAQTDLSAFF KKWNPGANAY QLPGATEMSF EGGVSQSAYN TLASLDLPKP EQGPETINQV TEHKMSAE
|
| |