Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4686 |
Symbol | |
ID | 6147003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4784536 |
End bp | 4785693 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641619502 |
Product | hypothetical protein |
Protein accession | YP_001746610 |
Protein GI | 170683978 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0147095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAAGC GCAGTTGGTT ATTAATCGCC GCATTACCTC CCTCTATTTC ACCTTCCTGG GGCGCGGATT TTTATTACCG CCAGCAGGAG AAAGGCACGG TTTATGTTGT CGAACAGAAG GGGGAAAAAG ATGAGATCCT CTCCGAATTA CCAGATATTA ATTTTTCCCG CCTTTGGCGT ATTGCCAATT TAGCCAATAA ACAAGATTCC CGGTTACTGT CCGATTTTAA TCCCGATAAG TTCGATTGCG ATGATGAGGG GGATTGCGAA CATGCCTGGC TCACCGATGG ACGCTCTGTT CTTTGGTCTG GCAAAGTCCT GAAAAATCCC CCCGGTAAAC CTATAGTCGA CGCTGCCAGT TTTCAGGCAT TCGGCGCTTT CGCTGCTGAT AAACGCAGTA TCTATTTTGA TGGTCAGCGT ACCGATGATA ATAGCGGTGA TAAGCAGGTG GATATGTCAA CGCTTGAAGA GACGGACATC TGGAATTTAC TGCGTGATAA AAATAGTCTT TGGCATAAGG GGCACTGGTT GGGAAGCGCT GACGGATTTC AAATCCTGCG GCATGATTCC TCCCTGCAAT TTGTTGTGCA GACAAATTCG CAGGTGATTG TTAATGGCAA GCCACTGCCC GCCGATCGCA AAACTTTTCA GATTAAACGT TGGATGCCTG GCGAACGCTT AGTTTATCGC GATAAAAGCG GCGAGCGTGA CTATGAGCTG GAGGATACCA GCTATCGCTG TGCACCTTTT AATATTGGTC TGAATAACGT GTCCTGGCTC AAATATGAAG CCACTCCAGC GGGCAGTGAG TGTATTTATG AAACGCTGGC GGGAGTTGAT CCGGAATATT TTTATCTGTT TGTTCGGAAT ACCGGTTTAT ATAAGAACCA AATATATAAA GTCACAATTA ACGCCCTGGG CGAAGGTGAG TTGGTTAATC TCAAGCCAGA GGATCTCTCC GACTCACTTG AAGCAGGGGG TAGTTGGGGA TTAACTAACA CGTTTATATC AACAGACGGG CAGCTTTACA CTCAACAAGC GACTGGAATT GGGAAAGAAC ACGCCCAACA AGGTGAATGG CTGCGTTATA ACTTAGGCAA GGGAGGTTGG TTATCGGTGA AGCAACCCCC GAGCGGGCTT AAACCCTTAT TTAAATAA
|
Protein sequence | MVKRSWLLIA ALPPSISPSW GADFYYRQQE KGTVYVVEQK GEKDEILSEL PDINFSRLWR IANLANKQDS RLLSDFNPDK FDCDDEGDCE HAWLTDGRSV LWSGKVLKNP PGKPIVDAAS FQAFGAFAAD KRSIYFDGQR TDDNSGDKQV DMSTLEETDI WNLLRDKNSL WHKGHWLGSA DGFQILRHDS SLQFVVQTNS QVIVNGKPLP ADRKTFQIKR WMPGERLVYR DKSGERDYEL EDTSYRCAPF NIGLNNVSWL KYEATPAGSE CIYETLAGVD PEYFYLFVRN TGLYKNQIYK VTINALGEGE LVNLKPEDLS DSLEAGGSWG LTNTFISTDG QLYTQQATGI GKEHAQQGEW LRYNLGKGGW LSVKQPPSGL KPLFK
|
| |