Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4202 |
Symbol | |
ID | 6146550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4303253 |
End bp | 4304113 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619025 |
Product | hypothetical protein |
Protein accession | YP_001746153 |
Protein GI | 170684276 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.000910072 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTACAGG TTACGCTTTG CGCATCGTCG TTGGGATATC TTTTCCCGCA GAATGAAACG CAAGATTTAC GTCTACATAG CGCTTTTAAA CATGCGGTAA ATCTGTATTC AGACGCGGGA ACATTCATCA CTCTGTTGTG TGCGCAAACG TATCTGAATC TCCCTGATGC AGCCAGAGTC TGGCTACCTG AATGTTGGGA CTGGCGAAGA GAGATTGCCC ATTCAGATCC CATTCAGTTA ACCCCAGGCT TACTGCGGAC ACCTCGATTT TGTGTGGCGC TCGAAAACGC AACGCTCTGG CAGTCGCCAT TTGTGGGGGG AATGCTGACA CTTGAGGCAT TCCCTTTGGT TTTCCAGCAC TACCCAACGA TGGCATCGCA ACGGCTTTTA TTTTGTCTGG AACATAACGT TCAAAGCACA TTGCACCTGC CGGATAGTCT CACGCACCAG GGATTAGCGA TTATGGAGCA TCCAGATGCG CTGGAACGCC AGGTGCCACA ACTGATCGGC TTTGGTAAAG GCTTAACGCC CGATGGAGAT GACTATTTGT TGGGCTATCT GGCAGCACTC TGGTTATGGC AACTCCCTGC ACCACTTGCC GATCATCAAT ACCGGTTGCA GCAGGTAATT GATCAACATG CGCACAACAC CACGGATATC AGTCGTCATT ATCTGGAGCG TGCGCTTCAG GGACATTTTT CAGAACCGAT TTGCCAGTTA CTCGCACAAC TGGTTGGGAG TGCATCGGCA ATGACAATCG CATCCTGTGC AGAACAGGTC ATGCAATTTG GCGCAACGTC AGGAGTAGAT TGCCTTGCAG GAATGCTGCA TGGCTTTCGA ACCCTGAACA CCATAAATTG A
|
Protein sequence | MVQVTLCASS LGYLFPQNET QDLRLHSAFK HAVNLYSDAG TFITLLCAQT YLNLPDAARV WLPECWDWRR EIAHSDPIQL TPGLLRTPRF CVALENATLW QSPFVGGMLT LEAFPLVFQH YPTMASQRLL FCLEHNVQST LHLPDSLTHQ GLAIMEHPDA LERQVPQLIG FGKGLTPDGD DYLLGYLAAL WLWQLPAPLA DHQYRLQQVI DQHAHNTTDI SRHYLERALQ GHFSEPICQL LAQLVGSASA MTIASCAEQV MQFGATSGVD CLAGMLHGFR TLNTIN
|
| |