Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2188 |
Symbol | |
ID | 6971236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2094628 |
End bp | 2096478 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643386081 |
Product | hypothetical protein |
Protein accession | YP_002270570 |
Protein GI | 209399696 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.162911 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0000018311 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGATTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGGAAAAG CTGACACACA AACTGAAAGA GGGCTGGCAG CCGTTTGGTA GTCCGGTGGC CATAACCCCT TATACCCTGA TGCAGGCGAT TACAGCAGAA GGTGATGTGG TGGTCAGTGG TGCAACTGAG CCGGATTGGT ACTACGTCAT CGTACTGGCC GGGCAGTCCA ATGCCATGGC TTACGGTGAA GGGCTTCCGC TGCCGGATTC ATACGATGCT CCGGATCCGC GCATTAAACA GCTGGCGCGC CGCAGTACAG TGACGCCGGG TGGGGCTGCC TGCAGATATA ACGATATTAT TCCGGCCGAC CACTGCCTGC ATGATGTGCA GGATATGAGT ACGCTGAATC ATCCGAAGGC AGACCTGAGC AAAGGGCAGT ACGGCTGTGT CGGCCAGGGG TTACATATTG CCAAAAAACT GCTCCCGTAT ATCCCGAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC GGCATTTACC CAGGGCGCGG AGGGGATATT CAGCGAGTCC ACGGGGGCCA GCCAGGATTC GGCGCGCTGG GGTGTGGGTA AACCGTTATA TCAGGACCTG ATTGCGCGCA CCAAAGCTGC ATTACAGAAG AACCCGAAAA ATGTGTTGCT GGCGGTGTGC TGGATGCAGG GAGAGTTTGA CATGAGCGCC GCCACCCACG CACAGCAACC TGCGCTGTTT ACAGCCATGC TGACACAGTT TCGTGCTGAC CTCTCCGTGT TTAACGCGCA GTGCCATGGT GGCAGTGCTG CAGATGTGCC GTGGATTTGT GGTGACACGA CGTATTACTG GAAAAATACA TACGCTACCC AGTACGACAC CGTGTACGGC GGGTATAAAA ACAGGGAGAG TGAGGGCGTT TATTTTGTGC CCTTCATGAC AGACGGTAAC GGCGTCAATA CCGCCACTAA CGCGCCGGCA GAAGATCCGG ATATTCCGGC ATCAGGATAT TACGGTGCGG CATCGAGAAC GAATGGAAAC CAGGTATCAT CAAACCGCCC GACACATTTC AGTTCATGGG CGCGCAGGAG CATTATTCCG GATCGTCTGG CAACCGCTAT TCTGAACGCA GCCGGGCGCA CCTCAGCCTT CATCAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC GGCAACACGC CATCGGGTCC GTCTGCAGAT ACGTCCGTTC GCACAATCTC CCTGCTGCCG GCAGCCGGAG AGGCTGCTGC GCAGGGCTGG AGCATTAAGG ATGGCGGAAT TCAGTTGTCA GATGGTGTAT TTAAGATCAC CAGGCAGAGC AATAAAACCT GGTCCCTGAC GCATCCGGTG GATGACGCAA TTACCCTGCT GACACAGGGC GGCAGACTGA CCTGTAAGTT CCGCCTGTCA GGCGCACTGA CCAACAATCA GTTCGGGCTG GGGATTTATC TGTATACGGA CGCTCCCGTT CCTGATGGTG TGGCGATGAC GGGTACCGGT AATCCGTTCC TGATGTCGTA CTTCACTCAG ACCACTGACG GCAGAGTGAA TCTGATGCAT CACAGGAAAG CCGGAAACAC GAAGCTGGGG GAGTTCGGCG ATTACGGTAA CGACTGGCAG ACGCTGGAGC TGGTGTTCAC CGCCGGCAGT GCCACGGTTA CTCCGAAACT GAATGGAGTG GCTGGCCCGG CATTCCAGGT TATAAAAGAC AGTCTGACAC TGGGACTGAA TGCGCTGACG CTGACGGATG TTACAAAAAA TGCAGCGTAT GGCGTTGAGA TAGAAAGTCT GATGCTGGAG ATAAATGCAC CGGCAGCATA A
|
Protein sequence | MSIKHYDVVR AASPSDLAEK LTHKLKEGWQ PFGSPVAITP YTLMQAITAE GDVVVSGATE PDWYYVIVLA GQSNAMAYGE GLPLPDSYDA PDPRIKQLAR RSTVTPGGAA CRYNDIIPAD HCLHDVQDMS TLNHPKADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT QGAEGIFSES TGASQDSARW GVGKPLYQDL IARTKAALQK NPKNVLLAVC WMQGEFDMSA ATHAQQPALF TAMLTQFRAD LSVFNAQCHG GSAADVPWIC GDTTYYWKNT YATQYDTVYG GYKNRESEGV YFVPFMTDGN GVNTATNAPA EDPDIPASGY YGAASRTNGN QVSSNRPTHF SSWARRSIIP DRLATAILNA AGRTSAFISG KAPEIKPSPG GNTPSGPSAD TSVRTISLLP AAGEAAAQGW SIKDGGIQLS DGVFKITRQS NKTWSLTHPV DDAITLLTQG GRLTCKFRLS GALTNNQFGL GIYLYTDAPV PDGVAMTGTG NPFLMSYFTQ TTDGRVNLMH HRKAGNTKLG EFGDYGNDWQ TLELVFTAGS ATVTPKLNGV AGPAFQVIKD SLTLGLNALT LTDVTKNAAY GVEIESLMLE INAPAA
|
| |