Gene Information |
Locus tag | ECH74115_1037 |
Symbol | |
ID | 6967466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1052559 |
End bp | 1053551 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643385051 |
Product | hypothetical protein |
Protein accession | YP_002269551 |
Protein GI | 209397130 |
COG category | [S] Function unknown |
COG ID | [COG2990] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
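
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information rows above.
start_bp = 1052559
end_bp = 1053551
gene_length_bp = 993
protein_length_aa = 330

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein length.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("CDS length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the tabulated values (993 bp spans 331 codons, one of which is the stop codon).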
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAAAT CGACGTCATG TACAACCATT GATTTCATGA ATATGTCGCA GCTAACCGAA CGGACCTTTA CGTCATCTGA ATCTCTCAGC AGCCTGTCAC TTTTTCTTAG TCTGGCACGT GGACAGTGTC GGCCGGGTAA ATTCTGGCAT CGCCGTAGTT TTCGCCAGAA ATTTTTGCTG CGCTCGTTGA TTATGCCGCG TTTAAGCGTT GAGTGGATGA ACGAACTTTC CCACTGGCCT AATCTCAATG TATTGTTAAC GCGCCAGCCG CGACTGCCTG TGCGTCTGCA TCGCCCTTAC CTTGCGGCGA ATCTTAGCCG TAAGCAATTG CTGGAGGCGT TACGTTACCA TTATGCGTTA CTCCGTGGAT GTATGTCGGC GGAAGAATTC AGCTTATATT TGAATACCCC CGGGCTGCAA CTGGCGAAGC TGGAAGGCAA AAACGGCGAG CAGTTCACGC TTGAGCTGAC CATGATGATC TCAATGGATA AAGAAGGTGA CAGCACAATC CTGTTCCGCA ACAGCGAAGG TATTCCTCTG GCAGAAATCA CGTTTACCCT GTGTGAATAT CAGGGGAAAA GAACGATGTT TATTGGTGGA CTGCAAGGCG CAAAATGGGA AATTCCACAT CAGGAAATCC AGAATGCGAC GAAAGCCTGC CACGGGCTAT TTCCCAAACG CCTCGTGATG GAAGCGGCCT GTCTGTTTGC CCAACGTTTG CAGGTAGAGC AGATTATTGC CGTCAGCAAT GAAACGCATA TTTACCGCAG CCTGCGTTAT CGCGATAAAG AAGGCAAGAT CCATGCCGAT TACAACGCTT TCTGGGAATC GGTTGGCGGC GTATGTGATG CTGAACGCCA TTACCGCCTT CCAGCACAGA TAGCACGAAA AGAGATTGCC GAAATCGCCA GTAAAAAACG GGCTGAATAC CGTCGGCGCT ATGAGATGCT CGACGCTATT CAGCCACAAA TGGCCACGAT GTTTCGCGGT TAA
|
Protein sequence | MVKSTSCTTI DFMNMSQLTE RTFTSSESLS SLSLFLSLAR GQCRPGKFWH RRSFRQKFLL RSLIMPRLSV EWMNELSHWP NLNVLLTRQP RLPVRLHRPY LAANLSRKQL LEALRYHYAL LRGCMSAEEF SLYLNTPGLQ LAKLEGKNGE QFTLELTMMI SMDKEGDSTI LFRNSEGIPL AEITFTLCEY QGKRTMFIGG LQGAKWEIPH QEIQNATKAC HGLFPKRLVM EAACLFAQRL QVEQIIAVSN ETHIYRSLRY RDKEGKIHAD YNAFWESVGG VCDAERHYRL PAQIARKEIA EIASKKRAEY RRRYEMLDAI QPQMATMFRG
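
The sequences above are displayed in space-separated blocks of 10 residues. As a hedged illustration of how the two rows relate, the sketch below strips that spacing from the first three blocks of the gene sequence and translates them with the standard codon assignments, which translation table 11 shares for elongation codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only a 30-nucleotide prefix is used to keep the example short; the full gene sequence can be pasted in the same way.

```python
# Build the codon table from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return sum(base in "GC" for base in dna) / len(dna)

# First three blocks of the Gene sequence row (a prefix, not the full gene).
gene_prefix = clean("ATGGTGAAAT CGACGTCATG TACAACCATT")
# First block of the Protein sequence row.
protein_prefix = clean("MVKSTSCTTI")

print(translate(gene_prefix) == protein_prefix)
print(f"GC fraction of prefix: {gc_fraction(gene_prefix):.2f}")
```

Because the gene sequence is listed in coding orientation, no reverse complement is applied here even though the Strand field is "-".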