Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2209 |
Symbol | |
ID | 6969174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2105350 |
End bp | 2106393 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386100 |
Product | hypothetical protein |
Protein accession | YP_002270587 |
Protein GI | 209398826 |
COG category | [R] General function prediction only |
COG ID | [COG5529] Pyocin large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00285391 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 5.0069399999999995e-21 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCTAATG CCTGGCTCAG ATTGTGGCAT GACATGCCAA ATGACCCCAA GTGGCGAACG ATTGCCAGGG TATCAGGACA GCCAATCGCA ACAGTGATGG CAGTGTATAT CCATCTTCTG GTGAGCGCGT CACGAAATGT CACGACATGT CACGGCGTGT CACTACGTGG TCACATTGAT GTCACGACGG AAGATTTAGC AAGTGCGCTT GATGTGACGG AAGACGTAAT TGATTCAATT TTGCATGCAA TGCAGGGGCG GGTTCTGGAT GGTGACCTTA TTTCCGGATG GGAAAAACGT CAGGTGCTGA AAGAGGACAA TGGTAACGTT TCGCAAACGG CAAAATCCCC GGCAGAGCGC AAGAGAGCGC AGCGGGAGCG GGAAAAGCTG CGGAAACATA ATGCTGATTG TCACGATGAG TCACGACGTG TCACGCATCT GTCACGACAA GTCACGACAG ATAAAGATAC AGATAAAGAT ACAGATACAG AATTAAACCC CACACATAAC GCGCGCGAGA GTATTCCGAC CAGTGAGTCG AATGGTGCGC CGTTGCAGAC AGCCGAACCT GAATACCTGG ACGGCCTGAG CGAACCGATC GGGAAATTTT CGATGACTAC TGTCTGGCAG CCGTCGCCGG ATTTTCGACA ACGGGCAGCA GTGTGGGGTA TGGCTCTGCC TGAGCCGGAA TTTACACCTG CTGAGCTTGC CGCATTCCGG GATTACTGGA TGGCGGAGGG GAAGGTTTTC ACGCAGGTTC AGTGGGAGCA GAAATTTGCC CGCCACGTGC AGCACGTCAG GGCACAGGTA AAACCAGTCA GCAAGGGGGG AAGCCATGCA GCATCAGGTG GCACGGCATC ACGGGCAGTT CAGGAAATCC GGGCTGCACG CGAACAGTGG GAACGTGACA ACGGATTTAT CAGCAACGGA AACGGCCTGG AAGCTGTGGG AGCTCATGGG GGAGGTGTAT TCGAACCGCT GGACTCAGAA GAACGGGGCC GCACCTTCGA AGCTCTGGAT TGCCCAGATT GGTGCGATGA CTGA
|
Protein sequence | MANAWLRLWH DMPNDPKWRT IARVSGQPIA TVMAVYIHLL VSASRNVTTC HGVSLRGHID VTTEDLASAL DVTEDVIDSI LHAMQGRVLD GDLISGWEKR QVLKEDNGNV SQTAKSPAER KRAQREREKL RKHNADCHDE SRRVTHLSRQ VTTDKDTDKD TDTELNPTHN ARESIPTSES NGAPLQTAEP EYLDGLSEPI GKFSMTTVWQ PSPDFRQRAA VWGMALPEPE FTPAELAAFR DYWMAEGKVF TQVQWEQKFA RHVQHVRAQV KPVSKGGSHA ASGGTASRAV QEIRAAREQW ERDNGFISNG NGLEAVGAHG GGVFEPLDSE ERGRTFEALD CPDWCDD
|
| |