Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1833 |
Symbol | |
ID | 6971999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1744154 |
End bp | 1745194 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643385770 |
Product | hypothetical protein |
Protein accession | YP_002270260 |
Protein GI | 209398708 |
COG category | [R] General function prediction only |
COG ID | [COG5529] Pyocin large subunit |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000938078 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000000000000142087 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTATCG ATGCACTACG ATGGGCTAAA AAGGTGAAAA CCGGCAGTTC ATCCAGTAAG TCTGTATTGA CCTGGCTTGC TGATATGTGC GGTGCCGATT TGTGTGCATA CCCGTCTGTA TCTGCACTGG CAGAAGTAAC GGAACTGAAC AAAAAGACTG TGCAGGACAG CTTACGACAC CTGATGGAGA TTGGGTTAAT TGTTGATACC GGTGAGAGAA AAGGCAGAAC AAAGCAAATT GTGGTGTACC GACTTATCGG TGTAGAAGAA AGTGTTGCCG AGCCTGAATA CACCCAAAAA CGGGTGTCTT TAAAGGTGGG TAAAATTGGT GCTGTTAATA AAAACAGTAC CGAAAACGGT TATGTTTCAG CACAAAAGAG CCCCAAAAAC GGAACTCTTT GTTGCATGGA AAATAACCAA AGACACCCAA ATTTTCCATC AAAGACACCC AAAAACGGAT CACGGAACCC AAAGGAACCC AAAGATCTAA ACCCCACACA TAACGCACGC GAGAGTGCTC CGACCAGTGA GCAGGAAGTT TTGTCGTTAC AGGCAGCCCC CCATGTATTC CTGGATGGCC TGAGCGAACC CATCGGAAAA TTTCCGATGA CCGATAGCTG GTATCCGTCA CGGGATTTTC GACGACGGGC TGCGTTGTGG GGGATGGCTT TGCCGGAGAC AGAATTTACA CCTGCTGAAC TTACCGCATT CCGGGACTAC TGGGCAGCGG AGGGGAAAGT GTTTACGCAG ATTCAGTGGG AGCAGAAATT CGCCCGTCAC GTAAATCACG TCAGGGCGCA GGTTAAACCA GTCAGCAAGG GGGTAAACCA TGCAGCAGCA CCAGGTGGCA CCGCATCACG GGCAGTTCAG GAAATTCGGG CAGCACGTGA GCAGTGGGAA CGTGAAAACG GATTTATCAG CGACGGAAAC GGCCTGGAAG CTGTGGGAAC TCATGGGGGA GGTTTATTCG AACCGCTGGA CCCAGAAGAA CGGGGCCGCA CCTTCGAAGC TCTGGATTGC ACAGATTGGC GCGATGACTG A
|
Protein sequence | MSIDALRWAK KVKTGSSSSK SVLTWLADMC GADLCAYPSV SALAEVTELN KKTVQDSLRH LMEIGLIVDT GERKGRTKQI VVYRLIGVEE SVAEPEYTQK RVSLKVGKIG AVNKNSTENG YVSAQKSPKN GTLCCMENNQ RHPNFPSKTP KNGSRNPKEP KDLNPTHNAR ESAPTSEQEV LSLQAAPHVF LDGLSEPIGK FPMTDSWYPS RDFRRRAALW GMALPETEFT PAELTAFRDY WAAEGKVFTQ IQWEQKFARH VNHVRAQVKP VSKGVNHAAA PGGTASRAVQ EIRAAREQWE RENGFISDGN GLEAVGTHGG GLFEPLDPEE RGRTFEALDC TDWRDD
|
| |