Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1540 |
Symbol | |
ID | 6971530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1507470 |
End bp | 1509971 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643385510 |
Product | putative phage portal protein, HK97 family |
Protein accession | YP_002270004 |
Protein GI | 209399454 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.533039 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGAACC TTTTACGGCG AACCCGAAAA AACCAGAAAT CAGGACGTGA CGTAAGAGAG GCGGGCTGGA CCAGCCTGTT TCAGGCGGTG GCTGAGCCCT TTTCCGGCGC CTGGCAGCAG GGCGTGAAAG CCGATCCTGA AGCCGTCCTC TCCTTTCATG CGGTGTTTGC ATGTATTTCG CTGATATCCC AGGATATCGC CAAAATGCGG CTGCGTCTTA TGCAGACGGA TGCGCATGGG ATACGCAGGG AAACGCGCCG GGGGGATATT GCCCGCCTCT GTCGTCGTCC CAACGCCCAG CAGAACCGCA TCCAGTTTTT TGAACTGTGG CTGAACGCCA AACTGCGTCA TGGCAATACG GTGGTGCTGA AAATCCGTAA TGCCCGGGGG CAGATCAAAG AACTGCGTAT TCTGGACTGG AGCCGGGTTG AACCTCTGGT GGCGGATGAC GGCGAGGTGT TCTACCGCAT CACGCCGGAC CGGAACTGCG GGATCACGGA GGCGGTGACG GTGCCTGCCC GGGAAGTGAT CCACGACCGG TTTAACTGTT TTTTTCATCC GCTTATAGGA TTGCCGCCGG TGTATGCCGC CGGGCTGGCG GCCACGCAGG GGCATCATAT TCAGGAAAAT TCGACGTCTT TTTTCAGAAA TGGCGGCAGG CCGTCCGGGG TGATTGAGAT CCCCGGCAGT ATTACGGAAG AAAATGCGAA AAAACTGAAG AGCAACTGGG ACAGCGGGTA TACAGGCGAA AATGCGGGGA AAACGGCCAT TCTGAGCAAC GGGGCAAAAT ACAACCCCAC GACGTTTTCA CCGGTGGATG CGCAGACGGT GGAACAACTG AAGATGACCG CTGAAATTGT CTGTTCGGTG TTCCGTGTCC CGGCCTACAA GATTGGCGTG GGACAACCGC CTTCCAGTGA CAACGTGGAG GCGCTGGAGC AGCAGTATTA TTCCCAGTGC CTGCAGACGC TGATTGAGTC CATTGAACTG TTACTGGATG AGGCGCTGGA AACGGGGGAA AACGAGAGTA CAGAATTTGA TGTCACCACG CTGCTGAGAA TGGACAGTGA GCGGCGCATG AAAACGCTGG GGGATGCGGT GAAAAATACG CTTCTCACGC CCAATGAGGC CCGTAAACGG GAGAACCTGC CGCCCCTGGC CGGCGGTGAT GCACTGTATC TTCAGCAGCA GAACTACAGT CTGGAGGCGC TGTCCCGTCG TGATGCCCGT GAGGATCCGT TCGCGTCTGC CGGGAAAACA GTTTCATCAC AGCTGCCTGA CGGCGCATCT GACGGTAATA AGGCAATCAG TGAAACAGAG CATGATGCGG TGAAAGCGAT GTTCAGGGGG GATACTGAGA AAATGACGGA ACGGGAACTG TCCATTATTC GTGCACTGGG AGAAGAATTT TCCACAGTGC TGGCGGATTT ACAGCGCACA TTTGAGGGGA AGATGGCCTC GCAGGCACAA GCGTTTGAAG AGAAACTGAC TTCCCTGTCG GCGGTATTAC AGAAGCATGT GACGGTGGAT GAGGTGCGTC CGGTTCTGCA GGCGATGGTG GATGACGCTG TGGGGGCCAT TCCGGTACCG CGTGATGGTC GTGATTATGA TCCGGATGTA CTGCAGCAGG CGGTGAATGA TGCGGTCGCA AATATTCCGG TACCGGCAGA CGGCAAAAGT ATCACCCCGG ATGATGTGCG TCCGATGCTT GAGCAGATGG TGAAAGAGGC AGTGAGCCAT ATTCCTGTTC CGCGCGACGG TCGTGACTAC GATCCGGATG TTCTGCAGAA GGCGGTGAAT GATGCGGTCG CGAAAATACC GGTACCGGCA GACGGTAAAA GTATCACTCC GGATGATGTG CATCCGATGC TTGAACAGAT GGTGAAGGAG GCGGTAAGCC ATATTCCTGT TCCGCGTGAT GGTCGTGACT ACGATCCGGA TGTTCTGCAG AAGGCGGTGA ATGATGCGGT CGCGAAAATA CCGGTACCGG CAGACGGTAA AAGTATCACT CCGGATGATG TGCATCCGAT GCTTGAACAG ATGGTGAAGG AGGCGGTAAG CCATATTCCT GTTCCGCGTG ATGGTCGTGA CTACGATCCG GATGTTCTGC AGAAGGCGGT TCTGGAGGCG GTGAGTGCCC TGCCGGCTCC GCAGGACGGG CGTGATGCCA CGGCACTGGA AATACTCCCC GCCATTGACG ATCAAAAATC CTTTCCCCGG GGCTCGTATG CCACACACCA GGGTGGACTC TGGCGGGCGT ATGAAAAAAC GTACGGGATG CGGGGATGGG AATGCCTGGT TGACGGGGTG GCGGATATTG ACGTCAGCAT GACGGGTGAA CGGTCGTTCT CTGTGGTGGT CCGGCAGAGC AGTGGCCAGC GTACGGAAAA AACATTTTCC CTGCCGGTGA TGCTCTACCG TGGTGTGTTC AGAATCGGCG AAACTTACCA CCCCGGCGAT ACGGTGACGT GGGGGGCTCG TTGTGGCACT GCAACAGTAT GA
|
Protein sequence | MWNLLRRTRK NQKSGRDVRE AGWTSLFQAV AEPFSGAWQQ GVKADPEAVL SFHAVFACIS LISQDIAKMR LRLMQTDAHG IRRETRRGDI ARLCRRPNAQ QNRIQFFELW LNAKLRHGNT VVLKIRNARG QIKELRILDW SRVEPLVADD GEVFYRITPD RNCGITEAVT VPAREVIHDR FNCFFHPLIG LPPVYAAGLA ATQGHHIQEN STSFFRNGGR PSGVIEIPGS ITEENAKKLK SNWDSGYTGE NAGKTAILSN GAKYNPTTFS PVDAQTVEQL KMTAEIVCSV FRVPAYKIGV GQPPSSDNVE ALEQQYYSQC LQTLIESIEL LLDEALETGE NESTEFDVTT LLRMDSERRM KTLGDAVKNT LLTPNEARKR ENLPPLAGGD ALYLQQQNYS LEALSRRDAR EDPFASAGKT VSSQLPDGAS DGNKAISETE HDAVKAMFRG DTEKMTEREL SIIRALGEEF STVLADLQRT FEGKMASQAQ AFEEKLTSLS AVLQKHVTVD EVRPVLQAMV DDAVGAIPVP RDGRDYDPDV LQQAVNDAVA NIPVPADGKS ITPDDVRPML EQMVKEAVSH IPVPRDGRDY DPDVLQKAVN DAVAKIPVPA DGKSITPDDV HPMLEQMVKE AVSHIPVPRD GRDYDPDVLQ KAVNDAVAKI PVPADGKSIT PDDVHPMLEQ MVKEAVSHIP VPRDGRDYDP DVLQKAVLEA VSALPAPQDG RDATALEILP AIDDQKSFPR GSYATHQGGL WRAYEKTYGM RGWECLVDGV ADIDVSMTGE RSFSVVVRQS SGQRTEKTFS LPVMLYRGVF RIGETYHPGD TVTWGARCGT ATV
|
| |