Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2245 |
Symbol | |
ID | 6970353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2131181 |
End bp | 2133760 |
Gene Length | 2580 bp |
Protein Length | 859 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643386130 |
Product | putative phage portal protein, HK97 family |
Protein accession | YP_002270617 |
Protein GI | 209399350 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0000140812 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTGGAACC TTTTACGGCG AACCCGAAAA AACCAGAAAT CAGGACGTGA CGTAAGAGAG GCGGGCTGGA CCAGCCTGTT TCAGGCGGTG GCTGAGCCCT TTTCCGGCGC CTGGCAGCAG GGCGTGAAAG CCGATCCTGA AGCCGTCCTC TCCTTTCATG CGGTGTTTGC ATGTATTTCG CTGATATCCC AGGATATCGC CAAAATGCGG CTGCGTCTTA TGCAGACGGA TGCGCATGGG ATACGCAGGG AAACGCGCCG GGGGGATATT GCCCGCCTCT GTCGTCGTCC CAACGCCCAG CAGAACCGCA TCCAGTTTTT TGAACTGTGG CTGAACGCCA AACTGCGTCA TGGCAATACG GTGGTGCTGA AAATCCGTAA TGCCCGGGGG CAGATCAAAG AACTGCGTAT TCTGGACTGG AGCCGGGTTG AACCTCTGGT GGCGGATGAC GGCGAGGTGT TCTACCGCAT CACGCCGGAC CGGAACTGCG GGATCACGGA GGCGGTGACG GTGCCTGCCC GGGAAGTGAT CCACGACCGG TTTAACTGTT TTTTTCATCC GCTTATAGGA TTGCCGCCGG TGTATGCCGC CGGGCTGGCG GCCACGCAGG GGCATCATAT TCAGGAAAAT TCGACGTCTT TTTTCAGAAA TGGCGGCAGG CCGTCCGGGG TGATTGAGAT CCCCGGCAGT ATTACGGAAG AAAATGCGAA AAAACTGAAG AGCAACTGGG ACAGCGGGTA TACAGGCGAA AATGCGGGGA AAACGGCCAT TCTGAGCAAC GGGGCAAAAT ACAACCCCAC GACGTTTTCA CCGGTGGATG CGCAGACGGT GGAACAACTG AAGATGACCG CTGAAATTGT CTGTTCGGTG TTCCGTGTCC CGGCCTACAA GATTGGCGTG GGACAACCGC CTTCCAGTGA CAACGTGGAG GCGCTGGAGC AGCAGTATTA TTCCCAGTGC CTGCAGACGC TGATTGAGTC CATTGAACTG TTACTGGATG AGGCGCTGGA AACGGGGGAA AACGAGAGTA CAGAATTTGA TGTCACCACG CTGCTGAGAA TGGACAGTGA GCGGCGCATG AAAACGCTGG GGGATGCGGT GAAAAATACG CTTCTCACGC CCAATGAGGC CCGTAAACGG GAGAACCTGC CGCCCCTGGC CGGCGGTGAT GCACTGTATC TTCAGCAGCA GAACTACAGT CTGGAGGCGC TGTCCCGTCG TGATGCCCGT GAGGATCCGT TCGCGTCTGC CGGGAAAACA GTTTCATCAC AGCTGCCTGA CGGCGCATCT GACGGTAATA AGGCAATCAG TGAAACAGAG CATGATGCGG TGAAAGCGAT GTTCAGGGGG GATACTGAGA AAATGACGGA ACGGGAACTG TCCATTATTC GTGCACTGGG AGAAGAATTT TCCACAGTGC TGGCGGATTT ACAGCGCACA TTTGAGGGGA AGATGGCCTC GCAGGCACAA GCGTTTGAAG AGAAACTGAC TTCCCTGTCG GCGGTATTAC AGAAGCATGT GACGGTGGAT GAGGTGCGTC CGGTTCTGCA GGCGATGGTG GATGACGCTG TGGGGGCCAT TCCGGTACCG CGTGATGGTC GTGATTATGA TCCGGATGTA CTGCAGCAGG CGGTGAATGA TGCGGTCGCA AATATTCCGC AGCCGGCGGA CGGTAAAAGT CTCACCCCGG ATGATGTGCG TCCGATGCTT GAACAGATGG TGAAGGAGGC TGTAAGCCAT ATCCCTGTTC CGCGTGATGG TCGTGACTAC GATCCGGAAG TACTGCAGAA GGCGGTGAAT GATGCGGTCG CAAATATTCC GCAGCCGGCG GACGGTAAAA GTCTCACCCC GGATGATGTG CGTCCGATGC TTGAACAGAT GGTGAAGGAG GCGGTAAGCC ATATCCCTGT TCCGCGCGAC GGTCGTGACT ATGATCCGGA AGTACTGCAG AAGGCGGTGA ATGATGCGGT CGCAAATATT CCGCAGCCGG CAGACGGTAA AAGTCTCACC CCGGATGATG TGCGTCCGAT GCTTGAACAG ATGGTGAAGG AGGCGGTAAG CCATATTCCT GTTCCGCGTG ATGGTCGTGA CTACGATCCG GATGTTCTGC AGAAGGCGGT TCTGGATGCG GTGAGTGCCC TGCCGGCTCC GCAGGACGGG CGTGATGCCA CGGCACTGGA AATACTCCCC GCCATTGACG ATCAAAAATC CTTTCCCCGG GGCACGTATG CCACACACCA GGGCGGACTC TGGCGGGCGT ATGAAAAAAC GCACGGGATG CGGGGATGGG AATGCCTGGT TGACGGGGTG GCGGATATTG ACGTCAGCAT GACGGGTGAG CGGTTGTTCT CTGTGGTGGT CCGGCAGAGC AGTGGCCAGC GTACGGAAAA AACATTTTCC CTGCCGGTGA TGCTCTACCG CGGTGTGTTC AGAGCCGGTG AAACCTACCA CCCCGGCGAT ACGGTGACGT GGGGGGGCTC GCTGTGGCAC TGCAACAGTA TGACCGAAGA TAAACCCGGA GAAGCTCATT CATCAGCCTG GACCCTGGCT GCAAAACGTG GGCGGGATGC AGGAGGCTGA
|
Protein sequence | MWNLLRRTRK NQKSGRDVRE AGWTSLFQAV AEPFSGAWQQ GVKADPEAVL SFHAVFACIS LISQDIAKMR LRLMQTDAHG IRRETRRGDI ARLCRRPNAQ QNRIQFFELW LNAKLRHGNT VVLKIRNARG QIKELRILDW SRVEPLVADD GEVFYRITPD RNCGITEAVT VPAREVIHDR FNCFFHPLIG LPPVYAAGLA ATQGHHIQEN STSFFRNGGR PSGVIEIPGS ITEENAKKLK SNWDSGYTGE NAGKTAILSN GAKYNPTTFS PVDAQTVEQL KMTAEIVCSV FRVPAYKIGV GQPPSSDNVE ALEQQYYSQC LQTLIESIEL LLDEALETGE NESTEFDVTT LLRMDSERRM KTLGDAVKNT LLTPNEARKR ENLPPLAGGD ALYLQQQNYS LEALSRRDAR EDPFASAGKT VSSQLPDGAS DGNKAISETE HDAVKAMFRG DTEKMTEREL SIIRALGEEF STVLADLQRT FEGKMASQAQ AFEEKLTSLS AVLQKHVTVD EVRPVLQAMV DDAVGAIPVP RDGRDYDPDV LQQAVNDAVA NIPQPADGKS LTPDDVRPML EQMVKEAVSH IPVPRDGRDY DPEVLQKAVN DAVANIPQPA DGKSLTPDDV RPMLEQMVKE AVSHIPVPRD GRDYDPEVLQ KAVNDAVANI PQPADGKSLT PDDVRPMLEQ MVKEAVSHIP VPRDGRDYDP DVLQKAVLDA VSALPAPQDG RDATALEILP AIDDQKSFPR GTYATHQGGL WRAYEKTHGM RGWECLVDGV ADIDVSMTGE RLFSVVVRQS SGQRTEKTFS LPVMLYRGVF RAGETYHPGD TVTWGGSLWH CNSMTEDKPG EAHSSAWTLA AKRGRDAGG
|
| |