Gene ECH74115_1540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1540 
Symbol 
ID6971530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1507470 
End bp1509971 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content56% 
IMG OID643385510 
Productputative phage portal protein, HK97 family 
Protein accessionYP_002270004 
Protein GI209399454 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.533039 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAACC TTTTACGGCG AACCCGAAAA AACCAGAAAT CAGGACGTGA CGTAAGAGAG 
GCGGGCTGGA CCAGCCTGTT TCAGGCGGTG GCTGAGCCCT TTTCCGGCGC CTGGCAGCAG
GGCGTGAAAG CCGATCCTGA AGCCGTCCTC TCCTTTCATG CGGTGTTTGC ATGTATTTCG
CTGATATCCC AGGATATCGC CAAAATGCGG CTGCGTCTTA TGCAGACGGA TGCGCATGGG
ATACGCAGGG AAACGCGCCG GGGGGATATT GCCCGCCTCT GTCGTCGTCC CAACGCCCAG
CAGAACCGCA TCCAGTTTTT TGAACTGTGG CTGAACGCCA AACTGCGTCA TGGCAATACG
GTGGTGCTGA AAATCCGTAA TGCCCGGGGG CAGATCAAAG AACTGCGTAT TCTGGACTGG
AGCCGGGTTG AACCTCTGGT GGCGGATGAC GGCGAGGTGT TCTACCGCAT CACGCCGGAC
CGGAACTGCG GGATCACGGA GGCGGTGACG GTGCCTGCCC GGGAAGTGAT CCACGACCGG
TTTAACTGTT TTTTTCATCC GCTTATAGGA TTGCCGCCGG TGTATGCCGC CGGGCTGGCG
GCCACGCAGG GGCATCATAT TCAGGAAAAT TCGACGTCTT TTTTCAGAAA TGGCGGCAGG
CCGTCCGGGG TGATTGAGAT CCCCGGCAGT ATTACGGAAG AAAATGCGAA AAAACTGAAG
AGCAACTGGG ACAGCGGGTA TACAGGCGAA AATGCGGGGA AAACGGCCAT TCTGAGCAAC
GGGGCAAAAT ACAACCCCAC GACGTTTTCA CCGGTGGATG CGCAGACGGT GGAACAACTG
AAGATGACCG CTGAAATTGT CTGTTCGGTG TTCCGTGTCC CGGCCTACAA GATTGGCGTG
GGACAACCGC CTTCCAGTGA CAACGTGGAG GCGCTGGAGC AGCAGTATTA TTCCCAGTGC
CTGCAGACGC TGATTGAGTC CATTGAACTG TTACTGGATG AGGCGCTGGA AACGGGGGAA
AACGAGAGTA CAGAATTTGA TGTCACCACG CTGCTGAGAA TGGACAGTGA GCGGCGCATG
AAAACGCTGG GGGATGCGGT GAAAAATACG CTTCTCACGC CCAATGAGGC CCGTAAACGG
GAGAACCTGC CGCCCCTGGC CGGCGGTGAT GCACTGTATC TTCAGCAGCA GAACTACAGT
CTGGAGGCGC TGTCCCGTCG TGATGCCCGT GAGGATCCGT TCGCGTCTGC CGGGAAAACA
GTTTCATCAC AGCTGCCTGA CGGCGCATCT GACGGTAATA AGGCAATCAG TGAAACAGAG
CATGATGCGG TGAAAGCGAT GTTCAGGGGG GATACTGAGA AAATGACGGA ACGGGAACTG
TCCATTATTC GTGCACTGGG AGAAGAATTT TCCACAGTGC TGGCGGATTT ACAGCGCACA
TTTGAGGGGA AGATGGCCTC GCAGGCACAA GCGTTTGAAG AGAAACTGAC TTCCCTGTCG
GCGGTATTAC AGAAGCATGT GACGGTGGAT GAGGTGCGTC CGGTTCTGCA GGCGATGGTG
GATGACGCTG TGGGGGCCAT TCCGGTACCG CGTGATGGTC GTGATTATGA TCCGGATGTA
CTGCAGCAGG CGGTGAATGA TGCGGTCGCA AATATTCCGG TACCGGCAGA CGGCAAAAGT
ATCACCCCGG ATGATGTGCG TCCGATGCTT GAGCAGATGG TGAAAGAGGC AGTGAGCCAT
ATTCCTGTTC CGCGCGACGG TCGTGACTAC GATCCGGATG TTCTGCAGAA GGCGGTGAAT
GATGCGGTCG CGAAAATACC GGTACCGGCA GACGGTAAAA GTATCACTCC GGATGATGTG
CATCCGATGC TTGAACAGAT GGTGAAGGAG GCGGTAAGCC ATATTCCTGT TCCGCGTGAT
GGTCGTGACT ACGATCCGGA TGTTCTGCAG AAGGCGGTGA ATGATGCGGT CGCGAAAATA
CCGGTACCGG CAGACGGTAA AAGTATCACT CCGGATGATG TGCATCCGAT GCTTGAACAG
ATGGTGAAGG AGGCGGTAAG CCATATTCCT GTTCCGCGTG ATGGTCGTGA CTACGATCCG
GATGTTCTGC AGAAGGCGGT TCTGGAGGCG GTGAGTGCCC TGCCGGCTCC GCAGGACGGG
CGTGATGCCA CGGCACTGGA AATACTCCCC GCCATTGACG ATCAAAAATC CTTTCCCCGG
GGCTCGTATG CCACACACCA GGGTGGACTC TGGCGGGCGT ATGAAAAAAC GTACGGGATG
CGGGGATGGG AATGCCTGGT TGACGGGGTG GCGGATATTG ACGTCAGCAT GACGGGTGAA
CGGTCGTTCT CTGTGGTGGT CCGGCAGAGC AGTGGCCAGC GTACGGAAAA AACATTTTCC
CTGCCGGTGA TGCTCTACCG TGGTGTGTTC AGAATCGGCG AAACTTACCA CCCCGGCGAT
ACGGTGACGT GGGGGGCTCG TTGTGGCACT GCAACAGTAT GA
 
Protein sequence
MWNLLRRTRK NQKSGRDVRE AGWTSLFQAV AEPFSGAWQQ GVKADPEAVL SFHAVFACIS 
LISQDIAKMR LRLMQTDAHG IRRETRRGDI ARLCRRPNAQ QNRIQFFELW LNAKLRHGNT
VVLKIRNARG QIKELRILDW SRVEPLVADD GEVFYRITPD RNCGITEAVT VPAREVIHDR
FNCFFHPLIG LPPVYAAGLA ATQGHHIQEN STSFFRNGGR PSGVIEIPGS ITEENAKKLK
SNWDSGYTGE NAGKTAILSN GAKYNPTTFS PVDAQTVEQL KMTAEIVCSV FRVPAYKIGV
GQPPSSDNVE ALEQQYYSQC LQTLIESIEL LLDEALETGE NESTEFDVTT LLRMDSERRM
KTLGDAVKNT LLTPNEARKR ENLPPLAGGD ALYLQQQNYS LEALSRRDAR EDPFASAGKT
VSSQLPDGAS DGNKAISETE HDAVKAMFRG DTEKMTEREL SIIRALGEEF STVLADLQRT
FEGKMASQAQ AFEEKLTSLS AVLQKHVTVD EVRPVLQAMV DDAVGAIPVP RDGRDYDPDV
LQQAVNDAVA NIPVPADGKS ITPDDVRPML EQMVKEAVSH IPVPRDGRDY DPDVLQKAVN
DAVAKIPVPA DGKSITPDDV HPMLEQMVKE AVSHIPVPRD GRDYDPDVLQ KAVNDAVAKI
PVPADGKSIT PDDVHPMLEQ MVKEAVSHIP VPRDGRDYDP DVLQKAVLEA VSALPAPQDG
RDATALEILP AIDDQKSFPR GSYATHQGGL WRAYEKTYGM RGWECLVDGV ADIDVSMTGE
RSFSVVVRQS SGQRTEKTFS LPVMLYRGVF RIGETYHPGD TVTWGARCGT ATV