Gene ECH74115_2245 details

Gene Information

Locus tag: ECH74115_2245
Symbol:
ID: 6970353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2131181
End bp: 2133760
Gene Length: 2580 bp
Protein Length: 859 aa
Translation table: 11
GC content: 57%
IMG OID: 643386130
Product: putative phage portal protein, HK97 family
Protein accession: YP_002270617
Protein GI: 209399350
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
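
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(2131181, 2133760)
print(length)                           # 2580
print(expected_protein_length(length))  # 859
```

Under these assumptions both results agree with the listed Gene Length (2580 bp) and Protein Length (859 aa).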


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0000140812
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTGGAACC TTTTACGGCG AACCCGAAAA AACCAGAAAT CAGGACGTGA CGTAAGAGAG 
GCGGGCTGGA CCAGCCTGTT TCAGGCGGTG GCTGAGCCCT TTTCCGGCGC CTGGCAGCAG
GGCGTGAAAG CCGATCCTGA AGCCGTCCTC TCCTTTCATG CGGTGTTTGC ATGTATTTCG
CTGATATCCC AGGATATCGC CAAAATGCGG CTGCGTCTTA TGCAGACGGA TGCGCATGGG
ATACGCAGGG AAACGCGCCG GGGGGATATT GCCCGCCTCT GTCGTCGTCC CAACGCCCAG
CAGAACCGCA TCCAGTTTTT TGAACTGTGG CTGAACGCCA AACTGCGTCA TGGCAATACG
GTGGTGCTGA AAATCCGTAA TGCCCGGGGG CAGATCAAAG AACTGCGTAT TCTGGACTGG
AGCCGGGTTG AACCTCTGGT GGCGGATGAC GGCGAGGTGT TCTACCGCAT CACGCCGGAC
CGGAACTGCG GGATCACGGA GGCGGTGACG GTGCCTGCCC GGGAAGTGAT CCACGACCGG
TTTAACTGTT TTTTTCATCC GCTTATAGGA TTGCCGCCGG TGTATGCCGC CGGGCTGGCG
GCCACGCAGG GGCATCATAT TCAGGAAAAT TCGACGTCTT TTTTCAGAAA TGGCGGCAGG
CCGTCCGGGG TGATTGAGAT CCCCGGCAGT ATTACGGAAG AAAATGCGAA AAAACTGAAG
AGCAACTGGG ACAGCGGGTA TACAGGCGAA AATGCGGGGA AAACGGCCAT TCTGAGCAAC
GGGGCAAAAT ACAACCCCAC GACGTTTTCA CCGGTGGATG CGCAGACGGT GGAACAACTG
AAGATGACCG CTGAAATTGT CTGTTCGGTG TTCCGTGTCC CGGCCTACAA GATTGGCGTG
GGACAACCGC CTTCCAGTGA CAACGTGGAG GCGCTGGAGC AGCAGTATTA TTCCCAGTGC
CTGCAGACGC TGATTGAGTC CATTGAACTG TTACTGGATG AGGCGCTGGA AACGGGGGAA
AACGAGAGTA CAGAATTTGA TGTCACCACG CTGCTGAGAA TGGACAGTGA GCGGCGCATG
AAAACGCTGG GGGATGCGGT GAAAAATACG CTTCTCACGC CCAATGAGGC CCGTAAACGG
GAGAACCTGC CGCCCCTGGC CGGCGGTGAT GCACTGTATC TTCAGCAGCA GAACTACAGT
CTGGAGGCGC TGTCCCGTCG TGATGCCCGT GAGGATCCGT TCGCGTCTGC CGGGAAAACA
GTTTCATCAC AGCTGCCTGA CGGCGCATCT GACGGTAATA AGGCAATCAG TGAAACAGAG
CATGATGCGG TGAAAGCGAT GTTCAGGGGG GATACTGAGA AAATGACGGA ACGGGAACTG
TCCATTATTC GTGCACTGGG AGAAGAATTT TCCACAGTGC TGGCGGATTT ACAGCGCACA
TTTGAGGGGA AGATGGCCTC GCAGGCACAA GCGTTTGAAG AGAAACTGAC TTCCCTGTCG
GCGGTATTAC AGAAGCATGT GACGGTGGAT GAGGTGCGTC CGGTTCTGCA GGCGATGGTG
GATGACGCTG TGGGGGCCAT TCCGGTACCG CGTGATGGTC GTGATTATGA TCCGGATGTA
CTGCAGCAGG CGGTGAATGA TGCGGTCGCA AATATTCCGC AGCCGGCGGA CGGTAAAAGT
CTCACCCCGG ATGATGTGCG TCCGATGCTT GAACAGATGG TGAAGGAGGC TGTAAGCCAT
ATCCCTGTTC CGCGTGATGG TCGTGACTAC GATCCGGAAG TACTGCAGAA GGCGGTGAAT
GATGCGGTCG CAAATATTCC GCAGCCGGCG GACGGTAAAA GTCTCACCCC GGATGATGTG
CGTCCGATGC TTGAACAGAT GGTGAAGGAG GCGGTAAGCC ATATCCCTGT TCCGCGCGAC
GGTCGTGACT ATGATCCGGA AGTACTGCAG AAGGCGGTGA ATGATGCGGT CGCAAATATT
CCGCAGCCGG CAGACGGTAA AAGTCTCACC CCGGATGATG TGCGTCCGAT GCTTGAACAG
ATGGTGAAGG AGGCGGTAAG CCATATTCCT GTTCCGCGTG ATGGTCGTGA CTACGATCCG
GATGTTCTGC AGAAGGCGGT TCTGGATGCG GTGAGTGCCC TGCCGGCTCC GCAGGACGGG
CGTGATGCCA CGGCACTGGA AATACTCCCC GCCATTGACG ATCAAAAATC CTTTCCCCGG
GGCACGTATG CCACACACCA GGGCGGACTC TGGCGGGCGT ATGAAAAAAC GCACGGGATG
CGGGGATGGG AATGCCTGGT TGACGGGGTG GCGGATATTG ACGTCAGCAT GACGGGTGAG
CGGTTGTTCT CTGTGGTGGT CCGGCAGAGC AGTGGCCAGC GTACGGAAAA AACATTTTCC
CTGCCGGTGA TGCTCTACCG CGGTGTGTTC AGAGCCGGTG AAACCTACCA CCCCGGCGAT
ACGGTGACGT GGGGGGGCTC GCTGTGGCAC TGCAACAGTA TGACCGAAGA TAAACCCGGA
GAAGCTCATT CATCAGCCTG GACCCTGGCT GCAAAACGTG GGCGGGATGC AGGAGGCTGA
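
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and how a GC percentage like the one in the GC content field could be computed. The helper names are hypothetical, and the record does not state the exact rounding it uses.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the GC content field of the record.
first_line = "ATGTGGAACC TTTTACGGCG AACCCGAAAA AACCAGAAAT CAGGACGTGA CGTAAGAGAG"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```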
 
Protein sequence
MWNLLRRTRK NQKSGRDVRE AGWTSLFQAV AEPFSGAWQQ GVKADPEAVL SFHAVFACIS 
LISQDIAKMR LRLMQTDAHG IRRETRRGDI ARLCRRPNAQ QNRIQFFELW LNAKLRHGNT
VVLKIRNARG QIKELRILDW SRVEPLVADD GEVFYRITPD RNCGITEAVT VPAREVIHDR
FNCFFHPLIG LPPVYAAGLA ATQGHHIQEN STSFFRNGGR PSGVIEIPGS ITEENAKKLK
SNWDSGYTGE NAGKTAILSN GAKYNPTTFS PVDAQTVEQL KMTAEIVCSV FRVPAYKIGV
GQPPSSDNVE ALEQQYYSQC LQTLIESIEL LLDEALETGE NESTEFDVTT LLRMDSERRM
KTLGDAVKNT LLTPNEARKR ENLPPLAGGD ALYLQQQNYS LEALSRRDAR EDPFASAGKT
VSSQLPDGAS DGNKAISETE HDAVKAMFRG DTEKMTEREL SIIRALGEEF STVLADLQRT
FEGKMASQAQ AFEEKLTSLS AVLQKHVTVD EVRPVLQAMV DDAVGAIPVP RDGRDYDPDV
LQQAVNDAVA NIPQPADGKS LTPDDVRPML EQMVKEAVSH IPVPRDGRDY DPEVLQKAVN
DAVANIPQPA DGKSLTPDDV RPMLEQMVKE AVSHIPVPRD GRDYDPEVLQ KAVNDAVANI
PQPADGKSLT PDDVRPMLEQ MVKEAVSHIP VPRDGRDYDP DVLQKAVLDA VSALPAPQDG
RDATALEILP AIDDQKSFPR GTYATHQGGL WRAYEKTHGM RGWECLVDGV ADIDVSMTGE
RLFSVVVRQS SGQRTEKTFS LPVMLYRGVF RAGETYHPGD TVTWGGSLWH CNSMTEDKPG
EAHSSAWTLA AKRGRDAGG
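
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below builds the codon table by hand. It relies on the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted. The function name `translate` is a hypothetical helper, not a library call.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering; '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTGGAACC TTTTACGGCG AACCCGAAAA"))  # MWNLLRRTRK
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GCAGGAGGCTGA"))                      # AGG
```

Both fragments match the corresponding ends of the protein sequence, which begins with MWNLLRRTRK and ends with AGG.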