Gene ECH74115_1633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1633 
Symbol 
ID6969925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1577105 
End bp1579654 
Gene Length2550 bp 
Protein Length849 aa 
Translation table11 
GC content58% 
IMG OID643385593 
Productputative prophage tail length tape measure protein 
Protein accessionYP_002270087 
Protein GI209400998 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.198533 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAC CGGTAGGCGA TCTGGTCGTT GATTTAAGTC TGGATGCGGC CAGATTTGAC 
GAGCAGATGG CCAGAGTCAG GCGTCATTTT TCCGGTACGG AAAGTGATGC GAAAAAAACA
GCGGCAGTCG TTGAACAGTC GATGAGCCGG CAGGCGCTGG CTGCACAGAA AGCGGGAATT
TCCGTCGGGC AGTATAAAGC CGCCATGCGT ATGCTGCCTG CGCAGTTCAC TGACGTGGCC
ACGCAGCTTG CAGGTGGGCA AAGTCCGTGG CTGATCCTGC TGCAACAGGG GGGGCAGGTG
AAGGACTCCT TCGGCGGGAT GATCCCCATG TTCCGGGGGC TTGCCGGTGC GATCACCCTG
CCGATGGTGG GGGCCACCTC GCTGGCGGTG GCGACCGGTG CGCTGGCGTA TGCCTGGTAT
CAGGGCAACT CAACCCTGTC CGATTTCAAC AAAACGCTGG TCCTTTCCGG CAATCAGGCC
GGGCTGACGG CAGATCGTAT GCTGGTCCTG TCCAGAGCCG GGCAGGCGGC AGGGCTGACG
TTTAACCAGA CCAGCGAGTC ACTCAGCGCA CTGGTTAAGG CGGGGGTAAG CGGTGAGGCT
CAGATTGCGT CCATCAGCCA GAGTGTGGCG CGTTTCTCCT CTGCATCAGG CGTGGAGGTG
GACAAGGTCG CTGAAGCCTT CGGGAAGCTG ACCACTGACC CGACGTCGGG GCTGACAGCG
ATGGCGCGCC AGTTCCATAA CGTGACGGCG GAGCAGATTG CGTATGTTGC TCAGTTGCAG
CGTTCCGGCG AAGAAGCCGG GGCATTGCAG GCGGCGAACG AGGCCGCGAC GAAAGGGTTT
GATGACCAGA CCCGCCGCCT GAAAGAGAAC ATGGGCACGC TGGAAACCTG GGCAGACAGG
ACAGCACGGG CATTCAAATC CATGTGGGAT GCGGTGCTGG ATATTGGTCG TCCTGATACC
GCGCAGGAGA TGCTGATTAA GGCAGAGGCC GCGTTTAAGA AAGCGGACGA TATCTGGAAT
CTGCGCAAGG ATGATTATTT TGTTAACGAT GAAGCGCGGG CGCGTTACTG GGATGATCGT
GAAAAGGCCC GTCTTGCGCT TGAAGCCGCC CGAAAGAAGG CTGAGCAGCA GAGTCAACAG
GACAAAAATG CGCAGCAACA GAGCGATACC GAAGCGTCAC GGCTGAAATA TACCGAAGAG
GCGCAGAAGG CTTACGAACG CCTGCAGACG CCGCTGGAGA AATATACCGC CCGTCAGGAA
GAACTGAATA AGGCACTGAA AGACGGGAAA ATCCTGCAGG CAGATTACAA CACGCTGATG
GCGGCGGCGA AAAAGGACTA TGAAGCGACG CTGAAAAAGC CGAAACAGTC CGGCGTGAAG
GTGTCTGCGG GCGATCGTCA GGAAGACAGT GCTCATGCTG CCCTGCTGAC GCTTCAGGCA
GAACTCCGGA TGCTGGAGAA GCATGCCGGA GCGAATGAGA AAATCAGCCA GCAGCGCCGG
GATTTGTGGA AGGCAGAAAG TCAGTTCGCG GTACTGGAGG AGGCGGCACA ACGTCGCCAG
CTGTCCGCAC AGGAGAAATC CCTGCTGGCG CATAAAGACG AGACGCTGGA GTACAAACGC
CAGCTGGCTG TACTTGGCGA CAAGGTCACC TATCAGGAGC ACCTGAACGC GCTGGCACAG
CAGGCGGATA AATTCGCACA GCAACAACGG GCAAAACGGG CCGCCATTGA TGCAAAAAAC
CGGGGGCTGA CTGACCGGCA GGCAGCGCGG GAAGCCACGG AACAGCGCTT GAAGGAACAG
TATGGCGATA ATCCTCTGGC GCTGAATAAC GTCATGTCAG AGCAGAAAAA GACCTGGGCA
GCTGAAGACC AGCTTCGCGG GAGCTGGATG GCAGGCCTCA GGTCAGGCTG GAGTGAGTGG
GAAGAGAGCG CCACGGACAG TATGTCGCAG GTTAAAAGTG CTGCCACGCA GACCTTTGAT
GGTATTGCAC AGAATATGGC GGCGATGCTG ACCGGCAGTG AACAGAACTG GCGCAGCTTC
ACCCGTTCCG TGCTGTCCAT GATGACAGAA ATTCTGCTTA AGCAGGCAAT GGTGGGGATT
GTCGGAAGTA TAGGCAGCGC CATTGGCGGC GGCGCATCAG CGTCAGGCGG TACAGCCATT
CAGGCAGCTG CGGCGAAATT CCATTTTGCG GCCGGAGGGT TTACGGGAAC CGGCGGCAAA
TATGAGCCAG CGGGGATTGT TCACCGTGGT GAATTTGTCT TCACGAAGGA GGCAACCAGC
CGGATTGGCG TAGGAAATCT TTACCGGCTG ATGCGCGGCT ATGCCACCGG CGGTTATGTC
GGTACACCGG GCAGCATGGC GGACAGCCGG TCGCAGGCGT CCGGGACGTT TGAGCAGAAT
AACCATGTGG TGATTAACAA CGACGGCACG AACGGTCAGA TAGGGCCACA GGCACTGAAG
GCGGTTTATG ACGTAGCCCG TAAGGCGGCA ATGGATGTTG TGACCGGGCA GATGCGCGAT
GGTGGTCTGT TCTCCGGAGG TGGACGATGA
 
Protein sequence
MAEPVGDLVV DLSLDAARFD EQMARVRRHF SGTESDAKKT AAVVEQSMSR QALAAQKAGI 
SVGQYKAAMR MLPAQFTDVA TQLAGGQSPW LILLQQGGQV KDSFGGMIPM FRGLAGAITL
PMVGATSLAV ATGALAYAWY QGNSTLSDFN KTLVLSGNQA GLTADRMLVL SRAGQAAGLT
FNQTSESLSA LVKAGVSGEA QIASISQSVA RFSSASGVEV DKVAEAFGKL TTDPTSGLTA
MARQFHNVTA EQIAYVAQLQ RSGEEAGALQ AANEAATKGF DDQTRRLKEN MGTLETWADR
TARAFKSMWD AVLDIGRPDT AQEMLIKAEA AFKKADDIWN LRKDDYFVND EARARYWDDR
EKARLALEAA RKKAEQQSQQ DKNAQQQSDT EASRLKYTEE AQKAYERLQT PLEKYTARQE
ELNKALKDGK ILQADYNTLM AAAKKDYEAT LKKPKQSGVK VSAGDRQEDS AHAALLTLQA
ELRMLEKHAG ANEKISQQRR DLWKAESQFA VLEEAAQRRQ LSAQEKSLLA HKDETLEYKR
QLAVLGDKVT YQEHLNALAQ QADKFAQQQR AKRAAIDAKN RGLTDRQAAR EATEQRLKEQ
YGDNPLALNN VMSEQKKTWA AEDQLRGSWM AGLRSGWSEW EESATDSMSQ VKSAATQTFD
GIAQNMAAML TGSEQNWRSF TRSVLSMMTE ILLKQAMVGI VGSIGSAIGG GASASGGTAI
QAAAAKFHFA AGGFTGTGGK YEPAGIVHRG EFVFTKEATS RIGVGNLYRL MRGYATGGYV
GTPGSMADSR SQASGTFEQN NHVVINNDGT NGQIGPQALK AVYDVARKAA MDVVTGQMRD
GGLFSGGGR