Gene ECH74115_1195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1195 
Symbol 
ID6971225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1202443 
End bp1205022 
Gene Length2580 bp 
Protein Length859 aa 
Translation table11 
GC content59% 
IMG OID643385192 
Productputative prophage tail length tape measure protein 
Protein accessionYP_002269688 
Protein GI209400793 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAGC CGGTTGGTGA TCTTGTTATT GACCTGAGTC TGGATGCTGT CCGTTTCGAT 
GAGCAGATGA GCCGGGTAAG GCGTCATTTT TCAGGTCTGG ATACCGACGT CAGAAAAACC
GCCAGTGCTG TTGAACAGGG CCTGAGCCGC CAGGCGCTGG CTGCACAAAA AGCCGGGATT
TCCGTCGGGC AGTATAAAGC GGCCATGCGA ACCCTGCCCG CACAGTTTAC GGATATCGCC
ACGCAGCTTG CCGGTGGTCA GAATCCCTGG CTGATCCTGC TGCAACAGGG CGGTCAGGTG
AAGGACTCCT TCGGCGGGAT GATCCCCATG TTCAGGGGGC TTGCCGGTGC GATCACCCTG
CCGATGGTCG GGGTCACCTC GCTGGCGGTG GCGACAGGTG CGCTGGTGTA CGCCTGGTAC
CAGGGAGATT CCACGCTTTC AGCGTTTAAT AAAACCCTGG TTCTTTCCGG TAATCAGTCC
GGACTGACTG CCGATCGCAT GCTGACGCTC TCAAGAGCCG GGCAGGCAGC AGGGCTGACG
TTTAACCAGG CGAGAGAGTC ACTGGCAGCC CTGGTGAATG CCGGTGTGCG TGGTGGTGAA
CAGTTTGATG CCATCAACCA GAGTGTCGCG CGTTTTGCTT CTGCATCCGG TGTGGAGGTG
GACAAGGTTG CAGAGGCTTT CGGAAAACTG ACCACCGACC CTACGTCGGG GCTGATTGCG
ATGGTGCGCC AGTTCCGTAA CGTGACGGCA GAGCAGATTG CGTATGTTGC GCAGCTGCAG
CGTTCCGGTG ATGAGGCCGG GGCCTTACAG GCGGCGAACG ATATCGCCAC GAAAGGCTTT
GATGAGCAGA CCCGTCGCCT GAAAGAAAAC ATGGGGACAC TGGAGACCTG GGCGGATAAA
ACCGGGAAGG CATTCAAATC GATGTGGGAT GCCATTCTGG ATATCGGTCG TCCTGAGTCC
TCAGCGGATA TGCTCGCCAG TGCACAGAAG GCATTTGATG AGGCGGATAA AAAATGGCAG
TGGTACCAGA GCCGGAGCCA GCGCCGGGGA AAAACCGCCT CTTTCCGGGC CAACCTTCAG
GGCGCATGGA ATGACCGGGA AAATGCCCGT CTGGGGCTGG CAGCGGCCAC GCTGCAGTCG
GATATGGAAA AAGCCGGTGA ACTGGCCGCC AGGGACCGGG CCGAACGGGA CGCATCACAG
CTGAAGTATA CCGGAGAGGC GCAGAAGGCG TATGAGCGTC TGCTGACGCC GCTGGAGAAA
TATACCGCCC GTCAGGAAGA ACTGAATAAG GCCCTGAAAG ACGGGAAAAT CCTGCGGGCG
GATTACAACA CGCTGATGGC GGCGGCGAAA AAGGATTATG AATCGACGCT GAAAAAGCCG
AAGTCGTCAG GAGTCAAAGT GTCAGCCGGT GAGCGTCAGG AAGACCAGGC GCATGCTGCC
CTGCTGGCGC TTGAAACCGA GCTCAGGACG CTGGAAAAAC ACAGCGGTGC GAATGAGAAA
ATCAGCCAGC AGCGTCGCGA TTTATGGAAA GCGGAAAATC AGTATGCGGT CCTGAAAGAG
GCTGCCACGA AACGGCAGTT ATCTGAGCAG GAAAAATCCC TGCTGGCGCA TAAAGACGAG
ACGCTGGAGT ACAAACGCCA GCTGGCTGAG CTGGGCGACA AGGTTGAATA CCAGAAACGC
CTGAATGAGC TGGCACAGCA GGCGGTGCGG TTTGAAGAGC AGCAGAGCGC GAAGCAGGCC
GCCATCAGCG CAAAAGCCCG CGGTCTCACT GACCGTCAGG CGCAGCGGGA GTCTGAAGCG
CAGCGTCTTC GGGACGTGTA CGGTGATAAT CCGGCTGCGC TGGCGAAGGC CACATCGGCA
CTGAAGAACA CCAGGTCTGC GGAGGAGCAG CTTCGTGGAA GCTGGATGGC CGGGCTGAAG
TCCGGCTGGG GCGAGTGGGC GGAAAGTGCG ACGGACAGTT TTTCGCAGGT TAAAAGTGCT
GCCACGCAGA CCTTTGACGG TATTGCACAG AATATGGCGG CGATGCTGAC CGGTGCAGAG
GCAGACTGGC GGGGATTCAC CCGTTCGGTG CTGTCCATGA TGACAGAAAT CCTGCTTAAA
CAGGCCATGG TGGGCATTGT CGGGCGTATC GGCAGCGCCA TTGGCGGTGC TTTCGGTGGT
GGTGCATCTG CTTCCTCGGG GACGGCCATT CAGGCTGCGG CGGCGAACTT CCATTTCGCG
ACCGGAGGAT TTACGGGGAC GGGCGGCAAA TATGAGCCTG CGGGGATAGT TCACCGCGGG
GAGTTTGTTT TCACGAAAGA GGCAACCAGC CGGATAGGTG TGGGGAATCT TTACCGTCTG
ATGCGCGGCT ATGCGGAAGG TGGTTATGTG GGTGGTGCCG GAAGTCCGGC GCAGATGCGG
CGGGCGGAAG GTATTAATTT TAATCAGAAC AATCACGTGG TGATTCAGAA CGACGGCACC
AACGGACAGG CGGGGCCGCA GCTGATGAAG GCGGTGTATG ACATGGCCCG CAAGGGGGCG
CAGGATGAGC TCCGGCTGCA GTTGCGTGAT GGCGGTATGT TATCGGGGAG CGGGCGATGA
 
Protein sequence
MSQPVGDLVI DLSLDAVRFD EQMSRVRRHF SGLDTDVRKT ASAVEQGLSR QALAAQKAGI 
SVGQYKAAMR TLPAQFTDIA TQLAGGQNPW LILLQQGGQV KDSFGGMIPM FRGLAGAITL
PMVGVTSLAV ATGALVYAWY QGDSTLSAFN KTLVLSGNQS GLTADRMLTL SRAGQAAGLT
FNQARESLAA LVNAGVRGGE QFDAINQSVA RFASASGVEV DKVAEAFGKL TTDPTSGLIA
MVRQFRNVTA EQIAYVAQLQ RSGDEAGALQ AANDIATKGF DEQTRRLKEN MGTLETWADK
TGKAFKSMWD AILDIGRPES SADMLASAQK AFDEADKKWQ WYQSRSQRRG KTASFRANLQ
GAWNDRENAR LGLAAATLQS DMEKAGELAA RDRAERDASQ LKYTGEAQKA YERLLTPLEK
YTARQEELNK ALKDGKILRA DYNTLMAAAK KDYESTLKKP KSSGVKVSAG ERQEDQAHAA
LLALETELRT LEKHSGANEK ISQQRRDLWK AENQYAVLKE AATKRQLSEQ EKSLLAHKDE
TLEYKRQLAE LGDKVEYQKR LNELAQQAVR FEEQQSAKQA AISAKARGLT DRQAQRESEA
QRLRDVYGDN PAALAKATSA LKNTRSAEEQ LRGSWMAGLK SGWGEWAESA TDSFSQVKSA
ATQTFDGIAQ NMAAMLTGAE ADWRGFTRSV LSMMTEILLK QAMVGIVGRI GSAIGGAFGG
GASASSGTAI QAAAANFHFA TGGFTGTGGK YEPAGIVHRG EFVFTKEATS RIGVGNLYRL
MRGYAEGGYV GGAGSPAQMR RAEGINFNQN NHVVIQNDGT NGQAGPQLMK AVYDMARKGA
QDELRLQLRD GGMLSGSGR