Gene ECH74115_0908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0908 
Symbol 
ID6966762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp916622 
End bp919687 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content53% 
IMG OID643384930 
Productputative prophage tail length tape measure protein 
Protein accessionYP_002269430 
Protein GI209396645 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGA TAGCCAACCT GGTCATTGAT TTGGGGATTG ATGCGGCAGA GTTTAAAAAT 
GAAATTCCCC GTATCAAAAA CCTTCTGAAT GGTGCAGCCA GCGATGCAGA ACGGTCTTCT
GCCCGTATGC AGCGTTTTAT GGAGCGTCAG ACTCAGGCCG CCCGGCAGAC AATGCAGGCG
GCTTCTTCGG CTGCAACAGC CGCATCCGTC CATGCGCAGA CGGTGGAGAA GAGCGCACAG
GCTCATGAAC GCATGGCCCG CGAGGTGGAG CAAACCCGCC AGCGTATGGA GGCACTGAGC
CAGAAAATGC GCGAGGAACA GGCGCAGGCC ATGGCTCTGG CGGAGGCTCA GGATAAAGCG
GCTGCTGCGT TTTATCGTCA GATTGACAGT GTGAAACAGG CCGGTGCGGG GCTGCAGGAA
TTACAGCGTA TTCAGCAACA GATCCGACAG GCCAGAAACA GTGGCGGGAT TGGTCAGCAG
GATTATCTGG CGCTGATTTC TGAGGTTACG GCGAAAACCC GTGTTCTTAC ACAGGCTGAG
GAAGAGGCTA CCCGACAGAA AGTGGCGTTT ATCCGTCAGC TTAAAGAGCA GGCAACCCGC
CAGAATCTTT CTTCTTCTGA GTTGCTTCGT GCTAAGGCTG CCCAGCTGGG GGTAAGCAGT
GCTGCAGAAG TGTATATCCG CAAAATGGAG CAGGCAGGAA AAGCCACGCA TTCGCTGGGT
CTGAAAAGTG CAGCAGCCCG TCAGGAGATA GGCGTTCTGA TAGGTGAACT GGCCCGCGGC
AATTTAGGTG CGCTGAGGGG ATCCGGAATA ACGCTGGCTA ACCGTGCCGG GTGGATAGAC
ACACTGATGT CACCGAAAGG CATGATGCCA GGAGCGGTTA TTGGCGGTAT TGCCGCGGCT
GTCTATGGTC TGGGTAAAGC CTGGTATGAC GGTCAGAAGG AGGGGGAAGA ATTTAACCGC
CAGCTGTCGC TGACGGGGCA TTATGCCGGA GTCACTGCCG GGCAGCTGTG GACGCTCAGT
CGTGCTATTT CCGGGAATGG TATCACGCAA CATGCTGCAG CCGGTGCGCT GGCTCAGGTG
GTGGGGAGTG GTGCATTTCG TGGAAACGAT ATCGGTATGG TGGCGAGAGC TGCCGCACAG
ATGGAGCGAT CGGTTGGCCA GTCGGTCAGC GATACCATAA ATCAGTTTAA GCGGCTGAAG
GATGATCCTG TAAATGCCGC GAAGGCTCTG GACAATGAGC TGCATTTTCT TACTGCCACT
CAGCTTGAGC AGATACGCGT CCTTGGGGAG CAGGGGCGGT CCAGTGATGC TGCACGGATA
GCCATGTCTG CACTGGCAGA GGAAACCGGT CGGCGTACTG CGGATATTGA TAATAACCTC
AATGCGCTTG GCAGTACGCT GAAGTATCTG TCTGATTTGT GGAGTCGTTT CTGGGATGCG
GCCATGAATA TTGGTCGTGA AGACTCGCTG GATGAACAGA TTGCCGCTTT ACAGGAGAAA
GTGTCGCGGG CGAAAAGACT CCCCTGGACG GCATCATCTT CTCAGGTTGA ATACGATCAG
CAGCGTCTTA ACGATCTTCA GGAGAAAAAA CGCCAGAAGG ATTTGCAGGA TGCAAAAGAG
CAGGCAGAGC GGAATTATCA GGAGCAACAG AAACGCCGTA ATGCTGAAAA TGCTGCACTG
AACCGGATGA ATGAAACGGA AGCAGCACGA CATCAGCGTG AAATAGCGCG TATTAATGCC
ATGCAGTACG CCGATCAGGC TGTCAGGGAT GCGGCGATAC AACGTGAAAA TGAACGTTAC
GAGAAAGCCC TGGCATCCGG TAAGAAAAAA ACACGCGAAA CCCGTAATGA TGAGGCCACC
CGGTTATTGC TGCAGTACAG TCAGCAACAG GCACAGGTGG AAGGACAGAT TGCTGCTGCC
AGACAGTCAG CAGGCATTGC CACGGAAAGG ATGACAGAAG CGCGTAAACA GCTTCTGGCT
CTGCAGCAGC GCATCAGCGA CCTGGACGGG AAAAAACTGA CGGCAGATGA AAAGAGTGTG
CTGGCCCGTA AAGATGAACT GATTCAGGCA CTGACGCTGC TGGATGTAAA ACAGCAGGAG
CTTCAGAAAC AGACGGCACT CAACGAGCTG AAGAAAAAAA CAATTCAGCT GACCAGTCAA
CTGGCTGAAG AAGAGCGCGC TCAGCGTCAG CAACATGACC TGGATATCGC CACGGTGGGT
ATGGGTGATC AGCAGCGACA GCGATATCAG GTACAACTGA GTCTTCGCCA GAAATACCAG
CAACAGCTGG AGCAGTTGAG GCGGGATAGT GAGCAGAAAG GGACATATAA CACGGATGAC
TACAGAAAGG CCGAGCAGGC GCTGACGGAG AGCCTGAACC GACAACTGAA TGAGAATCGC
CGTTACTGGC AACAGCTTGA AGTTGTACAG GGTAACTGGA AAAACGGAGT CCTGCGTGCA
TTTCAGGATT TTACCGTGGA TGCAGATAAT ACGGCAGAAA CAGCAGAACA GGTGTTCTCG
TCAGCCTTCA GCAACATGGG AAATGGCCTG GCAACTTTTG TCACTACCGG CAAACTCAAT
TTCAAATCCT TCACCTCTTC TGTGCTGTCA GATATGGCGA AAATCCTGGC GCAGGCAACC
ATGATGAAAT CGATAAAAGG GATTGGCAGT GTACTGGGAT TTGATCTCAG CAGCCTTTCC
CTGAATGCCA ATGGGGGGAT TTATCAGTCT GCTGATTTGA GTCGTTACAG TGGCACGGTG
GTTAACCGTC CGACGTTTTT TGCTTTTGCA AAAGGCGCGG GTGTGATGGG GGAAGCGGGA
CCTGAAGCCA TTCTGCCATT GCGTCGTGGT GCTGACGGTA AGCTGGGGGT TGTGGCGGAT
ATTGGGGGTT CAGGTATGGC GATGTTTTCC CCGCAGTACA ACATCGAGAT CAATAACGAT
GGCACGAACG GGCAGATAGG TCCGGCTGCC CTGAAGGCGG TTTATGACCT CGGGAAAAAA
GCGGCAGCGG ACTTTATGCA ACAGCAGGCC CGTGATGGTG GTCGGTTAAG TGGAGCATAT
CGGTAA
 
Protein sequence
MDQIANLVID LGIDAAEFKN EIPRIKNLLN GAASDAERSS ARMQRFMERQ TQAARQTMQA 
ASSAATAASV HAQTVEKSAQ AHERMAREVE QTRQRMEALS QKMREEQAQA MALAEAQDKA
AAAFYRQIDS VKQAGAGLQE LQRIQQQIRQ ARNSGGIGQQ DYLALISEVT AKTRVLTQAE
EEATRQKVAF IRQLKEQATR QNLSSSELLR AKAAQLGVSS AAEVYIRKME QAGKATHSLG
LKSAAARQEI GVLIGELARG NLGALRGSGI TLANRAGWID TLMSPKGMMP GAVIGGIAAA
VYGLGKAWYD GQKEGEEFNR QLSLTGHYAG VTAGQLWTLS RAISGNGITQ HAAAGALAQV
VGSGAFRGND IGMVARAAAQ MERSVGQSVS DTINQFKRLK DDPVNAAKAL DNELHFLTAT
QLEQIRVLGE QGRSSDAARI AMSALAEETG RRTADIDNNL NALGSTLKYL SDLWSRFWDA
AMNIGREDSL DEQIAALQEK VSRAKRLPWT ASSSQVEYDQ QRLNDLQEKK RQKDLQDAKE
QAERNYQEQQ KRRNAENAAL NRMNETEAAR HQREIARINA MQYADQAVRD AAIQRENERY
EKALASGKKK TRETRNDEAT RLLLQYSQQQ AQVEGQIAAA RQSAGIATER MTEARKQLLA
LQQRISDLDG KKLTADEKSV LARKDELIQA LTLLDVKQQE LQKQTALNEL KKKTIQLTSQ
LAEEERAQRQ QHDLDIATVG MGDQQRQRYQ VQLSLRQKYQ QQLEQLRRDS EQKGTYNTDD
YRKAEQALTE SLNRQLNENR RYWQQLEVVQ GNWKNGVLRA FQDFTVDADN TAETAEQVFS
SAFSNMGNGL ATFVTTGKLN FKSFTSSVLS DMAKILAQAT MMKSIKGIGS VLGFDLSSLS
LNANGGIYQS ADLSRYSGTV VNRPTFFAFA KGAGVMGEAG PEAILPLRRG ADGKLGVVAD
IGGSGMAMFS PQYNIEINND GTNGQIGPAA LKAVYDLGKK AAADFMQQQA RDGGRLSGAY
R