Gene ECH74115_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1802 
Symbol 
ID6971212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1716856 
End bp1719711 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content56% 
IMG OID643385747 
Producthypothetical protein 
Protein accessionYP_002270237 
Protein GI209395920 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0582533 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTAAAG GGGGCGGCAA GGGGCACACG CCGGTAGAGG CAAAGGACAA TCTTAAGTCC 
ACGCAGATGA TGAGCGTGAT TGACGCCATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG
GGGCTGCAGA GTATTCTGGT GAACAAAACC CCGCTGACGG ACACGGACGG TAATCCCGTG
ATACACGGTG TGACTGCGGT CTGGCGTGCC GGGGAGCAGG AGCAGACACC ACCGGAAGGC
TTTGAGTCCT CCGGAGCTGA AACCGCACTG GGCGTGGAAG TGACGAAGGC AAAGCCGGTG
ACGCGCACCA TTACGTCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA
CTGTTGGAGA CCACCTCAAA GGGCGACCGT AATCACTCTT CTGTCCGACT GCTGATTCAG
TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC
TCGCAGTTCC TGGCGTCGGT GATTCTGGAT AATCTGCCGC CCCGGCCCTT TAACATCCGG
ATGGTCAGGG AGACGGCGGA CAGCACCACG GACCAGCTGC AGAACAGAAC GCTGTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCCAT TGTGGGGCTG
CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT
CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG
GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCTGCCT GGTGCCTGTG GGACATGCTG
ACCCACCCGC GCTACGGAAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG
CTGTATGCCA TTGTGACACT GCCGGAGACC GGTGCCGCCA CGGTGAACCT GATTAACGGC
AGCGGTAAGC CGGTGAGTGT GGACATCACC GCACACCCCG CGCCGGACCG GATACAGGTC
AGTACCCTGC CTGATGGTGT GGAGACATAC GGGGTGTGGG GACTCTCCCT GCCGTCACTG
CGCCGTCGCC TGTTCCGCTG TGTCTCCGTC CGGGAAAACA CGGACGGCAC CTTTGCCATC
ACGGCGGTGC AGCACGTACC GGAAAAAGAA GCCATCGTGG ATAACGGTGC CCGCTTTGAG
CCGCAGTCAG GTTCCCTGAA CAGCGTCATC CCACCGGCAG TACAGCACCT GACGGTGGAG
GTGAGTGCAG CTGACGGCCA GTATCTGGCG CAGGCTAAAT GGGACACGCC GCGGGTGGTG
AAGGGCGTGC GCTTCAGTCT GCGCCTGACC AGTGGTAAGG GAACGGATGC CAGACTGGTG
ACCACCGCCA TCACCGCAGA CACGGAGCAC CGTTTCAGCG GCCTGCCGCT CGGGGAATAC
ACCCTGACGG TGCGGGCGAT AAACAGCTAT GGCCAGCAGG GTGAACCTGC CACCACCACC
TTCCGGATTG CCGCACCGGC AGCACCGTCG CGGATTGAGC TGACGCCGGG CTATTTTCAG
ATAACCGCCA CGCCGCATCT TGCCGTTTAT GACCCGACGG TACAGTTTGA GTTCTGGTTC
TCGGAAAAGC GGATTGCGGA TATCAGGCAG GTTGAAACCG CAGCCCGCTA TCTTGGCTCG
GCGCTGTACT GGATAGCTGC CAGTATCAAT ATCAAACCGG GCCATGATTA TTATTTTTAT
ATCCGCAGTG TGAATACTGT TGGCAAATCG GCATTCGTGG AGGCTGTCGG TCGGGCGAGC
GATGATGCGG AAGGTTACCT GGATTTTTTC AAAGGAGAAA TCGGGAAAAC ACATCTGGCC
CAGGAGCTGT GGACGCAGAT TGATAACGGT CAGCTTGCGC CGGACCTGGC TGAAATCAGG
ACGTCCATTA CGAATGTCAG CAATGAAATC ACGCAGACCG TCAATAAAAA ACTGGAAAAT
CAGAGTGCGG CAATCCAGCA GATACAGAAA GTTCAGGTTG ATACAAATAA TAACCTGAAC
AGCATGTGGG CCGTGAAACT GCAGCAGATG CAGGACGGAC GCCTTTATAT TGCGGGTATC
GGTGCCGGTA TTGAGAATAC GCCAGCAGGA ATGCAGAGTC AGGTGCTGCT GGCGGCAGAC
AGGATTGCGA TGATTAATCC TGCGAATGGC AACACAAAGC CGATGTTTGT TGGTCAGGGC
GATCAGATAT TTATGAATGA AGTGTTCCTG AAATATCTGA CGGCTCCCAC CATTACCAGC
GGCGGTAATC CTCCGGCATT TTCCCTGACA CCGGACGGGC GGCTGACGGC GAAAAATGCC
GATATCAGCG GTAACGTGAA TGCGAACTCC GGGACGCTCA ACAACGTCAC GATTAACGAG
AACTGTCGGG TTCTGGGAAA ATTGTCCGCG AACCAGATTG AAGGCGATCT CGTTAAAACA
GTGGGCAAAG CTTTCCCCCG GGACTCCCGT GCACCGGAGC GGTGGCCATC AGGAACCATT
ACCGTCAGGG TTTATGACGA TCAGCCGTTT GACCGGCAGA TTGTTATTCC GGCGGTGGCA
TTCAGCGGCG CTAAACATGA GAAAGAGCAT ACTGATATTT ACTCCTCATG CCGTCTGATA
GTGCGGAAAA ACGGTGCTGA AATTTATAAC CGTACCGCGC TGGATAATAC GCTGATTTAC
AGTGGTGTTA TTGATATGCC TGCCGGTCAC GGTCACATGA CACTGGAGTT TTCGGTGTCA
GCATGGCTGG TAAATAACTG GTATCCCACA GCAAGTATCA GCGATTTGCT GGTTGTGGTG
ATGAAGAAAG CCACTGCAGG CATCACGATT AGCTGA
 
Protein sequence
MGKGGGKGHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV 
IHGVTAVWRA GEQEQTPPEG FESSGAETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS
LLETTSKGDR NHSSVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR
MVRETADSTT DQLQNRTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG
RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA
LYAIVTLPET GAATVNLING SGKPVSVDIT AHPAPDRIQV STLPDGVETY GVWGLSLPSL
RRRLFRCVSV RENTDGTFAI TAVQHVPEKE AIVDNGARFE PQSGSLNSVI PPAVQHLTVE
VSAADGQYLA QAKWDTPRVV KGVRFSLRLT SGKGTDARLV TTAITADTEH RFSGLPLGEY
TLTVRAINSY GQQGEPATTT FRIAAPAAPS RIELTPGYFQ ITATPHLAVY DPTVQFEFWF
SEKRIADIRQ VETAARYLGS ALYWIAASIN IKPGHDYYFY IRSVNTVGKS AFVEAVGRAS
DDAEGYLDFF KGEIGKTHLA QELWTQIDNG QLAPDLAEIR TSITNVSNEI TQTVNKKLEN
QSAAIQQIQK VQVDTNNNLN SMWAVKLQQM QDGRLYIAGI GAGIENTPAG MQSQVLLAAD
RIAMINPANG NTKPMFVGQG DQIFMNEVFL KYLTAPTITS GGNPPAFSLT PDGRLTAKNA
DISGNVNANS GTLNNVTINE NCRVLGKLSA NQIEGDLVKT VGKAFPRDSR APERWPSGTI
TVRVYDDQPF DRQIVIPAVA FSGAKHEKEH TDIYSSCRLI VRKNGAEIYN RTALDNTLIY
SGVIDMPAGH GHMTLEFSVS AWLVNNWYPT ASISDLLVVV MKKATAGITI S