Gene ECH74115_1878 details


Gene Information

Locus tag: ECH74115_1878
Symbol:
ID: 6971610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1774199
End bp: 1777675
Gene Length: 3477 bp
Protein Length: 1158 aa
Translation table: 11
GC content: 57%
IMG OID: 643385814
Product: hypothetical protein
Protein accession: YP_002270303
Protein GI: 209399955
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
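
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1774199
end_bp = 1777675

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the reported Gene Length (3477 bp) and Protein Length (1158 aa).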


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.1744
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTAAAG GGGGCGGCAA GGGGCACACG CCGGTAGAGG CAAAGGACAA TCTTAAGTCC 
ACGCAGATGA TGAGCGTGAT TGACGCCATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG
GGGCTGCAGA GTATCCTGGT GAACAAAACC CCGCTGACGG ACACGGACGG TAATCCTGTG
ATACATGGTG TGACAGCGGT CTGGCGCGCC GGGGAGCAGG AGCAGACACC ACCTGAAGGC
TTTGAGTCCT CCGGGGCGGA AACCGCACTG GGCGTGGAAG TGACGAAAGC AAAGCCGGTG
ACGCGCACCA TTACGTCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA
CTGTTGGAGA CCACCTCAAA GGGCGACCGT AATCCCTCTT CTGTCCGACT GCTGATTCAG
TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC
TCACAGTACC TGGCGTCGGT GATTCTGGAG AATCTGCCTG AGCGGCCCTT TAACATCCGG
ATGGTCCGGG AGACGGCGGA CAGCACCACG GACCAGCTGC AGAATAAGAC GCTCTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCGAT TGTGGGGCTG
CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT
CGCATCATCC AGGTACCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG
GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCGGCCT GGTGCCTGTG GGACATGCTG
ACTCACCCGC GCTACGGCAT GGGAAAACGT CTGGTGGCGG CGGATGTTGA CAAGTGGGCG
CTGTATGCCA TCGGGCAGTA CTGCGACCAG ACGGTCCCGG ATGGTTTCGG GGGCACAGAG
CCGCGGATGA CCTTTAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GCCAGACGCT GACGTTCGTT
CAGGACCGTC CGTCGGATGT GGTGTGGCCG TACACCAGCA GCGATGTGGT GGTGGATGAT
AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG
GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG
GAAGCCATAC TGCGCTACGG GCGCAACCTG CTGAAGATGG ATGCGTTCGG TTGCACCAGT
CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG
ACGGTGGATT TCACGCTCGG GTCACAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA
ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTATCCTGTC CATCGATGCC
GCCAGCCGCA CCCTGACACT GGACCGTGAG GTGACCCTGC CGGAGACAGG TGCCGCCACG
GTGAACCTGA TTAACGGCAG CGGTAAGCCG GTGAGCGTGG CCATCACTGC ACACCCCGCG
CCGGACCGGA TACAGGTCAG CACCCTGCCT GATGGTGTGG AGACATACGG TGTATGGGGA
CTCTCCCTGC CGTCACTGCG TCGTCGCCTG TTCCGCTGTG TCTCCATCCG GGAAAACACG
GACGGCACCT TTGCCATCAC GGCGGTGCAG CACGTACCGG AAAAGGAAGC CATCGTGGAT
AACGGGGCCA GCTTTGAGCC GCAGTCAGGC ACCCTGAACA GCGTCATCCC TCCGGCAGTG
CAGCACCTGA CGGTGGAGGT GAGCGCAGCT GACGGTCAGT ATCTGGCGCA GGCGAAATGG
GACACGCCGA AGGTGGTGAA GGGCGTGAGC TTTATGCTTC GCCTGACCGT GGTCGCGGAT
GACGGCAGTG AGCGGCTGGT CAGCACGGCC CGGACGACGG AAACCACATA CCGCTTCACG
CAACTGGCGC CGGGGAACTA CAGGCTGACA GTCCGGGCGG TAAATGCGTG GGGGCAGCAG
GGCGATCCGG CGTCGGTATC GTTCCGGATT GCCGCACCGG CAGCGCCGTC ACAGATTGAG
CTGACACCGG GCTATTTTCA GATAACAGCG GTCCCGCGTC TTGCGGTGTA TGACCCGACG
GTACAGTTTG AGTTCTGGTT TTCGGAAACG CGGATTACCG ATATCAGGCA GGTTGAAACC
ACAGCCCGCT ACCTTGGCAC GGGGCTGTAC TGGATAGCCG CCAGTATCAA TATCAAACCG
GGCCATGATT ATTACTTTTA TATCCGCAGT GTGAACACCG TTGGCAAATC GGCATTTGTG
GAGGCTGTTG GCCAGCCGAG TGATGATGCA TCCGGCTATC TGGATTTTTT CAAAGGAGAG
ATAGGGAAAA CCCATCTGGC TCAGGAGTTG TGGACTCAGA TTGATAACGG TCAGCTTGCG
CCTGACCTGG CGGAAATCAG AACGTCCATC ACGGATGTCA GTAATGAAAT CACGCAGACC
GTCAATAAGA AACTGGAAGA CCAGAGTGCA GCGATCCAGC AGATACAGAA GGTTCAGGTT
GATACAAATA ATAATCTGAA CAGCATGTGG GCTGTGAAGC TGCAACAGAT GAAGGACGGA
CGCCTTTATA TTGCGGGTAT CGGTGCCGGT ATTGAGAATA CGCCAGCAGG AATGCAGAGT
CAGGTGCTGC TGGCGGCAGA CAGGATTGCG ATGATTAATC CTGCGAATGG CAACACAAAG
CCGATGTTTG TTGGTCAGGG CGATCAGATA TTCATGAACG ACGTGTTCCT GAAACGCCTG
ACGGCTCCCA CCATTACCAG CGGCGGTAAT CCTCCGGCAT TTTCCCTGAC ACCGGACGGA
AAGCTGACCG CTAAAAATGC AGATATCAGT GGCAGTGTGA ATGCGAACGC CGGGACGCTC
AACAATGTCA CAATTAATGA GAACTGTCAG ATTAAGGGGA AACTGTCAGC CAACCAGATT
GAAGGCGATA TAGTCAAAAC AGTGGGTAAG GCTTTTCCGC GGGACTCCCG GGCACCGGAG
CGGTGGCCAT CAGGGACCAT TACCGTCAGG GTTTATGACG ATCAGCCGTT TGACCGGCAG
ATTGTTATTC CGGCGGTGGC ATTCAGTGGC GCTAAGCATG AGAGAGAGCA TACTGATATT
TACTCCTCAT GCCGTCTGAT AGTGCGGAAA AACGGTGCTG AAATTTATAA CCGTACCGCG
CTGGATAATA CGCTGATTTA CAGTGGCGTT ATTGATATGC CTGCCGGTCA CGGTCACATG
ACGCTGGAGT TTTCGGTATC AGCATGGCTG GTGAATAACT GGTATCCCAC AGCAAGTATC
AGCGATTTGC TGGTTGTGGT GATGAAGAAA GCCACCGCAG GCATCAGTAT CAGCTGA
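
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks need to be stripped before any length or GC calculation. A minimal sketch follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first printed line is used as sample input here, so its GC fraction is not expected to equal the 57% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Strip the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence, copied from above.
first_line = "GTGGGTAAAG GGGGCGGCAA GGGGCACACG CCGGTAGAGG CAAAGGACAA TCTTAAGTCC"

seq = clean_sequence(first_line)
print(len(seq), "bases")
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` should give a value comparable to the Gene Length and GC content fields.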
 
Protein sequence
MGKGGGKGHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV 
IHGVTAVWRA GEQEQTPPEG FESSGAETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS
LLETTSKGDR NPSSVRLLIQ LQRNGNWVTE KDVTINGKTT SQYLASVILE NLPERPFNIR
MVRETADSTT DQLQNKTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG
RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LVAADVDKWA
LYAIGQYCDQ TVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDRPSDVVWP YTSSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP
EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE
ICDNDYAGTM TGGRILSIDA ASRTLTLDRE VTLPETGAAT VNLINGSGKP VSVAITAHPA
PDRIQVSTLP DGVETYGVWG LSLPSLRRRL FRCVSIRENT DGTFAITAVQ HVPEKEAIVD
NGASFEPQSG TLNSVIPPAV QHLTVEVSAA DGQYLAQAKW DTPKVVKGVS FMLRLTVVAD
DGSERLVSTA RTTETTYRFT QLAPGNYRLT VRAVNAWGQQ GDPASVSFRI AAPAAPSQIE
LTPGYFQITA VPRLAVYDPT VQFEFWFSET RITDIRQVET TARYLGTGLY WIAASINIKP
GHDYYFYIRS VNTVGKSAFV EAVGQPSDDA SGYLDFFKGE IGKTHLAQEL WTQIDNGQLA
PDLAEIRTSI TDVSNEITQT VNKKLEDQSA AIQQIQKVQV DTNNNLNSMW AVKLQQMKDG
RLYIAGIGAG IENTPAGMQS QVLLAADRIA MINPANGNTK PMFVGQGDQI FMNDVFLKRL
TAPTITSGGN PPAFSLTPDG KLTAKNADIS GSVNANAGTL NNVTINENCQ IKGKLSANQI
EGDIVKTVGK AFPRDSRAPE RWPSGTITVR VYDDQPFDRQ IVIPAVAFSG AKHEREHTDI
YSSCRLIVRK NGAEIYNRTA LDNTLIYSGV IDMPAGHGHM TLEFSVSAWL VNNWYPTASI
SDLLVVVMKK ATAGISIS
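
The record lists translation table 11 (the bacterial, archaeal and plant plastid code). This explains why the gene starts with GTG while the protein starts with M: GTG is an accepted initiation codon in that table and is read as methionine at the start position. The sketch below is a minimal hand-rolled translator, not the database's own method. The function name `translate_cds` and its `initiator` flag are hypothetical, and the codon table is built from the standard NCBI amino-acid string with bases ordered T, C, A, G.

```python
from itertools import product

# Amino-acid assignments for all 64 codons, in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str, initiator: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    If `initiator` is true, a recognized start codon in the first position
    is translated as M regardless of its usual amino acid.
    """
    dna = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence, copied from above.
head = "GTGGGTAAAG GGGGCGGCAA GGGGCACACG"
tail = "ATCAGCTGA"

print(translate_cds(head))                   # start of the protein
print(translate_cds(tail, initiator=False))  # end of the protein, before the stop
```

The two fragments translate to the first ten and last two residues of the protein sequence listed above. The same function can be applied to the complete gene sequence.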