Gene ECH74115_1638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1638 
Symbol 
ID6969319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1582057 
End bp1585455 
Gene Length3399 bp 
Protein Length1132 aa 
Translation table11 
GC content56% 
IMG OID643385598 
Producthypothetical protein 
Protein accessionYP_002270092 
Protein GI209397540 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.498718 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAG GCAGCAGTAA GGGGCATACC CCGCGCGAAG CGAAGGACAA CCTGAAGTCC 
ACGCAGCTGC TGAGTGTGAT CGATGCCATC AGCGAAGGGC CGGTTGAAGG TCCGGTGGAT
GGATTAAAAA GCGTGCTGCT GAACAGTACG CCGGTGCTGG ACAGTGAGGG GAATACCAAT
ATATCCGGCG TCACGGTGGT GTTCCGGGCC GGTGAGCAGG AGCAGACACC GCCGGAGGGA
TTTGAATCCT CCGGCTCCGA GACGGTGCTC GGTACAGAAG TGAAATATGA CACGCCGATC
ACCCGGACCA TCACGTCGGC AAACATTGAC CGTCTGCGTT TTACTTTCGG CGTGCAGGCA
CTGGTGGAAA CCACCTCAAA GGGGGACAGG AATCCATCGG AAGTCCGCCT GCTGGTTCAG
ATACAACGTA ACGGTGGCTG GGTGACGGAA AAAGACATCA CCATTAAGGG TAAAACCACT
TCACAGTATC TGGCCTCGGT GGTGGTGGAT AACCTGCCGC CGCGCCCGTT TAATATCCGG
ATGCGCAGGA TGACGCCGGA CAGCACCACA GACCAGCTGC AGAACAAAAC GCTCTGGTCG
TCATACACCG AAATTATCGA TGTGAAACAG TGCTACCCGA ACACGGCACT GGTCGGCGTG
CAGGTGGACT CGGAGCAGTT CGGCAGCCAG AAGGTGAGCC GTAATTATCA TCTGCGCGGG
CGTATTCTGC AGGTGCCGTC GAATTATAAC CCGCAGACGC GGCAATACAG CGGTATCTGG
GACGGAACGT TTAAACCGGC ATACAGCAAC AACATGGCCT GGTGTCTGTG GGATATGCTG
ACCCATCCGC GCTACGGCAT GGGGAAACGT CTTGGTGCGG CGGATGTGGA TAAATGGGCG
CTGTATGTCA TCGGCCAGTA CTGCGACCAG TCAGTGCCGG ACGGCTTTGG CGGCACGGAG
CCGCGCATCA CCTGTAATGC CTACCTGACC ACACAGCGTA AGGCGTGGGA TGTTCTCAGC
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTCGTG
CAGGACCGAC CGTCGGATAA GGTGTGGACC TATAACCGCA GTAATGTGGT GATGCCGGAT
GATGGCGCGT CGTTCCGCTA CAGCTTCAGC GCCCTCAAGG ACCGCCATAA TGCCGTTGAG
GTGAACTGGA TTGACCCGGA TAACGGCTGG GAGACGGCGA CAGAGCTTGT GGAGGACACG
CAGGCCATTC TCCGTTACGG TCGTAACGTC ACGAAGATGG ATGCCTTTGG CTGTACCAGC
CGGGGGCAGG CACACCGCGC CGGGCTGTGG CTGATTAAAA CGGAGCTGCT GGAGACGCAG
ACCGTGGATT TCAGCGTGGG CGCAGAAGGG CTTCGCCATG TACCGGGCGA TGTCATTGAA
ATCTGTGATG ATGACTATGC GGGGATCAGC ATCGGTGGGC GTGTGCTGGC GGTGAACAGC
CAGACCCGGA CGCTGACGCT CGACCGTGAA ATCACGCTGC CATCCTCCGG TACCACGCTG
ATAAGCCTGG TTGACGGAAG TGGCAATCCG GTCAGCGTGG AGGTCCAGTC CGTCACCGAC
GGCGTGAAGG TAAAAGTGAG CCGTGTTCCT GACGGCGTTG CCGGATACAG CGTATGGGGG
CTGAAGTTGC CGACGTTGCG CCAGCGCCTG TTCCGCTGCG TGAGTATCCG TGAGAACGAC
GACGGCACGT ATGCCATCAC CGCCGTGCAG CATGTACCCG AAAAAGAAGC CATCGTGGAT
AACGGGGCGC ACTTTGACGG CGACCTGAGC GGCACGGTGA ATGGCGTCAC GCCGCCCGCG
GTGCAGCACC TGACTGCCGA AGTCACCGCA GACAGCGGGG AATATCAGGT GCTGGCGCGC
TGGGACACGC CGAAGGTGGT GAAGGGGGTG AGCTTCCTGC TTCGCCTGAC CGTGGCAGCG
GACGATGGCA GTGAGCGGCT GGTCAGTACG GCCAGGACGA CGGAAACCAC ATACCGCTTC
ACGCAACTGG CGCTGGGGAA CTACAGGCTG ACTGTCCGGG CGGTAAATGC GTGGGGACAG
CAGGGCGATC CGGCATCGGT ATCGTTCCGG ATTGCCGCAC CGGCAGCGCC GTCTCGGATT
GAGCTGACAC CAGGCTATTT TCAGATAACC GCCACGCCGC ATCTTGCGGT TTATGATCCG
ACGGTACAGT TTGAGTTCTG GTTCTCGGAA ACGCGGATTG CGGATATCAG GCAGGTTGAA
ACCAGCGCGC GTTATCTTGG TACGGCGCTG TACTGGATAG CCGCCAGTAT CAATATCAAA
CCGGGCCATG ATTATTACTT TTATATCCGC AGTGTGAACA CCGTTGGCAA ATCGGCATTC
GTGGAGGCCG TCGGTCGGGC GAGCGATGAT GCGGAAGGTT ATCTGGATTT TTTCAAAGGC
AAGATAACCG AATCTCATCT TGGTAAAGAG CTACTGGAAA AAGTTGACCT GACGGAGGAT
AACGCCAGCA GACTGGATGA GTTTTCGAAA GAGTGGAAGG ATGCTAACGA TAAGTGGAAT
GCCATGTGGG GCGTCAAAAT TGAGCAGACC AAAGACGGCA AACATTATGT CGCGGGTATT
GGCCTCAGCA TGGAGGACAC GGAGGAAGGC AAGCTGAGCC AGTTTCTGGT TGCCGCCAAT
CGTATCGCGT TTATTGACCC GGCAAACGGG AATGAAACGC CGATGTTTGT GGCGCAGGGC
AACCAGATAT TCATGAACGA CGTGTTCCTG AAGCGCCTGA CGGCCCCCAC CATTACCAGC
GGTGGAAATC CACCGGCATT TTCCCTGACA CCGGACGGAA AGCTGACTGC TAAAAATGCG
GATATCAGTG GCAGTGTGAA TGCGAACTCC GGGACGCTCA ACAACGTCAC GATTAATGAG
AACTGTCAGA TTAAGGGGAA ACTGTCAGCC AATCAGATTG AAGGCGATAT TGTCAAAACG
GTCAGCAAGT CTTTCCCCCG CACGAGCACT TATGCCAGTG GCACCATCAC GGTAAGAATC
AGTGATGATC AGAAATTTGA CCGGCAGGTC ATGATACCGC CAGTGTTATT CCGCGGTGGT
AAGCATGAGA ATTTCAACAG TAATAACCAA CAGTCATACT GGTATTCAAC CTGCCGGTTA
AGAGTGACCC GCAATGGCCA GGAGATTTTT AATCAGTCCA CGACGGATGC TCAGGGCGTA
TTTTCCTCAG TTATAGATAT GCCTGCCGGA CAGGGGACGC TGACACTGAC ATTCACCGTA
TCTTCATCAG GAGCGAATAA CTGGACACCA ACAACCAGTA TCAGCGATCT GCTGGTTGTG
GTGATGAAAA AATCCACAGC AGGTATCAGT ATCAGCTGA
 
Protein sequence
MGKGSSKGHT PREAKDNLKS TQLLSVIDAI SEGPVEGPVD GLKSVLLNST PVLDSEGNTN 
ISGVTVVFRA GEQEQTPPEG FESSGSETVL GTEVKYDTPI TRTITSANID RLRFTFGVQA
LVETTSKGDR NPSEVRLLVQ IQRNGGWVTE KDITIKGKTT SQYLASVVVD NLPPRPFNIR
MRRMTPDSTT DQLQNKTLWS SYTEIIDVKQ CYPNTALVGV QVDSEQFGSQ KVSRNYHLRG
RILQVPSNYN PQTRQYSGIW DGTFKPAYSN NMAWCLWDML THPRYGMGKR LGAADVDKWA
LYVIGQYCDQ SVPDGFGGTE PRITCNAYLT TQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDRPSDKVWT YNRSNVVMPD DGASFRYSFS ALKDRHNAVE VNWIDPDNGW ETATELVEDT
QAILRYGRNV TKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE
ICDDDYAGIS IGGRVLAVNS QTRTLTLDRE ITLPSSGTTL ISLVDGSGNP VSVEVQSVTD
GVKVKVSRVP DGVAGYSVWG LKLPTLRQRL FRCVSIREND DGTYAITAVQ HVPEKEAIVD
NGAHFDGDLS GTVNGVTPPA VQHLTAEVTA DSGEYQVLAR WDTPKVVKGV SFLLRLTVAA
DDGSERLVST ARTTETTYRF TQLALGNYRL TVRAVNAWGQ QGDPASVSFR IAAPAAPSRI
ELTPGYFQIT ATPHLAVYDP TVQFEFWFSE TRIADIRQVE TSARYLGTAL YWIAASINIK
PGHDYYFYIR SVNTVGKSAF VEAVGRASDD AEGYLDFFKG KITESHLGKE LLEKVDLTED
NASRLDEFSK EWKDANDKWN AMWGVKIEQT KDGKHYVAGI GLSMEDTEEG KLSQFLVAAN
RIAFIDPANG NETPMFVAQG NQIFMNDVFL KRLTAPTITS GGNPPAFSLT PDGKLTAKNA
DISGSVNANS GTLNNVTINE NCQIKGKLSA NQIEGDIVKT VSKSFPRTST YASGTITVRI
SDDQKFDRQV MIPPVLFRGG KHENFNSNNQ QSYWYSTCRL RVTRNGQEIF NQSTTDAQGV
FSSVIDMPAG QGTLTLTFTV SSSGANNWTP TTSISDLLVV VMKKSTAGIS IS