Gene ECH74115_3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3194 
Symbol 
ID6968073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2946217 
End bp2948862 
Gene Length2646 bp 
Protein Length881 aa 
Translation table11 
GC content58% 
IMG OID643387013 
Productputative prophage tail length tape measure protein 
Protein accessionYP_002271480 
Protein GI209399937 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000180556 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGGTA ATTTTGCCGA TCTGACAGCC GTGCTGACAC TGGATTCTGC CCGTTTTTCT 
GAAGAGGCAG CGCGGGTAAA AAAAGAGCTG GGTGAAACCA GTGCGCTTGC TGATTTGATG
TCCGGGAAAG TCAGTCAGTC TTTCAGAAAA CAGGCTGATG CTGCTGAGCA GAGTCTGAGC
CGACAGGCGC TGGCTGCACA AAAAGCCGGG ATATCAGTCG GACAGTATAA GGCTGCCATG
CGCACACTGC CCGCACAGTT CACGGATATT GTCACTCAGC TTGCTGGTGG TCAGAATCCC
TTCCTTATCA TGCTGCAGCA GGGGGGGCAG ATCAGCGATT CATTCGGTGG ACCGCTCAGC
CTGCTTACCC TGCTGAAGGA GGAACTTCTC GGGATCAGGG ATGCCTCTGA ATCATCAGAG
GAGTCGCTGT CAGATACGGC AAATGCACTG GCTGAAAATG CCCGGAATGC CGGTGAGCTG
GGACGATTTA TGTCGGTGGC CCGTGTGGCG GCAGGTGGCG GGGTTGCCGT ACTGGCCGCG
CTTGCTGCCG CCGCCTGGCA GGCAGAGCAG GCTGATCGGG CCTTATTGCG TTCACTGATC
CTGACCGGAG GGGCGGCTGC CACCACAACG GCAGAATTGT GGAAAATGGC CGGGGTGATC
AGCGATGAAG CCGGTGGTGG TATCAGACAG GCGGCAGAAA ATCTGGCCCG TCTGGCAGAA
AGCGGGAAAT ATACCGCCGG GCAGCTACGG ATCATGGGGG AAACCTCTCA GAGATGGCTG
CAGACGGTGG GGGACGATGC CGGGAAGGTG GAAAAAGCCT TTGAAGGGAT TGCAGCAGAT
CCGGTGAAGG CGCTGGCCTC CCTGAATCAG CAGTATAACT TCCTGAGCGT TTCCCAGTTA
CGCCATATTG ATGAGCTTGA GCGCACGAAA GGTAAACAGG CTGCGGTGAC GGAGGCGATG
TCCCTGTTTG CGGATGTCAT GAATGCACGT CTGGAGCAAC TTGATAAAGC GGCCACGCCG
GTGGAAAAAA TCTGGGACGA TGTTAAAACC TGGACTTCTG ACGCATGGGC ATGGATAGGT
GATCATACAC TGGGGGCACT CAGTCTGATC ACTGACGTGG TGGCCGGAAC CGTTGAACAA
GTGAAGCTGC TGCTTGTGCA GGGGGATCTG GCGCTGGCTG AATTTATTCA GTCAGCCTGG
GAAACGACAA AGAATGTGCC CGGCGTTGGT GCGTTGTTTG GTGAACTGGC AGAAGAGAAC
CGCGTATTTA TTGAGAAAAC AAAACGCGAT GAACTGGCGC TGAGAAAATC CATTGCGGAA
CGGGATGCGC GTATACGCCA GGGGGAAATA GGGTACATCA ACCGCTCGCG TGCAACAGGC
GTCAGCAAAG GTCCAGGGCA GCAGGAAGCC GTCAGCCGTC TGGCTGAAGA GCTGACAGGT
AAAAAGCATA CATCACCGAA AACGCGCTCT GCCGGGGAGA GGGAAGAGGA GCAGGCAAGA
GAGGCTCTGC TTGCCCTTGA AGCTGAGCTC AGGACGCTGG AAAAACACAG CGGTGCGAAT
GAGAAAATTA GCCGGCAGCG CCGTGATTTA TGGAAGGCGG AAAGTCAGTA TGCGGTCCTG
AAAGAGGCTG CCACGAAACG GCAGTTATCC TGGCAGGAAA AATCCCTGCT GGCCCATGAG
AAAGAGACGC TGGAGTACAA ACGCCAGCTG GCTGACCTGG GCGACAAGGT TGAACACCAG
AAACGGCTGA ATGAGCTGGC ACAGCAGGCT GCGCGGTTTG AGCAGCAGCA GGGCGCGAAG
CAGGCGGCAA TCAGTGCCCA GGCGCGGGGC CTCACCGACC GTCAGGCGCA GCGGGAGTCG
GAAGAGCAGC GCCTTCGTGA CGTGTACGGT GATAATCCGG ATGCGCTGGC GAAGGCCACA
TCGGCACTGA AGAACACCTG GTCTGCGGAG GAGCAGCTTC GTGGAAGCTG GATGGCCGGT
CTGAAGTCCG GCTGGGGCGA GTGGGCAGAA AGTGCGACGG ACAGTTTTTC GCAGGTTAAA
AGCGTGGCCA CGCAGACCTT TGACGGTATT GCACAGAATA TGGCAGCGAT GCTGACCGGC
AGCGAACAGA ACTGGCGTGG TTTCACCCGT TCTGTGCTCT CCATGCTGAC AGAGATTTTT
CTGAAGCAGG CGATGGTGGG GATAGTCGGG AGTATCGGCA GCGCCATTGG CGGTGCTTTC
GGTGGTGGTG CATCTGCTTC CTCGGGGACG GCCATTGAGG CTGCGGCGGC GAACTTCCAT
TTCGCGACCG GAGGATTTAC GGGGACGGGC GGCAAATATG AGCCTGCGGG GATAGTTCAC
CGCGGGGAGT TTGTTTTCAC GAAAGAGGCA ACCAGCCGGA TAGGTGTGGG GAATCTTTAC
CGTCTGATGC GCGGCTATGC GGAAGGTGGT TATGTGGGTG GTGCCGGAAG TCCGGCGCAG
ATGCGGCGGG CGGAAGGTAT TAATTTTAAT CAGAACAATC ACGTGGTGAT TCAGAACGAC
GGCACCAACG GACAGGCGGG GCCGCAGCTG ATGAAGGCGG TGTATGACAT GGCCCGCAAG
GGGGCGCAGG ATGAGCTCCG GCTGCAGTTG CGTGATGGCG GTATGTTATC GGGGAGCGGG
CGATGA
 
Protein sequence
MAGNFADLTA VLTLDSARFS EEAARVKKEL GETSALADLM SGKVSQSFRK QADAAEQSLS 
RQALAAQKAG ISVGQYKAAM RTLPAQFTDI VTQLAGGQNP FLIMLQQGGQ ISDSFGGPLS
LLTLLKEELL GIRDASESSE ESLSDTANAL AENARNAGEL GRFMSVARVA AGGGVAVLAA
LAAAAWQAEQ ADRALLRSLI LTGGAAATTT AELWKMAGVI SDEAGGGIRQ AAENLARLAE
SGKYTAGQLR IMGETSQRWL QTVGDDAGKV EKAFEGIAAD PVKALASLNQ QYNFLSVSQL
RHIDELERTK GKQAAVTEAM SLFADVMNAR LEQLDKAATP VEKIWDDVKT WTSDAWAWIG
DHTLGALSLI TDVVAGTVEQ VKLLLVQGDL ALAEFIQSAW ETTKNVPGVG ALFGELAEEN
RVFIEKTKRD ELALRKSIAE RDARIRQGEI GYINRSRATG VSKGPGQQEA VSRLAEELTG
KKHTSPKTRS AGEREEEQAR EALLALEAEL RTLEKHSGAN EKISRQRRDL WKAESQYAVL
KEAATKRQLS WQEKSLLAHE KETLEYKRQL ADLGDKVEHQ KRLNELAQQA ARFEQQQGAK
QAAISAQARG LTDRQAQRES EEQRLRDVYG DNPDALAKAT SALKNTWSAE EQLRGSWMAG
LKSGWGEWAE SATDSFSQVK SVATQTFDGI AQNMAAMLTG SEQNWRGFTR SVLSMLTEIF
LKQAMVGIVG SIGSAIGGAF GGGASASSGT AIEAAAANFH FATGGFTGTG GKYEPAGIVH
RGEFVFTKEA TSRIGVGNLY RLMRGYAEGG YVGGAGSPAQ MRRAEGINFN QNNHVVIQND
GTNGQAGPQL MKAVYDMARK GAQDELRLQL RDGGMLSGSG R