Gene ECH74115_3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3126 
Symbol 
ID6969619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2900648 
End bp2903293 
Gene Length2646 bp 
Protein Length881 aa 
Translation table11 
GC content58% 
IMG OID643386952 
Productputative prophage tail length tape measure protein 
Protein accessionYP_002271420 
Protein GI209399313 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0141545 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGTA ATTTTGCCGA TCTGACAGCC GTGCTGACAC TGGATTCTGC CCGTTTTTCT 
GAAGAGGCAG CGCGGGTAAA AAAAGAGCTG GGTGAAACCA GTGCGCTTGC TGATTTGATG
TCCGGGAAAG TCAGTCAGTC TTTCAGAAAA CAGGCTGATG CTGCTGAGCA GAGTCTGAGC
CGACAGGCGC TGGCTGCACA AAAAGCCGGG ATATCAGTCG GACAGTATAA GGCTGCCATG
CGCACACTGC CCGCACAGTT CACGGATATT GTCACTCAGC TTGCCGGTGG TCAGAATCCC
TTCCTTATCA TGCTGCAGCA GGGGGGGCAG ATCAGCGATT CATTCGGTGG ACCGCTCAGC
CTGCTTACCC TGCTGAAGGA GGAACTTCTC GGGATCAGGG ATGCCTCTGA ATCATCAGAG
GAGTCGCTGT CAGATACGGC AAATGCACTG GCTGAAAATG CCCGGAATGC CGGTGAGCTG
GGACGATTTA TGTCGGTGGC CCGTGTGGCG GCAGGTGGCG GGGTTGCCGT ACTGGCCGCG
CTTGCTGCCG CCGCCTGGCA GGCAGAGCAG GCTGACCGGG CCTTATTGCG TTCACTGACC
CTGACCGGAG GGGCTGCTGC CACCACAACG GCAGAATTGT GGAAAATGGC CGGGGTGATC
AGCGATGAAG CCGGTGGTGG TATCAGACAG GCGGCAGAAA ATCTGGCCCG TCTGGCAGAA
AGCGGGAAAT ATACCGCCGG GCAGCTACGG ATCATGGGGG AAACCTCTCA GAGATGGCTG
CAGACGGTGG GGGACGATGC CGGGAAGGTG GAAAAAGCCT TTGAAGGGAT TGCAGCAGAT
CCGGTGAAGG CGCTGGCCTC CCTGAATCAG CAGTATAACT TCCTGAGCGT TTCCCAGTTA
CGCCATATTG ATGAGCTTGA GCGCACGAAA GGTAAACAGG CTGCGGTGAC GGAGGCGATG
TCCCTGTTTG CGGATGTCAT GAATGCACGT CTGGAGCAAC TTGATAAAGC GGCCACGCCG
GTGGAAAAAA TCTGGGACGA TGTTAAAACC TGGACTTCTG ACGCATGGGC ATGGATAGGT
GATCATACAC TGGGGGCACT CAGTCTGATC ACTGACGTGG TGGCAGGAAC CGTTGAACAG
GTGAAGCTGC TGCTTGTGCA GGGGGATCTG GCGCTGGCTG AATTTATTCA GTCAGCCTGG
GAAACGACAA AGAATGTGCC CGGCGTTGGT GCGTTGTTTG GTGAACTGGC AGAAGAGAAC
CGCGTATTTA TTGAGAAAAC AAAACGTGAT GAACTGGCGC TGAGAAAATC CATTGCGGAA
CGGGATGCGC GTATACGCCA GGGGGAAATG GGGTACATCA ACCGTTCGCG TGCAACAGGC
GTCAGCAAAG GTCCGGGGCA GCAGGAAGCC GTCAGCCGTC TGGCTGAAGA GCTGACAGGT
AAAAAGCATA CATCACCGAA AACGCGCTCT GCCGGGGAGA GGGAAGAGGA GCAGGCAAGA
GAGGCTCTGC TTGCCCTTGA AGCTGAGCTC AGGACGCTGG AAAAACACAG CGGTGCGAAT
GAGAAAATCA GCCGGCAGCG CCGTGATTTA TGGAAGGCGG AAAGTCAGTA TGCGGTCCTG
AAAGAGGCTG CCACGAAACG ACAGTTATCT GAGCAGGAAA AATCCCTGCT GGCGCATAAA
GACGAGACGC TGGAGTACAA ACGCCAGCTG GCTGAGCTGG GCGACAAGGT TGAATACCAG
AAACGCCTGA ATGAGCTGGC ACAGCAGGCG GTGCGGTTTG AAGAGCAGCA GAGCGCGAAG
CAGGCCGCCA TCAGCGCAAA AGCCCGCGGT CTCACTGACC GTCAGGCGCA GCGGGAGTCT
GAAGCGCAGC GTCTTCGGGA CGTGTACGGT GATAATCCGG CTGCGCTGGC GAAGGCCACA
TCGGCACTGA AGAACACCTG GTCTGCGGAG GAGCAGCTTC GTGGAAGCTG GATGGCCGGG
CTGAAGTCCG GCTGGGGCGA GTGGGCGGAA AGTGCGACGG ACAGTTTTTC GCAGGTTAAA
AGTGCTGCCA CGCAGACCTT TGACGGTATT GCACAGAATA TGGCGGCGAT GCTGACCGGT
GCAGAGGCAG ACTGGCGGGG ATTCACCCGT TCGGTGCTGT CCATGATGAC AGAAATCCTG
CTTAAACAGG CCATGGTGGG CATTGTCGGG CGTATCGGCA GCGCCATTGG CGGTGCTTTC
GGTGGTGGTG CGTCTGCCTC CACGGGGACG GCCATTCAGG CTGCGGCGGC GAACTTCCAT
TTCGCGACCG GGGGATTTAC GGGGACGGGG GGTAAATATG AACCTGCGGG AATTGTTCAT
CGCGGGGAGT TTGTCTTCAC GAAGGAGGCA ACCAGCCGGA TTGGCGTCGG CAACCTGTAC
CGCCTGATGC GGGGCTATGC GGAAGGGGGT TATGTGGGCG GTGCCGGAAG TCCGGCGCAG
ATGCGGCGGA CGGAAGGCAT TAATTTTAAT CAGAACAATC ACGTGGTGAT TCAGAACGAC
GGCACCAACG GACAGGCGGG GCCGCAGCTG ATGAAGGCGG TGTATGACAT GGCCCGCAAG
GGGGCGCAGG ATGAGCTCCG GCTGCAGTTG CGTGATGGCG GTATGTTATC GGGGAGCAGG
CGATGA
 
Protein sequence
MAGNFADLTA VLTLDSARFS EEAARVKKEL GETSALADLM SGKVSQSFRK QADAAEQSLS 
RQALAAQKAG ISVGQYKAAM RTLPAQFTDI VTQLAGGQNP FLIMLQQGGQ ISDSFGGPLS
LLTLLKEELL GIRDASESSE ESLSDTANAL AENARNAGEL GRFMSVARVA AGGGVAVLAA
LAAAAWQAEQ ADRALLRSLT LTGGAAATTT AELWKMAGVI SDEAGGGIRQ AAENLARLAE
SGKYTAGQLR IMGETSQRWL QTVGDDAGKV EKAFEGIAAD PVKALASLNQ QYNFLSVSQL
RHIDELERTK GKQAAVTEAM SLFADVMNAR LEQLDKAATP VEKIWDDVKT WTSDAWAWIG
DHTLGALSLI TDVVAGTVEQ VKLLLVQGDL ALAEFIQSAW ETTKNVPGVG ALFGELAEEN
RVFIEKTKRD ELALRKSIAE RDARIRQGEM GYINRSRATG VSKGPGQQEA VSRLAEELTG
KKHTSPKTRS AGEREEEQAR EALLALEAEL RTLEKHSGAN EKISRQRRDL WKAESQYAVL
KEAATKRQLS EQEKSLLAHK DETLEYKRQL AELGDKVEYQ KRLNELAQQA VRFEEQQSAK
QAAISAKARG LTDRQAQRES EAQRLRDVYG DNPAALAKAT SALKNTWSAE EQLRGSWMAG
LKSGWGEWAE SATDSFSQVK SAATQTFDGI AQNMAAMLTG AEADWRGFTR SVLSMMTEIL
LKQAMVGIVG RIGSAIGGAF GGGASASTGT AIQAAAANFH FATGGFTGTG GKYEPAGIVH
RGEFVFTKEA TSRIGVGNLY RLMRGYAEGG YVGGAGSPAQ MRRTEGINFN QNNHVVIQND
GTNGQAGPQL MKAVYDMARK GAQDELRLQL RDGGMLSGSR R