Gene ECH74115_1796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1796 
Symbol 
ID6972210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1711617 
End bp1714262 
Gene Length2646 bp 
Protein Length881 aa 
Translation table11 
GC content58% 
IMG OID643385742 
Productputative prophage tail length tape measure protein 
Protein accessionYP_002270232 
Protein GI209395830 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.132418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGTA ATTTTGCCGA TCTGACAGCC GTGCTGACAC TGGATTCTGC CCGTTTTTCT 
GAAGAGGCAG CGCGGGTAAA AAAAGAGCTG GGTGAAACCA GTGCGCTTGC TGATTTGATG
TCCGGGAAAG TCAGTCAGTC TTTCAGAAAA CAGGCTGATG CTGCTGAGCA GAGTCTGAGC
CGACAGGCGC TGGCTGCACA AAAAGCCGGG ATATCAGTCG GACAGTATAA GGCTGCCATG
CGCACACTGC CCGCACAGTT CACGGATATT GTCACTCAGC TTGCCGGTGG TCAGAATCCC
TTCCTTATCA TGCTGCAGCA GGGGGGGCAG ATCAGCGATT CATTCGGTGG ACCGCTCAGC
CTGCTTACCC TGCTGAAGGA GGAACTTCTC GGGATCAGGG ATGCCTCTGA ATCATCAGAG
GAGTCGCTGT CAGATACGGC AAATGCACTG GCTGAAAATG CCCGGAATGC CGGTGAGCTG
GGACGATTTA TGTCGGTGGC CCGTGTGGCG GCAGGTGGCG GGGTTGCCGT ACTGGCCGCG
CTTGCTGCCG CCGCCTGGCA GGCAGAGCAG GCTGACCGGG CCTTATTGCG TTCACTGACC
CTGACCGGAG GGGCTGCTGC CACCACAACG GCAGAATTGT GGAAAATGGC CGGGGTGATC
AGCGATGAAG CCGGTGGTGG TATCAGACAG GCGGCAGAAA ATCTGGCCCG TCTGGCAGAA
AGCGGGAAAT ATACCGCCGG GCAGCTACGG ATCATGGGGG AAACCTCTCA GAGATGGCTG
CAGACGGTGG GGGACGATGC CGGGAAGGTG GAAAAAGCCT TTGAAGGGAT TGCAGCAGAT
CCGGTGAAGG CGCTGGCCTC CCTGAATCAG CAGTATAACT TCCTGAGCGT TTCCCAGTTA
CGCCATATTG ATGAGCTTGA GCGCACGAAA GGTAAACAGG CTGCGGTGAC GGAGGCGATG
TCCCTGTTTG CGGATGTCAT GAATGCACGT CTGGAGCAAC TTGATAAAGC GGCCACGCCG
GTGGAAAAAA TCTGGGACGA TGTTAAAACC TGGACTTCTG ACGCATGGGC ATGGATAGGT
GATCATACAC TGGGGGCACT CAGTCTGATC ACTGACGTGG TGGCAGGAAC CGTTGAACAG
GTGAAGCTGC TGCTTGTGCA GGGGGATCTG GCGCTGGCTG AATTTATTCA GTCAGCCTGG
GAAACGACAA AGAATGTGCC CGGCGTTGGT GCGTTGTTTG GTGAACTGGC AGAAGAGAAC
CGCGTATTTA TTGAGAAAAC AAAACGCGAT GAACTGGCGC TGAGAAAATC CATTGCGGAA
CGGGATGCGC GTATACGCCA GGGGGAAATG GGGTACATCA ACCGCTCGCG TGCAACAGGC
GTCAGCAAAG GTCCGGGGCA GCAGGAAGCC GTCAGCCGTC TGGCTGAAGA GCTGACAGGT
AAAAAGCATA CATCACCGAA AACGCGCTCT GCCGGGGAGA GGGAAGAGGA GCAGGCAAGA
GAGGCTCTGC TTGCCCTTGA AGCTGAGCTC AGGACGCTGG AAAAACACAG CGGTGCGAAT
GAGAAAATCA GCCGGCAGCG CCGTGATTTA TGGAAAGCGG AAAATCAGTA TGCGGTCCTG
AAAGAGGCTG CCACGAAACG GCAGTTATCT GAGCAGGAAA AATCCCTGCT GGCGCATAAA
GACGAGACGC TGGAGTACAA ACGCCAGCTG GCTGAGCTGG GCGACAAGGT TGAATACCAG
AAACGCCTGA ATGAGCTGGC ACAGCAGGCG GTGCGGTTTG AAGAGCAGCA GAGCGCGAAG
CAGGCCGCCA TCAGCGCAAA AGCCCGCGGT CTCACTGACC GTCAGGCGCA GCGGGAGTCT
GAAGCGCAGC GTCTTCGGGA CGTGTACGGT GATAATCCGG CTGCGCTGGC GAAGGCCACA
TCGGCACTGA AGAACACCAG GTCTGCGGAG GAGCAGCTTC GTGGAAGCTG GATGGCCGGG
CTGAAGTCCG GCTGGGGCGA GTGGGCGGAA AGTGCGACGG ACAGTTTTTC GCAGGTTAAA
AGTGCTGCCA CGCAGACCTT TGACGGTATT GCACAGAATA TGGCGGCGAT GCTGACCGGT
GCAGAGGCAG ACTGGCGGGG ATTCACCCGT TCGGTGCTGT CCATGATGAC AGAAATCCTG
CTTAAACAGG CCATGGTGGG CATTGTCGGG CGTATCGGCA GCGCCATTGG CGGTGCTTTC
GGTGGTGGTG CATCTGCTTC CTCGGGGACG GCCATTCAGG CTGCGGCGGC GAACTTCCAT
TTCGCGACCG GAGGATTTAC GGGGACGGGC GGCAAATATG AGCCTGCGGG GATAGTTCAC
CGCGGGGAGT TTGTTTTCAC GAAAGAGGCA ACCAGCCGGA TAGGTGTGGG GAATCTTTAC
CGTCTGATGC GCGGCTATGC GGAAGGTGGT TATGTGGGTG GTGCCGGAAG TCCGGCGCAG
ATGCGGCGGA CGGAAGGCAT TAATTTTAAT CAGAACAATC ACGTGGTGAT TCAGAACGAC
GGCTCCAACG GACAGGCGGG GCCGCAGCTG ATGAAGGCGG TGTATGACAT GGCCCGCAAG
GGGGCGCAGG ATGAGATTCA GGCGCAGATG CGTGATGGCG GCGTCTTTTC CGGAGGCAGG
CGATGA
 
Protein sequence
MAGNFADLTA VLTLDSARFS EEAARVKKEL GETSALADLM SGKVSQSFRK QADAAEQSLS 
RQALAAQKAG ISVGQYKAAM RTLPAQFTDI VTQLAGGQNP FLIMLQQGGQ ISDSFGGPLS
LLTLLKEELL GIRDASESSE ESLSDTANAL AENARNAGEL GRFMSVARVA AGGGVAVLAA
LAAAAWQAEQ ADRALLRSLT LTGGAAATTT AELWKMAGVI SDEAGGGIRQ AAENLARLAE
SGKYTAGQLR IMGETSQRWL QTVGDDAGKV EKAFEGIAAD PVKALASLNQ QYNFLSVSQL
RHIDELERTK GKQAAVTEAM SLFADVMNAR LEQLDKAATP VEKIWDDVKT WTSDAWAWIG
DHTLGALSLI TDVVAGTVEQ VKLLLVQGDL ALAEFIQSAW ETTKNVPGVG ALFGELAEEN
RVFIEKTKRD ELALRKSIAE RDARIRQGEM GYINRSRATG VSKGPGQQEA VSRLAEELTG
KKHTSPKTRS AGEREEEQAR EALLALEAEL RTLEKHSGAN EKISRQRRDL WKAENQYAVL
KEAATKRQLS EQEKSLLAHK DETLEYKRQL AELGDKVEYQ KRLNELAQQA VRFEEQQSAK
QAAISAKARG LTDRQAQRES EAQRLRDVYG DNPAALAKAT SALKNTRSAE EQLRGSWMAG
LKSGWGEWAE SATDSFSQVK SAATQTFDGI AQNMAAMLTG AEADWRGFTR SVLSMMTEIL
LKQAMVGIVG RIGSAIGGAF GGGASASSGT AIQAAAANFH FATGGFTGTG GKYEPAGIVH
RGEFVFTKEA TSRIGVGNLY RLMRGYAEGG YVGGAGSPAQ MRRTEGINFN QNNHVVIQND
GSNGQAGPQL MKAVYDMARK GAQDEIQAQM RDGGVFSGGR R