Gene ECH74115_2209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2209 
Symbol 
ID6969174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2105350 
End bp2106393 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content54% 
IMG OID643386100 
Producthypothetical protein 
Protein accessionYP_002270587 
Protein GI209398826 
COG category[R] General function prediction only 
COG ID[COG5529] Pyocin large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00285391 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value5.0069399999999995e-21 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCTAATG CCTGGCTCAG ATTGTGGCAT GACATGCCAA ATGACCCCAA GTGGCGAACG 
ATTGCCAGGG TATCAGGACA GCCAATCGCA ACAGTGATGG CAGTGTATAT CCATCTTCTG
GTGAGCGCGT CACGAAATGT CACGACATGT CACGGCGTGT CACTACGTGG TCACATTGAT
GTCACGACGG AAGATTTAGC AAGTGCGCTT GATGTGACGG AAGACGTAAT TGATTCAATT
TTGCATGCAA TGCAGGGGCG GGTTCTGGAT GGTGACCTTA TTTCCGGATG GGAAAAACGT
CAGGTGCTGA AAGAGGACAA TGGTAACGTT TCGCAAACGG CAAAATCCCC GGCAGAGCGC
AAGAGAGCGC AGCGGGAGCG GGAAAAGCTG CGGAAACATA ATGCTGATTG TCACGATGAG
TCACGACGTG TCACGCATCT GTCACGACAA GTCACGACAG ATAAAGATAC AGATAAAGAT
ACAGATACAG AATTAAACCC CACACATAAC GCGCGCGAGA GTATTCCGAC CAGTGAGTCG
AATGGTGCGC CGTTGCAGAC AGCCGAACCT GAATACCTGG ACGGCCTGAG CGAACCGATC
GGGAAATTTT CGATGACTAC TGTCTGGCAG CCGTCGCCGG ATTTTCGACA ACGGGCAGCA
GTGTGGGGTA TGGCTCTGCC TGAGCCGGAA TTTACACCTG CTGAGCTTGC CGCATTCCGG
GATTACTGGA TGGCGGAGGG GAAGGTTTTC ACGCAGGTTC AGTGGGAGCA GAAATTTGCC
CGCCACGTGC AGCACGTCAG GGCACAGGTA AAACCAGTCA GCAAGGGGGG AAGCCATGCA
GCATCAGGTG GCACGGCATC ACGGGCAGTT CAGGAAATCC GGGCTGCACG CGAACAGTGG
GAACGTGACA ACGGATTTAT CAGCAACGGA AACGGCCTGG AAGCTGTGGG AGCTCATGGG
GGAGGTGTAT TCGAACCGCT GGACTCAGAA GAACGGGGCC GCACCTTCGA AGCTCTGGAT
TGCCCAGATT GGTGCGATGA CTGA
 
Protein sequence
MANAWLRLWH DMPNDPKWRT IARVSGQPIA TVMAVYIHLL VSASRNVTTC HGVSLRGHID 
VTTEDLASAL DVTEDVIDSI LHAMQGRVLD GDLISGWEKR QVLKEDNGNV SQTAKSPAER
KRAQREREKL RKHNADCHDE SRRVTHLSRQ VTTDKDTDKD TDTELNPTHN ARESIPTSES
NGAPLQTAEP EYLDGLSEPI GKFSMTTVWQ PSPDFRQRAA VWGMALPEPE FTPAELAAFR
DYWMAEGKVF TQVQWEQKFA RHVQHVRAQV KPVSKGGSHA ASGGTASRAV QEIRAAREQW
ERDNGFISNG NGLEAVGAHG GGVFEPLDSE ERGRTFEALD CPDWCDD