Gene ECH74115_1476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1476 
Symbol 
ID6967404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1456792 
End bp1457814 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content50% 
IMG OID643385447 
Producthypothetical protein 
Protein accessionYP_002269941 
Protein GI209399164 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000304616 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00000000000128745 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAG TGTTATTGAT AATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT 
GTGGGCGTCT GGAAGGTTCG CTATCTTGCC GACAGCAAAT TGCTTATCAA AGAAGAGACG
ATATTTACCC TGAAGCCAGG GACCGGACGT CTGGCGCTTG GTGAACAGCT TTATGCCGAT
AAGATCATCA ATCGCCCACG GGTTTTTCAA TGGTTGCTGC GTATCGAACC GGATCTTTCT
CACTTTAAAG CCGGGACTTA CCGCTTTACA CCGCAGATGA CCGTGCGCGA GATGCTGAAA
TTGTTGGAAA GCGGTAAAGA AGCACAGTTC CCTCTGCGAC TGGTAGAAGG GATGCGTCTG
AGCGACTACC TCAAGCAATT GCGTGAGGCC CCGTATATCA AGCACACGCT GAGCGATGAT
AAGTACATCA CCGTAGCGCA GGCACTTGAA CTGGAAAACC CGGAGTGGAT TGAAGGTTGG
TTCTGGCCAG ACACCTGGAT GTATACCGCC AATACCACCG ATGTCGCGTT ACTCAAGCGA
GCGCACAAGA AAATGGTGAA AGCGGTCGAT AGCGCCTGGG AAGGGCGTGC GGACGGTCTG
CCTTATAAAG ATAAAAATCA GCTGGTGACG ATGGCATCAA TTATCGAAAA AGAAACCGCC
ATTGCCAGTG AACGCGATCA GGTTGCCTCG GTATTTATCA ACCGTTTACG CATTGGTATG
CGCCTGCAGA CCGACCCAAC CGTGATTTAC GGGATGGGAG AGCGTTATAA TGGCAAACTT
TCTCGTGCAG ACCTGGAAAC GCCGACAGCG TATAACACCT ATACCATTAC CGGTTTGCCG
CCGGGTGCGA TAGCTACGCC GGGGGCGGAT TCGCTGAAGG CTGCTGCGCA TCCGGCAAAA
ACGCCGTATC TCTATTTTGT GGCCGATGGT AAAGGTGGTC ACACGTTTAA TACCAATCTT
GCCAGTCATA ACAAGTCTGT GCAGGATTAT CTGAAAGTGC TTAAGGAAAA AAATGCGCAG
TAA
 
Protein sequence
MKKVLLIILL LLVVLGIAAG VGVWKVRYLA DSKLLIKEET IFTLKPGTGR LALGEQLYAD 
KIINRPRVFQ WLLRIEPDLS HFKAGTYRFT PQMTVREMLK LLESGKEAQF PLRLVEGMRL
SDYLKQLREA PYIKHTLSDD KYITVAQALE LENPEWIEGW FWPDTWMYTA NTTDVALLKR
AHKKMVKAVD SAWEGRADGL PYKDKNQLVT MASIIEKETA IASERDQVAS VFINRLRIGM
RLQTDPTVIY GMGERYNGKL SRADLETPTA YNTYTITGLP PGAIATPGAD SLKAAAHPAK
TPYLYFVADG KGGHTFNTNL ASHNKSVQDY LKVLKEKNAQ