Gene ECH74115_2750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2750 
Symbol 
ID6969147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2571627 
End bp2572631 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content53% 
IMG OID643386605 
Productputative sulfite oxidase subunit YedY 
Protein accessionYP_002271084 
Protein GI209397989 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000229765 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.192354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA ATCAATTTTT AAAAGAATCA GATGTTACGG CCGAGTCGGT ATTCTTTATG 
AAGCGTCGGC AGGTGTTAAA AGCACTGGGC ATCAGCGCAG CTGCACTTTC TTTGCCTCAC
GCTGCGCATG CCGATCTGCT TAGCTGGTTT AAAGGGAACG ATCGCCCACC CGCCCCCGCC
GGAAAAGCGC TGGAGTTCAG CAAGCCTGCC GCCTGGCAAA ATAACCTGCC ACTGACGCCA
GCAGATAAAG TCTCCGGTTA TAACAACTTC TATGAATTCG GGCTGGATAA AGCCGATCCC
GCCGCTAATG CTGGTAGCCT GAAAACCGAT CCATGGACAC TGAAAATCAG CGGCGAAGTG
GCAAAACCAT TGACCCTCGA TCACGATGAT TTAACCCGTC GCTTCCCGCT GGAAGAGCGT
ATTTATCGTA TGCGCTGCGT GGAAGCGTGG TCGATGGTGG TGCCGTGGAT TGGTTTTCCG
CTGCACAAAT TGCTGGCGCT TGCCGAACCC ACCAGCAATG CGAAGTATGT CGCTTTCGAA
ACAATTTATG CACCGGAACA GATGCCAGGC CAGCAGGACC GCTTTATCGG CGGCGGGCTG
AAATATCCTT ATGTCGAAGG ATTGCGTCTC GACGAAGCAA TGCATCCGCT CACACTGATG
ACCGTAGGTG TTTATGGCAA GGCGTTACCG CCACAAAATG GCGCGCCGGT GCGACTGATT
GTGCCGTGGA AATATGGCTT TAAAGGGATT AAATCGATCG TCAGTATTAA GCTGACCCGC
GAGCGTCCGC CAACCACCTG GAATCTGGCA GCGCCTGACG AATACGGTTT TTACGCCAAC
GTTAATCCGC ATGTTGATCA CCCACGCTGG TCACAGGCTA CCGAACGATT TATTGGTTCA
GGCGGCATCC TCGATGTACA GCGCCAGCCA ACGCTACTGT TTAATGGTTA CGCCGACCAG
GTGGCATCGC TGTATCGTGG CCTGGATTTG CGGGAGAATT TCTGA
 
Protein sequence
MKKNQFLKES DVTAESVFFM KRRQVLKALG ISAAALSLPH AAHADLLSWF KGNDRPPAPA 
GKALEFSKPA AWQNNLPLTP ADKVSGYNNF YEFGLDKADP AANAGSLKTD PWTLKISGEV
AKPLTLDHDD LTRRFPLEER IYRMRCVEAW SMVVPWIGFP LHKLLALAEP TSNAKYVAFE
TIYAPEQMPG QQDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPVRLI
VPWKYGFKGI KSIVSIKLTR ERPPTTWNLA APDEYGFYAN VNPHVDHPRW SQATERFIGS
GGILDVQRQP TLLFNGYADQ VASLYRGLDL RENF