Gene ECH74115_5181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5181 
SymbolyieM 
ID6970618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4824691 
End bp4826142 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content53% 
IMG OID643388847 
Producthypothetical protein 
Protein accessionYP_002273273 
Protein GI209397934 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.108838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.131102 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAACGC TGGATACGCT TAATGTGATG CTGGCCGTCA GCGAAGAGGG ATTGATCGAA 
GAGATGATCA TCGCGCTGCT GGCCTCACCG CAGCTGGCAG TCTTCTTTGA AAAATTCCCA
CGACTGAAGG CGGCAATCAC TGATGATGTT CCCCGCTGGC GTGAGGCGCT GCGCAGTCGG
CTGAAAGATG CCCGAGTCCC GCCGGAACTC ACCGAAGAGG TGATGTGCTA TCAGCAAAGC
CAGCTCCTCT CCACGCCGCA GTTTATTGTG CAGCTACCAC AGATCCTGGA CTTACTGCAT
CGTCTGAATT CCCCATGGGC AGAACAAGCC CGACAGTTGG TCGATGCTAA CAGCACGATC
ACTTCAGCGT TACACACGCT TTTTCTCCAG CGCTGGCGTT TAAGTCTGAT CGTGCAAGCA
ACAACGTTAA ATCAACAGCT ATTAGAAGAA GAACGCGAAC AACTGTTGAG TGAAGTTCAG
GAACGCATGA CGCTGAGCGG ACAACTTGAA CCGATTCTCG CAGATAACAA TACCGCAGCT
GGTCGTCTGT GGGATATGAG CGCCGGTCAG CTTAAACGTG GCGACTATCA GTTGATTGTG
AAATACGGTG AATTTCTTAA CGAACAGCCG GAACTGAAAC GCCTGGCAGA ACAACTGGGG
CGTTCCCGGG AAGCCAAATC AATACCGCGC AACGATGCGC AGATGGAAAC CTTCCGCACC
ATGGTGCGCG AACCGGCGAC GGTTCCTGAG CAGGTTGATG GTCTGCAACA AAGCGATGAT
ATTTTACGTC TTCTGCCGCC AGAACTGGCG ACACTAGGGA TAACGGAACT GGAGTATGAG
TTTTACCGTC GGCTGGTGGA AAAACAGTTG CTCACCTATC GCCTGCACGG TGAGTCGTGG
CGTGAAAAAG TGATCGAACG CCCGGTGGTG CATAAAGATT ACGACGAACA GCCGCGCGGA
CCGTTTATTG TCTGCGTGGA TACTTCCGGC TCAATGGGCG GCTTTAATGA ACAGTGTGCG
AAAGCGTTCT GCCTGGCTTT GATGCGCATT GCTCTCGCTG AAAACCGGCG CTGTTATATT
ATGCTATTTT CCACCGAGAT CGTCCGTTAT GAGCTTTCAG GCCCACAAGG CATCGAACAA
GCAATCCGTT TTTTAAGCCA GCAGTTTCGT GGCGGCACCG ATCTTGCCAG TTGTTTTCGC
GCCATTATGG AACGCTTGCA AAGCAGGGAA TGGTTTGATG CCGATGCGGT GGTGATTTCT
GATTTTATCG CCCAGCGGTT GCCTGACGAC GTGACGAGTA AAGTGAAAGA GCTGCAGCGG
GTACATCAGC ATCGCTTTCA TGCCGTGGCG ATGTCGGCAC ACGGCAAACC CGGCATCATG
CGCATTTTCG ATCATATCTG GCGCTTTGAT ACCGGGATGC GAAGCCGCCT GCTCAGACGC
TGGCGGCGAT AA
 
Protein sequence
MLTLDTLNVM LAVSEEGLIE EMIIALLASP QLAVFFEKFP RLKAAITDDV PRWREALRSR 
LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILDLLH RLNSPWAEQA RQLVDANSTI
TSALHTLFLQ RWRLSLIVQA TTLNQQLLEE EREQLLSEVQ ERMTLSGQLE PILADNNTAA
GRLWDMSAGQ LKRGDYQLIV KYGEFLNEQP ELKRLAEQLG RSREAKSIPR NDAQMETFRT
MVREPATVPE QVDGLQQSDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGESW
REKVIERPVV HKDYDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRI ALAENRRCYI
MLFSTEIVRY ELSGPQGIEQ AIRFLSQQFR GGTDLASCFR AIMERLQSRE WFDADAVVIS
DFIAQRLPDD VTSKVKELQR VHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR
WRR