Gene ECH74115_3108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3108 
Symbol 
ID6967339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2884452 
End bp2885588 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content54% 
IMG OID643386934 
Productvon Willebrand factor type A domain protein 
Protein accessionYP_002271402 
Protein GI209396016 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.708549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.744851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAC TGAACGATCT TCTGACCACC CGTGAGCTAC AACGCTGGCG ATTAATTCTT 
GGCGAAGCGG CAGAAACGAC GCTTTGTGGG CTGGATGACA ACGCCCGGCA GATAGACCAC
GCGCTGGAGT GGCTGTATGG GCGCGATCCT GAACGGCTCC AGCGTGGTGA ACGCTCCGGT
GGATTAGGTG GCTCAAATCT CACCCCCCCT GAGTGGATCA ACAGTATTCA CACGCTGTTT
CCGCAACAGG TGATTGAGCG GCTGGAAAGC GATGCCGTAC TGCGCTACGG CATTGAAGAT
GTGGTGACGA ATCTCGACGT GCTGGAACGT ATGCAGCCTT CTGAAAGCCT GCTACGCGCC
GTTTTGCACA CCAAACATCT GATGAACCCC GAAGTACTGG CTGCCGCCCG CCGGATAGTG
CGCCAGGTTG TTGAAGAAAT TATGGCTCGA CTGGCAAAGG AAGTTCGTCA GGCTTTTTCT
GGTGTCCGCG ATCGCCGTCG CCGCTCATCT ATTCCACTGG CGCGAGACTT TGATTTCAAA
AGTACTCTTC GCGCCAATCT GCAACACTGG CACCCGCAAC ACGGCAAGTT GTATATCGAA
TCCCCCCGCT TTAACAGCCG AATTAAGCGC CACAGTGAAC AATGGCAACT GGTCTTACTG
GTTGATCTAA GCGGATCGAT GGTCGATTCG GTGATCCACT CTGCGGTAAT GGCGGCCTGT
TTGTGGCAGT TACCCGGCAT TCGTACCCAT CTGGTGGCGT TTGACACCAG TGTCGTTGAT
CTCACGGCAG ACGTTGCCGA TCCGGTAGAG TTATTAATGA AAGTACAGTT GGGCGGCGGG
ACCAATATCG CCAGTGCCGT GGAGTATGGG CGGCAACTTA TTGAACAACC AGCAAAAAGC
GTCATTATCC TCGTGAGTGA TTTTTATGAA GGGGGTTCAT CATCATTGCT GACGCATCAG
GTGAAAAAGT GTGTCCAGAG CGGCATCAAA GTGCTGGGGC TGGCAGCGCT CGACAGCACC
GCAACGCCTT GCTATGACCG CGATATGGCC CAGGCGCTGG TTAATGTTGG CGCACAAATA
GCCGCAATGA CACCGGGTGA ACTGGCTACC TGGCTTGCGG AGAATTTGCA GTCATGA
 
Protein sequence
MSELNDLLTT RELQRWRLIL GEAAETTLCG LDDNARQIDH ALEWLYGRDP ERLQRGERSG 
GLGGSNLTPP EWINSIHTLF PQQVIERLES DAVLRYGIED VVTNLDVLER MQPSESLLRA
VLHTKHLMNP EVLAAARRIV RQVVEEIMAR LAKEVRQAFS GVRDRRRRSS IPLARDFDFK
STLRANLQHW HPQHGKLYIE SPRFNSRIKR HSEQWQLVLL VDLSGSMVDS VIHSAVMAAC
LWQLPGIRTH LVAFDTSVVD LTADVADPVE LLMKVQLGGG TNIASAVEYG RQLIEQPAKS
VIILVSDFYE GGSSSLLTHQ VKKCVQSGIK VLGLAALDST ATPCYDRDMA QALVNVGAQI
AAMTPGELAT WLAENLQS