Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3108 |
Symbol | |
ID | 6967339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2884452 |
End bp | 2885588 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386934 |
Product | von Willebrand factor type A domain protein |
Protein accession | YP_002271402 |
Protein GI | 209396016 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.708549 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.744851 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAC TGAACGATCT TCTGACCACC CGTGAGCTAC AACGCTGGCG ATTAATTCTT GGCGAAGCGG CAGAAACGAC GCTTTGTGGG CTGGATGACA ACGCCCGGCA GATAGACCAC GCGCTGGAGT GGCTGTATGG GCGCGATCCT GAACGGCTCC AGCGTGGTGA ACGCTCCGGT GGATTAGGTG GCTCAAATCT CACCCCCCCT GAGTGGATCA ACAGTATTCA CACGCTGTTT CCGCAACAGG TGATTGAGCG GCTGGAAAGC GATGCCGTAC TGCGCTACGG CATTGAAGAT GTGGTGACGA ATCTCGACGT GCTGGAACGT ATGCAGCCTT CTGAAAGCCT GCTACGCGCC GTTTTGCACA CCAAACATCT GATGAACCCC GAAGTACTGG CTGCCGCCCG CCGGATAGTG CGCCAGGTTG TTGAAGAAAT TATGGCTCGA CTGGCAAAGG AAGTTCGTCA GGCTTTTTCT GGTGTCCGCG ATCGCCGTCG CCGCTCATCT ATTCCACTGG CGCGAGACTT TGATTTCAAA AGTACTCTTC GCGCCAATCT GCAACACTGG CACCCGCAAC ACGGCAAGTT GTATATCGAA TCCCCCCGCT TTAACAGCCG AATTAAGCGC CACAGTGAAC AATGGCAACT GGTCTTACTG GTTGATCTAA GCGGATCGAT GGTCGATTCG GTGATCCACT CTGCGGTAAT GGCGGCCTGT TTGTGGCAGT TACCCGGCAT TCGTACCCAT CTGGTGGCGT TTGACACCAG TGTCGTTGAT CTCACGGCAG ACGTTGCCGA TCCGGTAGAG TTATTAATGA AAGTACAGTT GGGCGGCGGG ACCAATATCG CCAGTGCCGT GGAGTATGGG CGGCAACTTA TTGAACAACC AGCAAAAAGC GTCATTATCC TCGTGAGTGA TTTTTATGAA GGGGGTTCAT CATCATTGCT GACGCATCAG GTGAAAAAGT GTGTCCAGAG CGGCATCAAA GTGCTGGGGC TGGCAGCGCT CGACAGCACC GCAACGCCTT GCTATGACCG CGATATGGCC CAGGCGCTGG TTAATGTTGG CGCACAAATA GCCGCAATGA CACCGGGTGA ACTGGCTACC TGGCTTGCGG AGAATTTGCA GTCATGA
|
Protein sequence | MSELNDLLTT RELQRWRLIL GEAAETTLCG LDDNARQIDH ALEWLYGRDP ERLQRGERSG GLGGSNLTPP EWINSIHTLF PQQVIERLES DAVLRYGIED VVTNLDVLER MQPSESLLRA VLHTKHLMNP EVLAAARRIV RQVVEEIMAR LAKEVRQAFS GVRDRRRRSS IPLARDFDFK STLRANLQHW HPQHGKLYIE SPRFNSRIKR HSEQWQLVLL VDLSGSMVDS VIHSAVMAAC LWQLPGIRTH LVAFDTSVVD LTADVADPVE LLMKVQLGGG TNIASAVEYG RQLIEQPAKS VIILVSDFYE GGSSSLLTHQ VKKCVQSGIK VLGLAALDST ATPCYDRDMA QALVNVGAQI AAMTPGELAT WLAENLQS
|
| |