Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2255 |
Symbol | |
ID | 5595012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2250315 |
End bp | 2251451 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921385 |
Product | von Willebrand factor type A domain-containing protein |
Protein accession | YP_001458921 |
Protein GI | 157161603 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 73 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAC TGAACGATCT TCTGACCACC CGTGAGCTAC AACGCTGGCG ATTAATTCTT GGCGAAGCGG CAGAAACGAC GCTTTGTGGG CTGGATGACA ACGCCCGGCA GATAGACCAC GCGCTGGAAT GGCTGTATGG GCGCGATCCT GAACGGCTCC AGCGTGGTGA ACGCTCCGGT GGATTAGGTG GCTCAAATCT CACCACCCCT GAGTGGATCA ACAGTATTCA CACGCTGTTT CCGCAACAGG TGATTGAGCG GCTGGAAAGC GATGCCGTAC TGCGCTATGG CATTGAAGAT GTGGTGACGA ATCTCGACGT GCTGGAACGT ATGCAGCCTT CTGAAAGCCT GCTACGCGCC GTTTTGCATA CCAAACATCT GATGAACCCC GAAGTACTGG CTACCGCCCG CCGGATAGTG CGCCAGGTTG TTGAAGAAAT TATGGCTCGA CTGGCAAAGG AAGTTCGTCA GGCTTTTTCT GGTGTCCGCG ATCGCCGTCG CCGCTCATCT ATTCCACTGG CGCGAAACTT TGATTTCAAA AGTACTCTAC GCGCCAACCT GCAACACTGG CACCCACAAC ACGGCAAGTT GTATATCGAA TCCCCCCGCT TTAACAGCCG CATTAAACGC CAAAGCGAAC AATGGCAACT GGTCTTACTG GTTGATCAAA GCGGCTCGAT GGTCGATTCG GTGATCCACT CTGCGGTGAT GGCGGCCTGT TTATGGCAGT TACCCGGCAT TCGTACCCAT CTGGTCGCTT TTGACACCAG TGTGGTTGAT CTCACGGCAG ACGTTGCCGA TCCGGTAGAG TTATTAATGA AAGTGCAATT AGGCGGCGGA ACCAATATCG CCAGTGCCGT GGAGTATGGT CGGCAACTTA TTGAACAACC AGCGAAAAGC GTCATTATCC TCGTGAGCGA TTTTTACGAA GGGGGTTCAT CATCATTACT GACGCATCAG GTGAAAAAGT GTGTCCAGAG CGGCATCAAA GTGCTGGGAC TGGCAGCGCT CGATAGCACA GCAACACCTT GCTATGACCA CGATACGGCC CAGGCGCTGG TAAATGTCGG CGCACAAATA GCCGCCATGA CACCGGGCGA GCTGGCATCA TGGCTTGCGG AGAATCTTCA GTCATGA
|
Protein sequence | MSELNDLLTT RELQRWRLIL GEAAETTLCG LDDNARQIDH ALEWLYGRDP ERLQRGERSG GLGGSNLTTP EWINSIHTLF PQQVIERLES DAVLRYGIED VVTNLDVLER MQPSESLLRA VLHTKHLMNP EVLATARRIV RQVVEEIMAR LAKEVRQAFS GVRDRRRRSS IPLARNFDFK STLRANLQHW HPQHGKLYIE SPRFNSRIKR QSEQWQLVLL VDQSGSMVDS VIHSAVMAAC LWQLPGIRTH LVAFDTSVVD LTADVADPVE LLMKVQLGGG TNIASAVEYG RQLIEQPAKS VIILVSDFYE GGSSSLLTHQ VKKCVQSGIK VLGLAALDST ATPCYDHDTA QALVNVGAQI AAMTPGELAS WLAENLQS
|
| |