Gene Information |
Locus tag | EcSMS35_0923 |
Symbol | |
ID | 6142854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 931619 |
End bp | 932755 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615811 |
Product | von Willebrand factor type A domain-containing protein |
Protein accession | YP_001743003 |
Protein GI | 170683571 |
COG category | |
COG ID | |
TIGRFAM ID | |
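The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 931619, End bp 932755.
length_bp = gene_length(931619, 932755)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1137 378
```

Under these assumptions the computed values agree with the listed gene length (1137 bp) and protein length (378 aa).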
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAC TGAACGATCT TCTGACCACC CGTGAGCTAC AACGCTGGCG ATTAATTCTT GGCGAAGCGG CAGAAACGAC CCTTTGTGGG CTGGATGACA ATGCCCGGCA GATAGACCAC GCGCTGGAGT GGCTGTACGG GCGCGATCCT GAACGGCTCC AGCGTGGTGA ACGCTCTGGT GGATTAGGTG GCTCAAATCT CACCCCCCCT GAGTGGATCA ACAGTATTCA CACGCTGTTT CCGCAACAGG TGATTGAGCG ACTGGAAAGC GATGCCGTAC TGCGCTACGG CATTGAAGAT GTGGTGACGA ATCTCGACGT GCTGGAACGT ATGCAGCCTT CGGAAAGCCT GCTACGCGCC GTTTTGCACA CCAAACATCT GATGAATCCC GAAGTGCTGG CTGCCGCCCG CCGGATAGTG CACCAGGTTG TTGAAGAAAT TATGGCTCGA CTGGCAAAGG AAGTTCGTCA GGCTTTTTCT GGAGTACGCG ATCGCCGTCG CCGCTCATTT ATTTCACTGG TGCGAAACTT TGATTTCAAA AGTACTCTGC GCGCCAACCT GCAACACTGG CACCCGCAAC ACGGCAAGTT GTATATCGAA TCCCCCCGCT TTAACAGCCG CATTAAACGC CAAAGCGAAC AATGGCAACT GGTCTTACTG GTTGATCAAA GCGGATCGAT GGTTGACTCG GTGATCCACT CTGCGGTGAT GGCGGCCTGT TTATGGCAGT TACCCGGCAT TCGTACCCAT CTGGTGGCGT TTGACACCAG CGTCGTTGAT CTCACGGCAG ACGTTGCTGA TCCGGTAGAG TTATTAATGA AAGTACAACT GGGCGGCGGG ACCAATATCG CCAGTGCCGT GGAGTATGGT CGGCAACTTA TTGAACAACC AGCAAAAAGC GTCATTATCC TCGTGAGCGA TTTCTATGAA GGGGGTTCAT CATCATTGCT GACGCATCAG GTGAAAAAGT GTGTCCAGAG CGGCGTCAAA GTGCTGGGAC TGGCGGCGCT CGATAGCACC GCAACACCTT GCTATGACCG CGATATGGCC CAGGCGCTGG TTAATGTCGG CGCACAAATA GCTGCCATGA CACCGGGCGA ACTGGCGGCC TGGCTTGCGG AGAATCTTCA GTCATGA
|
Protein sequence | MSELNDLLTT RELQRWRLIL GEAAETTLCG LDDNARQIDH ALEWLYGRDP ERLQRGERSG GLGGSNLTPP EWINSIHTLF PQQVIERLES DAVLRYGIED VVTNLDVLER MQPSESLLRA VLHTKHLMNP EVLAAARRIV HQVVEEIMAR LAKEVRQAFS GVRDRRRRSF ISLVRNFDFK STLRANLQHW HPQHGKLYIE SPRFNSRIKR QSEQWQLVLL VDQSGSMVDS VIHSAVMAAC LWQLPGIRTH LVAFDTSVVD LTADVADPVE LLMKVQLGGG TNIASAVEYG RQLIEQPAKS VIILVSDFYE GGSSSLLTHQ VKKCVQSGVK VLGLAALDST ATPCYDRDMA QALVNVGAQI AAMTPGELAA WLAENLQS
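The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a sketch under stated assumptions: the gene sequence is taken to be listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and the codon table is built by hand rather than taken from a library. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the start codons it permits, which does not matter here because the sequence starts with ATG. The names `translate` and `gc_percent` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTCTGAAC TGAACGATCT TCTGACCACC"))  # MSELNDLLTT
```

Passing the full gene sequence to `translate` should give a string to compare against the listed protein sequence, and `gc_percent` on the same input can be compared with the GC content field.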
|
| |