Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3514 |
Symbol | |
ID | 6145618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3593732 |
End bp | 3594901 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618343 |
Product | VWA domain-containing protein |
Protein accession | YP_001745490 |
Protein GI | 170681772 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAGTA ACGAAATGGA ACGGCTGCGC CGCTGGCGTC TGATCCTTGG AGAAGATGCC GACAATGCCT GTAACGTGGC GCTGGATGCC AGAGAGTCCG CGATTGATGC CGCGCTGGCG GCGCTGTATC AGCCAGAGGG TAAACATGGA TTACGCGGCG GCACGGGTGG ATCGTCACCC AAAGTCGCTC GCTGGCTGGG TGATATTCGC CAGTATTTCC CTTCTTCTGT GGTTCAGGTG ATGCAAAAAG ACGCCTTTGA ACGTCTTAAT TTGCACAGTA TGTTGCTGGA GCCGGAGATG CTGGCAAACG TCCAGCCTGA TGTTCATCTG GTCTCCACGC TGATGTCGCT GAATGGCGTA ATTCCGGCCA AAACCAAAGA AACCGCCAGG CTTGTGGTGC GTAAAGTTGT CGAGCAATTG ATGAAACAGC TTGAAGAGCC AATGCGTAGC GCGGTAAGCG GCGCTCTTAA TCGTGCGGTA CGTAATCGTC GCCCGCGCCA CGCGGAAATT GACTGGCAGC GCACCATTCG CGCCAACCTG CGCCACTGGC AGGAAGAGTA TAAAAGTATT GTTCCTGAAA CGCTGATCGG CTACGGGCGC AAGTCTCAAC GCACGCAAAA GGAGATCATT TTATGTATCG ACCAGAGTGG GTCGATGGCC TCCTCGGTGG TCTATTCCAG TATTTTTGGC GCAGTAATGG CATCGCTCCC GGCGGTAAAA ACGCATCTGG TGGTTTTCGA CACCGCAGTC GTTGATATGA CAGAGAAGCT CGACGATCCG GTAGAGCTGC TGTTTGGCGT ACAGCTGGGT GGCGGCACCG ACATCAACCG CGCGGTGGGC TACTGCCAGT CTTTGATTCG CGATCCGCGC AATACCATTC TGGTGCTGAT TTCCGACCTC TACGAAGGCG GGGTGGAACG CAATCTGTTG CAACGCGCCA GTGAGTTGAT TCAGTCCGGC GTACAGGTGG TTACCCTGCT GGCCTTAAGC GATGAAGGTG CGCCGTTTTA CGACCGGTCA CTGGCAGGAA AACTCGCCGC CATGGGTATT CCCTCCTTTG CCTGTACCCC AGATTTATTC CCCGGCATGA TGGCCGCCGC CATTCGTAAG GAAGATGTGA ATCTGTGGGC TGCGCAAAAT GGCGTAGTGA CAGCAAGAGA AACCGCCTGA
|
Protein sequence | MSSNEMERLR RWRLILGEDA DNACNVALDA RESAIDAALA ALYQPEGKHG LRGGTGGSSP KVARWLGDIR QYFPSSVVQV MQKDAFERLN LHSMLLEPEM LANVQPDVHL VSTLMSLNGV IPAKTKETAR LVVRKVVEQL MKQLEEPMRS AVSGALNRAV RNRRPRHAEI DWQRTIRANL RHWQEEYKSI VPETLIGYGR KSQRTQKEII LCIDQSGSMA SSVVYSSIFG AVMASLPAVK THLVVFDTAV VDMTEKLDDP VELLFGVQLG GGTDINRAVG YCQSLIRDPR NTILVLISDL YEGGVERNLL QRASELIQSG VQVVTLLALS DEGAPFYDRS LAGKLAAMGI PSFACTPDLF PGMMAAAIRK EDVNLWAAQN GVVTARETA
|
| |