Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2426 |
Symbol | |
ID | 6147313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2473374 |
End bp | 2475140 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641617298 |
Product | von Willebrand factor type A domain-containing protein |
Protein accession | YP_001744470 |
Protein GI | 170681089 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAATA AAAATATAAT CATGTTGCTT ATGAGTAGTT TGATTTTGTC AGGATGTGGG CCGGAATCTG AGAATAAGGA AAGTCTGCAA CAACAACCCA GTACTCCCAC GGACCAGCAA GTGCTTGCCG CGCAACATGC TGCAATAAAA GAGGCCGAGC AAAGATCCGT CGCCGCTAAA GCCACTGCAG ACGCGAAAGC AAAAGCCTTA GCCCAGCAAG AAGCGCAACA ATATTCAGAC AAACAGGCTT TGCAGGGGCG CTTACAGGCG GCTCCAAAAT ATCAACATGC AGCCAGAGAA AAAGCAGCCT CCCAAATCGC AAATCCAGGA ACCGCTCGCT ATCAGCAGTT CGATGATAAT CCGGTTAAGC AGGTAGCGCA AAACCCGTTG GCGACATTTA GCCTTGACGT CGACACCGGC AGTTATGCGA ATGTACGCCG TTTTCTCAAT CATGGGCAGT TGCCTCCGCC AGATGCAGTG CGGGTGGAGG AGATGGTGAA TTATTTTCCG TCAGACTGGG TTATTAATGA TAAAAGTAAT AATAAAGAAC CTGTTCCCGC CAGTAAGCCA ATACCTTTCG CTATGCGCTA CGAACTGGCA CCTGCGCCGT GGAATGAACA GCGAACATTG CTGAAAGTTG ATATCCTGGC GAAAGATCGC AAAAGTGAAG AGTTACCAGC TTCTAATCTG GTCTTTCTTA TCGACACTTC TGGTTCAATG ATTTCTGATG AACGTTTGCC ACTTATCCAG TCTTCGTTGA AATTATTGGT CAAAGAACTT CGTGAGCAGG ATAACATTGC CATCGTGACC TATGCTGGCG ACTCCCGTAT TGCGTTGCCT TCTATCTCCG GGAGTCACAA GGCGGAAATT AATGCCGCAA TTGATTCGCT GGATGCCGAT GGCAGTACCA ATGGCGGTGC CGGGCTGGAA CTGGCTTATC AGCAGGCGGC GAAGGGGTTT ATTAAGGGTG GCATCAATCG CATTTTATTG GCCACTGATG GTGATTTTAA CGTCGGCATT GACGATCCTA AATCGATTGA ATCAATGGTC AAAAAACAGC GGGAGTCCGG TGTTTCTCTG TCCACGTTTG GCGTGGGGGA TAGCAATTAC AACGAGGCAA TGATGGTGCG AATCGCCGAT GTTGGCAACG GTAACTACAG CTACATTGAT ACCCTCGCTG AAGCACAGAA AGTATTGAAC AGTGAAATGC GGCAGACGTT GATTAGCGTA GCAAAAGATG TCAAAGCGCA AATTGAGTTT AACCCCGCGT GGGTAACGGA ATACCGTCAG ATTGGTTATG AAAAGCGTCA ACTTCGGGCG GAAGATTTTA ATAACGACAA CGTTGATGCG GGTGATATTG GCGCAGGTAA ACATATAACG TTGTTATTCG AATTAACGCT GAAAGGGCAA AAAGCATCAA TTGATAAGTT ACGCTATGCC CCGGATAACA AATCAGCGAA ATCGGACAAA ACAAAAGAAC TTGCCTGGTT AAAGATTCGT TGGAAATACC CGCAGGGAAA AGAAAGTCAG TTAGTTGAAT TCCCGCTGGG GCCAACAATA AACGCGCCCT CTGAAGATAT GCGTTTTCGC GCAGCAGTAG CTGCATATGG GCAAAAGTTA CGCGGTTCTG AATACCTGGA TGAAATCTCA TGGCAGCAGA TAAAACAGTG GGCTCAGCAG GCAAAAGGGG AAGATCCACA GGGTTACAGG GCGGAATTTA TTCGCCTGGT TGAACTGGCG GGTGGTGTGA CTGACATCAG TCAGTGA
|
Protein sequence | MRNKNIIMLL MSSLILSGCG PESENKESLQ QQPSTPTDQQ VLAAQHAAIK EAEQRSVAAK ATADAKAKAL AQQEAQQYSD KQALQGRLQA APKYQHAARE KAASQIANPG TARYQQFDDN PVKQVAQNPL ATFSLDVDTG SYANVRRFLN HGQLPPPDAV RVEEMVNYFP SDWVINDKSN NKEPVPASKP IPFAMRYELA PAPWNEQRTL LKVDILAKDR KSEELPASNL VFLIDTSGSM ISDERLPLIQ SSLKLLVKEL REQDNIAIVT YAGDSRIALP SISGSHKAEI NAAIDSLDAD GSTNGGAGLE LAYQQAAKGF IKGGINRILL ATDGDFNVGI DDPKSIESMV KKQRESGVSL STFGVGDSNY NEAMMVRIAD VGNGNYSYID TLAEAQKVLN SEMRQTLISV AKDVKAQIEF NPAWVTEYRQ IGYEKRQLRA EDFNNDNVDA GDIGAGKHIT LLFELTLKGQ KASIDKLRYA PDNKSAKSDK TKELAWLKIR WKYPQGKESQ LVEFPLGPTI NAPSEDMRFR AAVAAYGQKL RGSEYLDEIS WQQIKQWAQQ AKGEDPQGYR AEFIRLVELA GGVTDISQ
|
| |