Gene Information |
Locus tag | Sbal195_1139 |
Symbol | |
ID | 5752866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 1350170 |
End bp | 1352098 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641287409 |
Product | von Willebrand factor type A |
Protein accession | YP_001553575 |
Protein GI | 160874259 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
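The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based, inclusive positions and that Protein Length excludes the stop codon. The variable names are hypothetical; the values are copied from the table.

```python
# Values copied from the Gene Information table.
start_bp = 1350170
end_bp = 1352098
gene_length_bp = 1929
protein_length_aa = 642

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_length = end_bp - start_bp + 1

# An unspliced CDS is read in 3 bp codons; the last codon is the stop codon
# and does not contribute a residue (assumption about how length is reported).
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)  # 1929 0 642
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.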
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGGGCCG TTTACAGCTT TAGGTCAACA TCAAAGGCAT TTAACATGAA AACAAAATAT TCCCCAAAAA CTGTTTCAGT TTCCGATCCA ACTCTCTACC TGCAACGGGG AAACGGTATC CCATCGGCGA GCAATACGGC TGCTCTATTA CTGGTCGCAG TGAGTTTAAC TGCCTGTAGT GGTAAAGGCG CCGAAGTTGA ACATCGACAA GCCAAGCAGC AAGCCGAGCA ACGTCATCAA GTAGCGTCGC AGCGCCAAGC TGAAATGCGT GATGCTGCTA AGGTTGAGAT GGCGAGAGTG GCGGCGCCGA TGCAAATGTC TAGCAATGGG GCAGTGATGG GAATGAGCAT AGCGCCAATG CCGCGTGATT ATGCCGCGAT TCCGTTAGCG CAAAATAAAT TCGAGCAGCA AGTGCAAAAT GGCATCATGG TGGCAGGGGA GATCCCGGTA TCGACGTTTT CCATCGATGT CGATACTGGC AGTTATGCCA CTTTAAGGCG GATGCTGAGG GAAGGGCACT TACCTGAGAA AGGCACTGTC AGAGTTGAGG AAATGCTCAA TTATTTTGCC TACGATTATC CCTTACCCGC TAAAAACGCG GCACCGTTTA GCGTGACGAC AGAGCTTGCT CCCTCACCCT ATAACGATGA CATGATGCTG CTGCGGATTG GCCTTAAAGG TTATGACTTA CCTAAATCTC AGTTAGGCGC CAGCAACTTA GTCTTTTTGC TCGACGTGTC AGGCTCTATG GCGTCAGTGG ATAAATTGCC TTTACTGCAA ACGGCATTAA AGCTGCTAAC AGCGCAGTTA AGCGCGCAGG ATAAAGTCTC TATTGTGGTT TATGCAGGGG CGGCTGGTGT AGTGCTCGAT GGCGCGTCGG GGAACGACAC TCAAACTCTG AACTATGCGC TAGAGCAATT GAGTGCCGGT GGTTCAACCA ACGGTGGTCA AGGGATCACG CAAGCCTATC AATTGGCCAA AAAGCATTTT ATCCCCAATG GCATTAATCG AGTCATCCTT GCGACCGATG GCGATTTCAA TGTTGGCGTG ACAGATTTTG ATGATTTGAT TGCCTTGATT GAAAAGGAAA AAGATCATGG CATTGGCCTA ACGACCTTAG GGTTTGGTTT GGGCAATTAT AACGATCAAC TGATGGAGCA ATTGGCGGAC AAGGGCAATG GCAATTATGC CTATATTGAT ACGCTGAATG AAGCGCGAAA AGTGCTGGTG GACGAGTTGA GTTCGACCTT ATTTACTATC GCCAAAGATG TGAAAGTGCA GGTGGAGTTT AATCCGGCCT TAGTCTCGGA ATACCGTTTG ATTGGTTACG AGAATCGCGC CTTAGCGCGG GAGGATTTTA ATAACGATAA GGTGGATGCG GGCGAGATTG GCGCGGGTCA TACCGTTACA GCCTTGTATG AATTAAGGTA CGTTGAAGCT GGGAATAGGA TGAATGATAA ACTCAGATAT GGCGTTGATG CTCAAACGGG GAAAGAGAAA TACAGCCGTG AAGAAATTGC TTTCCTTAAA TTGAGATACA AGTTGCCAGC ACAAACGCAG AGCCAATTAC TGAGTTATCC CATCAGGTTA GATCAAAGCG TTAAACAGCT CGAGCAAGCA AGTGATGATT TTAGATTTGC CGCCGCGGTT GCAGGGTTAG GGCAATTACT GAATGGCAGT CACTATCTCC ATCAATTTGA TTATACTAAG TTAAGCTTAC TCGCACGTTC AGCATTAGGG GATGATCCCT TTGGTTATCG TCATGAGTTT GTGCAGTTAA TGGAAACCGC AGCGGCGATT 
GAGCAATCTA ATCAGCTGCC AAGTAAGAAA ACATTCGATG GCTCGGATAA ACCTTTTCCT CCACAAGATA AACTCCATGG CGAGCCAATG AGAGATAAGA GCAATCCGCG TAATGAGAGA TTACAATAG
Protein sequence | MRAVYSFRST SKAFNMKTKY SPKTVSVSDP TLYLQRGNGI PSASNTAALL LVAVSLTACS GKGAEVEHRQ AKQQAEQRHQ VASQRQAEMR DAAKVEMARV AAPMQMSSNG AVMGMSIAPM PRDYAAIPLA QNKFEQQVQN GIMVAGEIPV STFSIDVDTG SYATLRRMLR EGHLPEKGTV RVEEMLNYFA YDYPLPAKNA APFSVTTELA PSPYNDDMML LRIGLKGYDL PKSQLGASNL VFLLDVSGSM ASVDKLPLLQ TALKLLTAQL SAQDKVSIVV YAGAAGVVLD GASGNDTQTL NYALEQLSAG GSTNGGQGIT QAYQLAKKHF IPNGINRVIL ATDGDFNVGV TDFDDLIALI EKEKDHGIGL TTLGFGLGNY NDQLMEQLAD KGNGNYAYID TLNEARKVLV DELSSTLFTI AKDVKVQVEF NPALVSEYRL IGYENRALAR EDFNNDKVDA GEIGAGHTVT ALYELRYVEA GNRMNDKLRY GVDAQTGKEK YSREEIAFLK LRYKLPAQTQ SQLLSYPIRL DQSVKQLEQA SDDFRFAAAV AGLGQLLNGS HYLHQFDYTK LSLLARSALG DDPFGYRHEF VQLMETAAAI EQSNQLPSKK TFDGSDKPFP PQDKLHGEPM RDKSNPRNER LQ
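The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the database's own pipeline. It assumes the amino-acid assignments of NCBI translation table 11, written in the usual TCAG codon order, and it ignores table 11's alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 9 bases of the gene are used.

```python
# Codon table in NCBI order: first, second and third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


gene_start = "ATGCGGGCCG TTTACAGCTT TAGGTCAACA"  # first 30 bases of the gene
gene_end = "TTACAATAG"                           # last 9 bases of the gene

print(translate(gene_start))  # MRAVYSFRST
print(translate(gene_end))    # LQ
```

The two outputs match the first ten and last two residues of the protein sequence listed above.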