Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal_1038 |
Symbol | |
ID | 4841413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | + |
Start bp | 1195310 |
End bp | 1197238 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640118262 |
Product | von Willebrand factor type A |
Protein accession | YP_001049431 |
Protein GI | 126173282 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGCCG TTTACAGCTT TAGGTCAACA TCAAAGGCAT TCAACATGAA AACACAATAT TCCCCAAAAA CTGTTTCAGT TTCCGATCCA ACTCTCTACC TGCAACGGGG AAACGGTATC CCATCGGCGA GCAATACGGC TGCTCTATTA CTGGTCGCAG TGAGTTTAAC GGCCTGTGGT GGTAAAGGCG CCGAAGTTGA ACATCGACAA GCTGAGCAGC AAGCCGAGCA ACGTCATCAA GTAGCGTCGC AGCGCCAAGC TGAAATGCGT GATGCCGCTA AGGTTGAGAT GGCGAGAGTG GCGGCGCCGA TGCAAATGTC TAGCAATGGG GCTGTGATGG GGATGAGCAT AGCGCCAATG CCGCGTGACT ATGCCGCGAT TCCGTTAGCG CAAAATAAAT TCGAGCAGCA AGTGCAAAAT GGCATCATGG TGGCAGGGGA GATCCCTGTA TCGACGTTTT TCATCGATGT CGATACTGGC AGTTATGCCA CCTTAAGACG GATGTTGAGG GAAGGGCGCT TACCCGAGAA AGGCACTGTC AGAGTTGAGG AAATGCTCAA TTATTTTGCC TACGATTATC CCTTACCCGC TAAAAACGCG GCACCGTTTA GCGTGACGAC AGAGCTTGCT CCCTCACCCT ATAACGATGA CATGATGCTG CTGCGGATTG GCCTTAAAGG TTATGACTTA CCTAAATCTC AATTAGGCGC GAGTAACTTA GTCTTTTTAC TCGATGTGTC AGGCTCTATG GCGTCAGCGG ATAAATTGCC TTTACTGCAA ACGGCGTTAA AGCTGCTAAC AGCGCAATTA AGTGCGCAGG ATAAAGTCTC TATTGTGGTT TATGCTGGGG CTGCTGGTGT GGTGCTCGAT GGCGTGTCGG GGAACGACAC TCAAACCTTG ACCTATGCGT TAGAACAATT AAGTGCCGGT GGTTCAATCA ATGGTGGGCA AGGGATCACG CAAGCCTATC AATTGGCCAA AAAGCATTTT ATCCCTAATG GCATAAATCG AGTCATCCTC GCGACCGATG GCGATTTTAA TGTTGGCGTG ACAGATTTTG ATGATTTGAT TGCCTTGATT GAAAAGGAAA AAGATCATGG TATTGGCCTA ACGACCTTAG GGTTTGGTTT GGGCAATTAT AACGATCAAC TGATGGAGCA ATTGGCGGAC AAGGGCAATG GCAACTATGC CTATATTGAT ACGCTGAATG AAGCGCGAAA AGTGCTGGTG GACGAGTTGA GTTCGACCTT ATTTACTATC GCCAAAGATG TAAAAGTGCA GGTGGAGTTT AATCCGGCCT TAGTCTCGGA ATACCGTTTG ATTGGCTATG AGAATCGCGC CTTAGCACGG GAAGATTTTA ATAACTATAA GGTGGATGCG GGCGAGATTG GCGCGGGTCA TACAGTAACA GCCTTGTATG AATTAAGGTA CGTTGAAGCC GGGAATAGGA TGAATGATAA ACTTAGATAT GGCGTTGATG CTCAAACGGG GAAAGAGAAA TACAGCCGTA AAGAAATTGC TTTCCTTAAA TTGAGATACA AGTTGCCAGC ACAAACGCAG AGCCAATTAC TAAGTTATCC CATCAGGTTA GATCAAAGCG TTAAACAGCT CGAGCAAGCA AGTGATGATT TTAGATTTGC CGCCGCTGTT GCAGGGCTAG GACAATTACT GAATGGCAGT CACTATCTAC ATCAATTTGA TTATACTAAG TTAAGCTTAC TCGCACGTTC AGCATTAGGG GATGATCCCT TTGGTTATCG CCATGAGTTT GTGCAGTTAA TGGAAACCGC AGCGGCGATA GAGCAATCTA ATCAGCTGCC AATTAAGAAA GCATTCGATG GCTCGGATAA ACCTTTTCCT CCCCAAGATA AACTCCATGG CGAGCCAATG AGAGATAAGA GCAATCCGCG TAATGAGAGA TTACAATAG
|
Protein sequence | MRAVYSFRST SKAFNMKTQY SPKTVSVSDP TLYLQRGNGI PSASNTAALL LVAVSLTACG GKGAEVEHRQ AEQQAEQRHQ VASQRQAEMR DAAKVEMARV AAPMQMSSNG AVMGMSIAPM PRDYAAIPLA QNKFEQQVQN GIMVAGEIPV STFFIDVDTG SYATLRRMLR EGRLPEKGTV RVEEMLNYFA YDYPLPAKNA APFSVTTELA PSPYNDDMML LRIGLKGYDL PKSQLGASNL VFLLDVSGSM ASADKLPLLQ TALKLLTAQL SAQDKVSIVV YAGAAGVVLD GVSGNDTQTL TYALEQLSAG GSINGGQGIT QAYQLAKKHF IPNGINRVIL ATDGDFNVGV TDFDDLIALI EKEKDHGIGL TTLGFGLGNY NDQLMEQLAD KGNGNYAYID TLNEARKVLV DELSSTLFTI AKDVKVQVEF NPALVSEYRL IGYENRALAR EDFNNYKVDA GEIGAGHTVT ALYELRYVEA GNRMNDKLRY GVDAQTGKEK YSRKEIAFLK LRYKLPAQTQ SQLLSYPIRL DQSVKQLEQA SDDFRFAAAV AGLGQLLNGS HYLHQFDYTK LSLLARSALG DDPFGYRHEF VQLMETAAAI EQSNQLPIKK AFDGSDKPFP PQDKLHGEPM RDKSNPRNER LQ
|
| |