Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0667 |
Symbol | |
ID | 5105273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 609730 |
End bp | 611085 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506571 |
Product | von Willebrand factor, type A |
Protein accession | YP_001190766 |
Protein GI | 146303450 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.949287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0104959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGGTC TCCTTAGAGG TGTAGATTAC GAAAGCCCCG TTGTAAAGTA CAGGGGCGAG AGGATACTCA ACACCCTAAG GAGGGTCTCT GGGAAGGAAA GCAACGTAGA TCCCCTATTC CTCATTGATA CCTATTACGT GCACTATCTC CCCCTACCTA TACTAAAAAC AAAGGGGGAA ATAGATCAAA GCGACTCCAT TAAGTACTCA TTGATTGATC TTACCATTTC GTCAGAGATT GTAAACAGGA ACAGAAACTA CTCAATTGCA AACTCAGCAG TGAGTATGGC CCTTTCCGTG AGTTATGTGC AAAACTTGAT AGAGGAATTG GAGAGAATTA GGAGGACTTC ACAGTCGCAG GAGGAGAGAG AGGCCGCAGA GCAGATCCTT AACGGAATAA TGAAGGGAAG TCAGGGTAAG GAGGGAAAGC AGAATCAGAA CCAGCAACAG GAAAACCAGA CCACTGGAAA GCTAATGAAG CAGGTTCATG AAAAGGCTAT GGCAAAGGCG TCTGAGGATG CTAATTCCGT CAGGAGTATG CAGAGGATAG TTGGAGGTAA TGGGGCCGGT ACAGGATCGA TGATGAACTT TGAGGGAGAT ATACACGACG TGCTAAGACT AGCTAGGAAC ACGGAGATCA AGAAGATCCT GGAGTTTCTG AGCGGTATCC CGAAGCTGGG TAGCTTCACC AAGAAGAGGA CCACAAGATA TGCTAGGGGA GAGCTTTATG GATATGAGGA AGGTTCAGAC CTAGAGAGAC TGGTTCCCTC AGAACTGGCC TTACCCGAGG AACTCTTTGA TGTGAAGCTT GCAGAGAGCC AGCTATTACT ATATCAGAAA CAGATTAAGG AAACCCTAGG ACCCATATAT CTACTATTGG ACAAGTCAGG CAGCATGGAT GGAGAAAAGA TCCTGTGGGC TAAGGCTGTA GCACTGGCCC TCTACAGTAG GGCTAGAAGG GAGAACAGGG ACTTCTATCT AAGGTTCTTC GATAACATTC CATATCCTCT GATCAAGGTC ATAAAGAATG CCAAGAGCAA GGACGTGATC AAGATGGTAG AGTATATTGG GAAGATAAGA GGTGGAGGCG GAACAGATAT ATCTAGATCT GTAATGTCTG CCTGCGACGA TATAAAGGAC GGTCATGTTA GGGGTGTAAG CGAGGTCATA ATTTTGACGG ATGGAGAGGA TAAGATCGCT GAAACCACTG TTAGGAGATC CCTTAAAGAG GCAAATGCTA CGCTCATTAG CGTGATGATA AGAGGAGATA ACGCGGATCT AAAGAGAGTT TCAGATAACT ACCTAGTGGT TTACCGTCTA GATCAGGGAG ACCTACTTAG GGTAGTTGAA TCCTAA
|
Protein sequence | MTGLLRGVDY ESPVVKYRGE RILNTLRRVS GKESNVDPLF LIDTYYVHYL PLPILKTKGE IDQSDSIKYS LIDLTISSEI VNRNRNYSIA NSAVSMALSV SYVQNLIEEL ERIRRTSQSQ EEREAAEQIL NGIMKGSQGK EGKQNQNQQQ ENQTTGKLMK QVHEKAMAKA SEDANSVRSM QRIVGGNGAG TGSMMNFEGD IHDVLRLARN TEIKKILEFL SGIPKLGSFT KKRTTRYARG ELYGYEEGSD LERLVPSELA LPEELFDVKL AESQLLLYQK QIKETLGPIY LLLDKSGSMD GEKILWAKAV ALALYSRARR ENRDFYLRFF DNIPYPLIKV IKNAKSKDVI KMVEYIGKIR GGGGTDISRS VMSACDDIKD GHVRGVSEVI ILTDGEDKIA ETTVRRSLKE ANATLISVMI RGDNADLKRV SDNYLVVYRL DQGDLLRVVE S
|
| |