Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1383 |
Symbol | |
ID | 5104593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1359381 |
End bp | 1360472 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507272 |
Product | von Willebrand factor, type A |
Protein accession | YP_001191465 |
Protein GI | 146304149 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.130674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000484488 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCAGAA TTGTCCTTGT TCCTGAATCG AAATTTGAAG CTAAGAACCT CCACTATGTG ATCCTGATTG ATAGGAGTTA CTCCATGAAG GGTGAGAAGC TGGAGATGGC CAAGGAGGGA GCTAGGTTAC TTGTTGATAA CCTGCCCAAG GATAGTCGTT TCTCCTTACT GGCCTTCAAC GAAAAGGTGT CGATAATCAA GGAGCATGAA CATCCTTCAG AGATGGGGAA GGAACTTGAG AGCCTTAAAG TTGGAAGCGG TACCGCAATG TATAAGGCAT TACAGGAGGC ATTTAACCTA GCTAGAAAGT ACGGCGAACC AACATACGTT ATATTGCTCA CTGATGGGGT TCCCTCAGAC ATGGGATGTA TGCCTGGGCT ATCTAGGAAA TTTGACCTAA ACAGATGTCT TCCCGTATAT CAGGGATTGT CAGTACCTGA GAACGTACAG ATCATATCCT TTGGAATTGG AGATGACTAC AGCGAAGAAA TACTCACTGA GGTTTCAGAA AAGGGAAGAG GCTTCTTTTA TCATGTTACT GACCCTGCTC AAATCCCTGA GAAGATGCCC AAGCTGGTAA AATCTGAGGT TGCTGCTAGT GACGTAACGG TGGATCTAGT GTCCGAGTCC CCTGTGAAAT TACTGAACTA TGATTCCCTG CCTGTAAGGA TAAACGCTGT TGAAGGCGTG GTAAAAATTT TTGGAGAAAC GGTAATTCCC AAGGAGTATA CGGGAAAGTT TATGACGCTT AAGGTGAAGT ACAGGGATGA GAAGGGGATT CGGGATAGGA CACAGGAGTT CTTCCTGACC CGCGCACAGA ACCAGCAGGA CTTCATCTCA GCGATCGACA GAGACGTCAT CATGGAGTAT GAATATCTGC AAACGCTTCA GAACTACTCC AGGGATCTAG AGGCTAGAAA CCTGGTTGAG GCCACGAAGA AGTTGGATAG GCTAAGGGAG ATAGCGGAGC AGACCAGGAG ACAGGACCTT CAGGAGGTGG CGGAGGAACT CACGAGAAAA ATGACAAGCG GAGAGGGTAA CCCGAAGGAA ATTGCGAGCG AGGTTACAAG GAAGATGAGG GGTGCGGAGT AG
|
Protein sequence | MFRIVLVPES KFEAKNLHYV ILIDRSYSMK GEKLEMAKEG ARLLVDNLPK DSRFSLLAFN EKVSIIKEHE HPSEMGKELE SLKVGSGTAM YKALQEAFNL ARKYGEPTYV ILLTDGVPSD MGCMPGLSRK FDLNRCLPVY QGLSVPENVQ IISFGIGDDY SEEILTEVSE KGRGFFYHVT DPAQIPEKMP KLVKSEVAAS DVTVDLVSES PVKLLNYDSL PVRINAVEGV VKIFGETVIP KEYTGKFMTL KVKYRDEKGI RDRTQEFFLT RAQNQQDFIS AIDRDVIMEY EYLQTLQNYS RDLEARNLVE ATKKLDRLRE IAEQTRRQDL QEVAEELTRK MTSGEGNPKE IASEVTRKMR GAE
|
| |