Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_0550 |
Symbol | |
ID | 8446134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 609975 |
End bp | 611918 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645039684 |
Product | von Willebrand factor type A |
Protein accession | YP_003199955 |
Protein GI | 258650799 |
COG category | [R] General function prediction only |
COG ID | [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAC GTCGCCGGTA TTCCTACCGG GCCTACGACG GCGGCCCGGA CCCGCTCGCG CCGCCGTTCG ACCTGCGCGA GGCAATCGAC CGGATCGGCG CCGACGTGCT GGACGGCAGC TCTCCGCGGC AGGCCCTGCA GGAACTGCTG CGCCGCGGGC TGGGCGAACG GCGCGGGCTG GACGAGCTGA CCCGCCAGTT GTGGCAGCGG CGGCGGGAAC TGCAACGCAA CAACCGGCTC GACGGCACCC TGCAGCAGGT CCGGGAACTG CTCGACCGGG CGCTGACCGC CGAGCGAAAG GCGCTGGCCC GCCAGGATTC GGACGACGCG CGGTTCGGCG AGCTGCAGCT GGACGCGTTG CCCACCGACG CCGCCGGCGC CGTCCGCGAG CTGGAGAATT ACGACTGGCA GTCCCCCGAC GGGCGGGCCG CCTACGAGCA GATCCGCGAC CTGCTGGGGC GGGAGATGCT CGACCAGCGG TTCGCCGGCA TGAAACAGGC GCTGGAGAAC GCGACATCGC AGGACGTCCA AGCCATCCGG GACATGCTGG CCGACCTCAA CGAGTTGCTG TCGGCGCACG CCCGGCTGGA GGACACCACC GAGCGGTTCG GCGAGTTCAT GAACCGGCAC GGCCACTTCT TCCCCGAGCA ACCGCGCACC ACCGACGAGC TGATCGACGT GCTCGCGGCC AGATCCGCGG CGGCGCAACG GATGATGAAC TCACTGTCGG CCGACCAGCG GGCCGAGCTG GCCGCGCTGA CCGAGCAGGC GTTCGGCGAT CCCCGGCTGG CGCAGTCGCT GGCCCAGCTG GACGCGATGC TGCAGCAGTT ACGGCCCGGC CAGGACTGGG ACGGCTCCGG CAGGTTCCGC GGCGACAACC CGATGGGCAT GGGCGAGGCC ACCCGAGCCC TGGAGGAGCT GGGCCGGCTG GAGTCGCTGG GCGAGCAACT CGCCCAGGGC TACCCGGGCG CCAGCCTGGA CGACATCGAC CTGGACCTGC TGCAGGACGT GCTGGGCGAG CAGGCCCGGG TGGACGCCCG GGCGCTGGCC GAGCTGGAAC GTGAACTGCG CGAACAGGGC CTGCTCGAGC GGGCCGCCGA CGGCTCCCTG CAGCTGTCCC CCAAGGCACT GCGCCGGTTG GGCCAGACCG CGTTGCGCGA CATCGCGGAC CAGGTCGCCG GCCGCTCCGG GGAGCGGGAG ACCCGTCGGT CCGGGCCGGC CGGCGAGGCC TCCGGGGCGA CCCGGCCGTG GGCCTTCGGC GACACCGAAC CGTGGCACGT GCCGCGGACC CTGCTCAACG CCCAGGTGCG GCGGGCCGGC GGCGATCCCC GAGTGCTGGA CGTCACCGAT GTCGAGGTCG TGGAGACCGA GCGACGGGCC CGCGCCGCGG TGGCGCTGTG CGTGGATACC TCCTGGTCGA TGGTGCAGGA GGGCCGGTGG GTGCCGATGA AACGCACCGC CCTGGCCCTG CACCAGCTGA TCTCCACCCG GTTCCGCGGG GACGACCTGG CCCTGATCAC CTTCGGCCGG CACGCCGAGA AGGTCGAGCT GGGGCAGCTG GTCGGGCTGG AGGGTGCCTA CGTGCAGGGC ACCAACCTGC ACCACGCGCT GCTGCTGGCC GGCGCGCACC TGCGGCGGCA CCCCGATGCC ACCCCGGTGG TCCTGGTGGT GACCGACGGC GAACCCACCG CGCACCTGGA ACCCGATGGC TCGCCGCACT TCTCGTACCC GCCGGATCCG GAGACGGTGC ATGCGACGGT GGGCGAACTG GACCGGCTGA CCGGGCTGCG CGCCGCCGTC ACCTTCTTCA TCCTCGGTGA CGACCCGCGG TTGGCCCTGT TCACCGACAA ACTGGCTCGC CGCTGCGGCG GCCGGGTGGT CGCCCCCGAC CTGGACGGGC TCGGGGCCTC GGTGGTCGCC GACTACTTGC GGCATCGGCG ATGA
|
Protein sequence | MTARRRYSYR AYDGGPDPLA PPFDLREAID RIGADVLDGS SPRQALQELL RRGLGERRGL DELTRQLWQR RRELQRNNRL DGTLQQVREL LDRALTAERK ALARQDSDDA RFGELQLDAL PTDAAGAVRE LENYDWQSPD GRAAYEQIRD LLGREMLDQR FAGMKQALEN ATSQDVQAIR DMLADLNELL SAHARLEDTT ERFGEFMNRH GHFFPEQPRT TDELIDVLAA RSAAAQRMMN SLSADQRAEL AALTEQAFGD PRLAQSLAQL DAMLQQLRPG QDWDGSGRFR GDNPMGMGEA TRALEELGRL ESLGEQLAQG YPGASLDDID LDLLQDVLGE QARVDARALA ELERELREQG LLERAADGSL QLSPKALRRL GQTALRDIAD QVAGRSGERE TRRSGPAGEA SGATRPWAFG DTEPWHVPRT LLNAQVRRAG GDPRVLDVTD VEVVETERRA RAAVALCVDT SWSMVQEGRW VPMKRTALAL HQLISTRFRG DDLALITFGR HAEKVELGQL VGLEGAYVQG TNLHHALLLA GAHLRRHPDA TPVVLVVTDG EPTAHLEPDG SPHFSYPPDP ETVHATVGEL DRLTGLRAAV TFFILGDDPR LALFTDKLAR RCGGRVVAPD LDGLGASVVA DYLRHRR
|
| |