Gene Information |
Locus tag | Ndas_0568 |
Symbol | |
ID | 9244410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 703836 |
End bp | 705602 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003678521 |
Protein GI | 297559547 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
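The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the source record: the function names are hypothetical, and it assumes inclusive 1-based coordinates and that the gene length includes one terminal stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(703836, 705602)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1767 bp) and Protein Length (588 aa). The strand field does not affect this check, because the length is the same on either strand.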
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCAGTG GACGCCACCG ACAGGCCTCA CCGACCGCCG CGAACACCCG CAGGGTGGTC GCGTCGCCGT TGGGGATCAC CGCGATCGCC GTCGCCCTGA TCGCGGTGGC CGCGGTCGCG GTCCCGGTCG GACTCGACAT GCTCGGCTGC GGCGACACCC GCTACCTGCG CGTGTCGGCG ACCCAGAGCA TCGCGCCCGT GCTCCGCGAG GCCGCGGGCG AGTTCAACGA CGAGCGCCCC AGCTACGACG GGGAGTGCGT GTACGCGCAG GTGGACGAGA TCGCGCCGCA CCGCATCATG ACCGCCCTGT CCGGCGGCCA GGCGGGCGAC TCCACCATCG CCCCGCACGT GTGGGTGCCC GAGTCCTCGG CCTGGGTCGA ACTGACCCGC GTATCCGGGA GCGGCACGCA CCGCATCGAC ACCGAACCGC CCTCGCTGGC CAGCTCACCG GTCGTCCTGG CCGCGCCGCG GGGCGGCGGG GGCCTGCCCG AGCCCGACGA GGCCGGGTGG CCCCTGGTCC TGCCCGACGA ACGCGAACGC CCCCTGGTGA TGGTGGACCC CAACCGGGGC GCGGACGGTA TGGCCGTCAT GCACGCGGTC CGCCGACACC TGGGCACCGG CGACGGAGCC GACACCGCCA TGACCGACTT CGTGCGCGAC GTGCAGCTCG ACAGCGCGTT CGGCGAGATC GACCTCGCCA CCTTCTACAG CTCCGGCCGT ACCGGCGGCG GGAGCGGCGA AGGCGGCGGA GGGCGCGTCG ACCCGCTGAT CGCCGTCCCC GAACAGGCCG TGGTGTCCTA CAACGCCGAC CGCGCCGAGT CCGCGCCGCC GCTGGAGGCC CACTACCCCA CCGAGGGCAC CGTCAGCCTC GACTACCCCT ACGTCACCAC CACCGACACG GCCTCGCTGC GGTCCGCGGC CGCCGACCTG CACGAGGTGC TGCGCCGGGA CTCCTACCGC GCCCGGCTCC GGGAGCTCGG CTTCCGCGAC CCCGACGGCA CCCTGTCCGG TACGGCCGGT GCGGACCCCG ACGGGTTCGG TGTGACCGCG GAGGAGCCGC CCACCCACGA CGACCTGACC GGGGACGCCC TGCTGGCCTC GGTCACCGAC TGGAACCGGC TGTCGATGCC CAGCCGCACC CTGGTGCTCG CCGACACCTC CGCGAACATG GCGGAGGACC TCGACGGGGG CCCGTCCCGG ATGGAGGTCG CCCAGCAGGC GGCGCTCATG GGCCTGTCGC TGTTCCCCGA CGAGACCGAC ATGGGTCTGT GGCTGATGTC GGACGAGAAC GCGAGCGGCC GCGTCGAGGC CGCGGACATG CACCCCCTGG GCGGGGCGGA GCAGGGCGAC ACCGCCACCC GTCGCCGGGA ACTCATCGGG GTGGCCGAGG AGATCGCGGT CCGCGGCGGC GGTTCGCGCC TGTACGACAA CATCCTGGCC GCCTACGACC GGGTGCAGGA CGACTACGAC GAGGACAAGA TCAACAGCGT CATCCTGCTC ACCGCCGGCC AGGACGAGGG GTCCAGCGAC ATCGCGCACG CGGACCTGGT GGCGGCGCTC CAGGACCGCT TCGACCCCGA GCGGCCGGTC AGCATGTTCA TCATCGCCTT CGGGTCGCGT GAGCAGCAGG TCGCGGAGGA GGAGCTGCGG CGGATCGCGG CCGCCACCAG CGGTTCGCTG TTCGTCACCG ACGACCCCGA CGAGATCGGC GACATCTTCC TCAGCTCCAT CTCACGGCGT CTTTGCGTGC CCGACTGCGA CAGCTGA
|
Protein sequence | MPSGRHRQAS PTAANTRRVV ASPLGITAIA VALIAVAAVA VPVGLDMLGC GDTRYLRVSA TQSIAPVLRE AAGEFNDERP SYDGECVYAQ VDEIAPHRIM TALSGGQAGD STIAPHVWVP ESSAWVELTR VSGSGTHRID TEPPSLASSP VVLAAPRGGG GLPEPDEAGW PLVLPDERER PLVMVDPNRG ADGMAVMHAV RRHLGTGDGA DTAMTDFVRD VQLDSAFGEI DLATFYSSGR TGGGSGEGGG GRVDPLIAVP EQAVVSYNAD RAESAPPLEA HYPTEGTVSL DYPYVTTTDT ASLRSAAADL HEVLRRDSYR ARLRELGFRD PDGTLSGTAG ADPDGFGVTA EEPPTHDDLT GDALLASVTD WNRLSMPSRT LVLADTSANM AEDLDGGPSR MEVAQQAALM GLSLFPDETD MGLWLMSDEN ASGRVEAADM HPLGGAEQGD TATRRRELIG VAEEIAVRGG GSRLYDNILA AYDRVQDDYD EDKINSVILL TAGQDEGSSD IAHADLVAAL QDRFDPERPV SMFIIAFGSR EQQVAEEELR RIAAATSGSL FVTDDPDEIG DIFLSSISRR LCVPDCDS
|
| |
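The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the source record: the helper names are hypothetical, and the sample string is only the first three blocks of the gene sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence above, used only as sample input.
sample = "GTGCCCAGTG GACGCCACCG ACAGGCCTCA"
joined = clean_sequence(sample)
print(len(joined), joined[:3], round(gc_fraction(sample) * 100))
```

Applied to the full gene sequence, `gc_fraction` gives a value that can be compared with the GC content field of the record. The leading GTG is an alternative start codon under translation table 11, which is why the protein sequence begins with M.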