Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0567 |
Symbol | |
ID | 9244409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 701800 |
End bp | 703563 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003678520 |
Protein GI | 297559546 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGACGTC ACCGCGGAAG ATACGCAGAA GAGACCTCCG GCCGGTCCAG GCGCCGTCGC GGCCGCGGCG GCGCCTTCGC CGCCCTGGCC GCCGCGCTGG TGATCGTGGT CGGCCTCGCC GCGGTGGGCG TCTACGTGTT CGGCCGGTCC GACGGCTGCG GCGGTTCCGA CATCGCGCTG GACGTCGCGG TCAGCCCCGA ACTCGCCCCG GCCCTGACCG ACGTCGCCTC CGACTTCAAC GCGGAGGAGC ACCAGGTGGA CGGGAGCTGC GTACAGGTGC AGGTGCGCCA GGTCGACTCC GCCAACGTCG CCTTCGGCAT CACCGGCGCG GGAGCCACCA TGGGCGACAC CGACTCCGAC GTGTGGATCC CGGACTCCTC CCTGTGGCCG CGCCTGGTCC AGAGCCAGGC GGGCGACGCC GTCATCACCG AGACCGGCAC CTCCGTGGCC CGTTCGCCTC TGGTCCTCGC CGAACTCACC GAGTTCGCCG ACGAGAACTC CCCGAGCAGT TGGGCGGAGG TCGTGCCCAC CACCGCCCCG GGCCAGGAGG CGGAGCGCAC CGTCCGCGTG GTCGACCCCG CCCGCAACGC CACGGGGCTG GGCACCCTCT ACCTCCTGCA CGGAGCGCTG GAGGAGGCCA GCCCGGACAC CGCCACGTTC AACGCGCGGA TGACCGCGGT CCTGCAGGGC CTGCACCGGG GCGCGTCCTC GGACGAGGAG GCCGCCTTCC TCGCCCTCAG CGGCGGCGGC GCCGAGGCCC CGCCCGTGAT GGTGATGTCG GAGCAGGCCG TGTGGCGCTA CAACGCCGCG CACGGGGACG CCCCCGCCCA GGTCGGCTAC ATGGAGGGCG GCACCTACTA CCTCGACTAC CCCTACGTCG TGCGCAGCGA GGAGAGCGCC GTCACCCGCG CCGCCGAGGA GTTCCGCGAG GCGGTGCGCG GGGAGGAGGC GCGGACCCGG CTGCTCGCGG AGGGGTTCCG CGGCCCCGAG GGGCAGATCG ACGCCTCCGT GCTCACCGAG GAGGTCGGCT TCGCCGCCGA GCCGCCCACC GAACTGCCCA CGCCCGCCGC GGACTCGATC ACCGGGCTGA TCCGCACCTG GAACCAGCTC AAGATGGACT CCCGGGTGCT CGCGATCGTG GACATCTCCG GCTCGATGCT GGCCGAGGTG CCCGGGACCG GGATGACCCG CATGCAGGTG ACCAGCGCCG CCGCCACGCA GGGCCTGGAG ATGTTCACGC CCAGTTCCGA GCTGGGCCTG TGGGAGTTCT CCACCAACGT CAACAACGAA CTGCACTACC AGGAGATCGC GCCGATCCGC GAACTCCAGG CGGCCGCCGA CGACGGCACC GCGCACCGGG ACGTCCTGGC GGGCGCGCTG GCCTCGCTCC AGCCCCTGCC GCAGGGGGAC ACGGCGCTGT ACGAGACCTA CCTGGCGGCC TACCAGGAGA TGTCGCGCAC CTACCAGCCC GACCGGACCA ACGTCATCCT CATGCTGACC GACGGCGACA ACGACAACCC CGGCGGCCTG GGGCTGGACG AGCTGATGTC CCAGATCGAG TCCCTGGCGA GCCCGTCACG GCCGATCCCG ATCATCACGA TCGCGTTCGG GCCCGACGTG CAGAACCTGG AGCCGCTCCA GGAGATCGCC GCCGCCACCG GCGGCGCCGC CTACATGACC GAGGACCCGA CCGAGATCGG CGAGATCTTC CTCCAGGCGT TCTCCCTGCG CATCTCCGAG GACTCCGAGG AGACCACGGA GTAG
|
Protein sequence | MGRHRGRYAE ETSGRSRRRR GRGGAFAALA AALVIVVGLA AVGVYVFGRS DGCGGSDIAL DVAVSPELAP ALTDVASDFN AEEHQVDGSC VQVQVRQVDS ANVAFGITGA GATMGDTDSD VWIPDSSLWP RLVQSQAGDA VITETGTSVA RSPLVLAELT EFADENSPSS WAEVVPTTAP GQEAERTVRV VDPARNATGL GTLYLLHGAL EEASPDTATF NARMTAVLQG LHRGASSDEE AAFLALSGGG AEAPPVMVMS EQAVWRYNAA HGDAPAQVGY MEGGTYYLDY PYVVRSEESA VTRAAEEFRE AVRGEEARTR LLAEGFRGPE GQIDASVLTE EVGFAAEPPT ELPTPAADSI TGLIRTWNQL KMDSRVLAIV DISGSMLAEV PGTGMTRMQV TSAAATQGLE MFTPSSELGL WEFSTNVNNE LHYQEIAPIR ELQAAADDGT AHRDVLAGAL ASLQPLPQGD TALYETYLAA YQEMSRTYQP DRTNVILMLT DGDNDNPGGL GLDELMSQIE SLASPSRPIP IITIAFGPDV QNLEPLQEIA AATGGAAYMT EDPTEIGEIF LQAFSLRISE DSEETTE
|
| |