Gene Information |
Locus tag | Ndas_3551 |
Symbol | |
ID | 9247420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4261133 |
End bp | 4262650 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003681458 |
Protein GI | 297562484 |
COG category | |
COG ID | |
TIGRFAM ID | |
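The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4261133, 4262650)
    print(length, protein_length(length))
```

With the values listed for Ndas_3551, this gives 1518 bp and 505 aa, which agrees with the Gene Length and Protein Length fields.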
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.104304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.664507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCTGT CAGCGCTGTC GGACTTCGAC GCCGTCCCCC GGGACACCGA AGACGCCGTG TCCGTCCTGG TCGACATCAC CGCCCCCGAA CGCGAGGAGG AGACCGAACG CCCGCCCGCG ACCCTCCAGG TCGTGCTGGA CCGCAGCGGG TCCATGGGCG GAGGACGCCT GGACGGCGCG GTGCGCGCGC TGCTCTCCCT GGTGGAGCGG CTCGCGCCCT CCGACAACTT CGGACTCGTG TCCTTCAACG ACCAGGCCCG GGTCGAGGTG CCCTGCGGGC CGTTGGAGGA CAAGGCGCGG GTGCGCCGCC TGATCTCCGG GCTGCACGCG TCGGGCGGCA CCGACCTGTC CAGCGGACTG CTGCGCGGCG TGCAGGAGGC CCGCCGCGCC GGAGCCGACA GGGGCGGCAC CCTGCTGCTG ATCTCCGACG GCCACGCCAA CCAGGGCGTC ACCGACCACG ACCTGCTCCG ACAGGTGGCG GCCGACGCCT ACGCCCACGG GGTCACCACC ACGTCCCTGG GGTACGGCCT GGGCTACGAC GAGGAGCTGC TGGGCGCGGT GGCCGACGGC GGCGCGGGCA GCGCCCTGTT CGCCGAGGAC CCCGACACCG CGGGCGGCCT CATCGCCCGG GAGGCCGAGT ACCTGCTGGC CAAGACGGCC CAGGCCGTGT CCCTGCGGGT TCCGTCCGGC CCGCTCCTGC GCTCCGTCTC CGTGGTGGGC GAGATGCCCT CCCACCGGCT CGCGGACGGA TCGGTGGTGG TCGAACTGGG CGACTTCCAC TCCGGGGAGC GGCGCCGCCT GCTGCTGCGC CTGGAGGTCC GCGGGCTGTC CGCGCCGGGC GCCGTCGCCG CGCTGGAGGT CGCCTACGCC GACCCGGCCA CGCTGGACAC CCGCACCGTG TCCCTTTCCG TCGAACTCGA CGTGGTCGCG CGGGACGCGG CCGACGAGCG GGTGCCCCGG CCCGAGGTGC GCGCGGAGGA GGTCCTGCAG CGGGCGCAGA CCGCCAAGAG GAGGGCCAGC GAGGCCATGC GGCGGGGCGA CCGCTTCGGC GCCGCGGGTC TGCTGGAGGA GGCGCGGACG GACCTGGCCG GCCACATGCC GGCGGCGCCC GCCGGGGCGG GTGCGGCGGC TCCGCCGCCG GAGGTGCTCG CGCAGATGGA GGAACTGCGG CGAATGGCGG GGATGTCCCG GACCGGCGAC GCCTCCCGGG TCTCCAAGTC GCTGTACGCG AGCCAGGCGG GTTACTCGCG CAAGAGCGGC CGTCAGCGTC CGGGGGCCGC GCAGGACGGC GGGGAGCGGG CCGGAGGGGA CGGGGACCGG GCGGAGGAGG ACCGGACCGG CGGCAACGGG AGCCGGGACG GTGCCGCCGG TCCCGGTCAG GGCTCCGGCC CCGAGGTCAG GGGCGGCCCC GGCCGGGGCT CCGGTCGCGG TCGCCGCGGT CGCCTCATCC GGGGCCAGCA GACCGACGGC GGACGACCCA CCCCCGACGA AGTCGCTCCT CCCCCGCGGG AGTCCTGA
|
Protein sequence | MHLSALSDFD AVPRDTEDAV SVLVDITAPE REEETERPPA TLQVVLDRSG SMGGGRLDGA VRALLSLVER LAPSDNFGLV SFNDQARVEV PCGPLEDKAR VRRLISGLHA SGGTDLSSGL LRGVQEARRA GADRGGTLLL ISDGHANQGV TDHDLLRQVA ADAYAHGVTT TSLGYGLGYD EELLGAVADG GAGSALFAED PDTAGGLIAR EAEYLLAKTA QAVSLRVPSG PLLRSVSVVG EMPSHRLADG SVVVELGDFH SGERRRLLLR LEVRGLSAPG AVAALEVAYA DPATLDTRTV SLSVELDVVA RDAADERVPR PEVRAEEVLQ RAQTAKRRAS EAMRRGDRFG AAGLLEEART DLAGHMPAAP AGAGAAAPPP EVLAQMEELR RMAGMSRTGD ASRVSKSLYA SQAGYSRKSG RQRPGAAQDG GERAGGDGDR AEEDRTGGNG SRDGAAGPGQ GSGPEVRGGP GRGSGRGRRG RLIRGQQTDG GRPTPDEVAP PPRES
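The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The sketch below is one way to do that in plain Python, under stated assumptions: the sequence is pasted as text with whitespace between 10-base groups, and the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code assignments).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three codons of the gene sequence plus its stop codon.
    print(translate("ATGCACCTGT GA"))
```

Passing the full Gene sequence value to `gc_content` and `translate` allows the results to be compared against the GC content and Protein sequence fields. This sketch does not handle alternative start codons, which table 11 permits; that does not matter here because the gene begins with ATG.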