Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4403 |
Symbol | |
ID | 9248278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5237103 |
End bp | 5238455 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003682298 |
Protein GI | 297563324 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.239793 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGACA CGACTGAGCG CGCGGCCGAA CCGGGCTTCC GGATCGAGGT CGACCAGAAC GTCCTCCTGC CGGTGGGCGG ACGCGAGGTG CACGCGATCG TCAGCGTCAC CTCCACCGGC TCCGTGGTGG TGGGGGACTC GGTGCGCGCC GCCGAGGTGA TCATCGTGGA CACCTCCGGC TCGATGCACG GCGCCAAGAT CGACGCGGCC AAGCAGGCCG CCCGCGCCGC CGTGGGCGTC CTGCGCGAGG GCGTCCACTT CGCGGTGGTC GCCGGGCACA GCGACGCCTC CGTGCTCTTC CCCGAGGGAG GCGGGCGGAT GGTGCGGGCC GACGCCGTCA CCCGCGCCGA GGCCGCGGAG GCGATCGGCG GACTGCGCGC CGACGGCGGC ACCCGCATGG GTTCCTGGCT CGTCCGGGCC GCCGAGCTGT TCGCGACCGT GGAGGGCGGC ATCAAGCACG CCATCCTGCT CACCGACGGC CAGAACAACG AGCCCGCCGC GGTCTTCGGG CAGGCGCTCG ACCGCGTCGC GGGCTCCTTC GTCTGCGACT GCCGGGGCGT GGGCACCGAC TGGAGGGTCG AGGAGCTGCG CCGGATCGAC TCCGCCCTGC TCGGCGGAGG CCCAGGCATC ATCGCCGACC CCGCCGACAT GGTGGCCGAC TTCCGGGCGA TGGCGCAGGC CTCCATGGGC AAGACCGTCG CCGACGTGGT GCTGCGGCTG TGGACGCCCC AGAACGCCGT CGTCCGCCAC GTCAAGCAGG TCGCCCCCAC CGTCAGCGAC CTCACCGGCC GCGGCGTGGA GCGCGTGCCC CAGTCGGGCG ACTACCCCAC GGGGTCCTGG GGCACCGAGA GCCGCGACTA CCACGTGTGC GTCCAGGTGC CGCCCGGCGT CCCCGGCCGC CAGCTGCGCG CCGGATGGGT GCGCGTGGTG CTCCCCGGGA CCGCCGGAGC CGAGGACCAG GTCCTGGCCT CGGGCAACAT CCTCGCGGAG TGGACCGCCG ACGAGGCCCG CGCCACCGAG ATCAACCCCC GCGTGGCCCA CTACACCGGC CAGGTCGAAC TGGCCAGGGC CATCCAGGAC GGGCTCGCGG CCCGTCGCGA CGGCGACGAC GACACCGCCA CCACGCGGCT CGGCCGCGCC GTCGCACTCG CCCACGAGTC GGGCAACGAG GAGACCGCGA GGCTCCTCGG CAGGGTCGTC GACGTGGTCG ACCCCGTCAC CGGAACCGTC CGGCTCAAGC CCGGTGTCAG CAAGGTCGAC GAGATGACCC TGGACACCAA CTCGACCCGG ACCGTACGGA CCCGGCCGCG GCAGAGCGGG CCGGACGCCG GCGGTGCCCC CACACCCGGG TAG
|
Protein sequence | MPDTTERAAE PGFRIEVDQN VLLPVGGREV HAIVSVTSTG SVVVGDSVRA AEVIIVDTSG SMHGAKIDAA KQAARAAVGV LREGVHFAVV AGHSDASVLF PEGGGRMVRA DAVTRAEAAE AIGGLRADGG TRMGSWLVRA AELFATVEGG IKHAILLTDG QNNEPAAVFG QALDRVAGSF VCDCRGVGTD WRVEELRRID SALLGGGPGI IADPADMVAD FRAMAQASMG KTVADVVLRL WTPQNAVVRH VKQVAPTVSD LTGRGVERVP QSGDYPTGSW GTESRDYHVC VQVPPGVPGR QLRAGWVRVV LPGTAGAEDQ VLASGNILAE WTADEARATE INPRVAHYTG QVELARAIQD GLAARRDGDD DTATTRLGRA VALAHESGNE ETARLLGRVV DVVDPVTGTV RLKPGVSKVD EMTLDTNSTR TVRTRPRQSG PDAGGAPTPG
|
| |