Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0759 |
Symbol | |
ID | 9244601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 931038 |
End bp | 932681 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003678710 |
Protein GI | 297559736 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.09187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCTG GAACGGGAAC CGGGAGGATC CGCGGGGGAC GCGGGGTCCG CGGCGCGTGC GCGGCGGTGC TCGCGGCGGC CCTGGCGGCG ACGGGCTGCT CCGCCCCGGG GGCGGACCCG GACGTGACGC TGCGGATCCT GGCCGGGAGC GAGGTCGCCG ACCTGGAGCC GCTGCTGGAG GAGGCCGGGG AGCGCACCGG GGTGACGGTG GAGCTGGAGT ACACCGGCAC GCTGGACGGC ATGGCCCGGA TCGCCGCGGG TGACCGGCAG GGCGAGGGCG GGGAGTACGA CGCGGTGTGG TTCGCCTCCA ACCGCTACCT CAACCTGGAC GGCGACGGCC GGTCGGCGGT GCACGAGGAG ACGCCGGTCA TGGTCTCCCC CGTGGTGCTG GGCGTGGCCG CCGACCGCGC GCGGGAGCTG GGCTGGGACG GGGGCGCGGA GGTGACCTGG TCGGACGTGC ACCGGGCGGT CGTAGAAGAA GAGTTGATTT ACGGGATGAC CAACCCGGGC GCCTCCAACT CGGGTTTCTC CGCGCTGATC GGGGTGGCCT CGGCGTTGGC CGACACGGGC GCGGCTCTGA GGTCGGAGGA CGTGGAGCGG GTGGGGCCGG AGCTGGCGGA GTTCTTCGCG GGCCAGGAGG TGACGGCGGG GTCGTCGGGG TGGCTCACGG ACGCGTTCGT GCGGCGCGCG GAGAGCGGCA TGCCGGTGGA CGGGCTGGTC AACTACGAGT CGGTGATCCT GTCGCTGAAC GCCTCCGGCG CGCTGGAGGA GCCCCTGACG GTCGTGTACC CGGCCGACGG CGTGGTGACG GCCGACTACC CGCTGACGCT GCTGTCGGAC CCCTCCGAGC AGGCGTTGGA CGGGTACGAG CGGCTGGTGG GGGACCTGAC GTCGGAGGAG ACGCAGCAGC AGATCGCCGA CCGGACGTGG CGGCGCCCGG TCACGGCCGG GGCCGAGCTG TCCCCGCCGG TGCCCCCGCT GGTGGAGCTG CCGTTCCCGG CGAGCCGGGA GGTGGTGGAC GGCCTGGTGG CGGACTACTC GGCCTCGCTG CGCCGTCCGG CGCGCACCGT GTACGCGCTG GACGTGTCGG GGTCGATGGA GGGCGGCCGG CTCGCCGAGC TCCAGTCGGC CCTGGGCGCG CTGACCGGCG CGGACGGCGG TTCGCTGGCC CGGAGCACGC AGGCCTTCCA GGAGCGGGAG GTGGTGACGC TGCTGCCGTT CTCCACGTGG CCCGCCGACC CGCGGACCTT CGTGGTGGAG CCGGGTTCGG TGGACGAGGT CAACGCGGAC CTGTCCGCGG CGGTGGAGGG GCTGGAGGCC GAGGGCGACA CGGCCGCCTA CGACGCCCTG GTGCGGGCGT ACGAGCTGTT GGAGAGCGAC ACGGGCTCGG ACGGCGACCC CCTGATGTCG GTGGTGCTGA TGACCGACGG CGAGGTGAAC CGGGGCGTGG GGCTGGAGGG CTTCCGGGAG TCGCTGGCCG CGCGTTCGGA GCCGGTGGCG CGGGTGCCGG TGTTCACGGT GCTGTTCGGC GAGTCGGACG TGCCGGAGAT GACCGAGCTG GCGGAGCTGA CGGGCGGCCG GGTGTTCGAC GCCCGCGAGC AGGACCTGGA GCAGGTCTTC CGGGAGATCC GGGGATACCA GTAG
|
Protein sequence | MAAGTGTGRI RGGRGVRGAC AAVLAAALAA TGCSAPGADP DVTLRILAGS EVADLEPLLE EAGERTGVTV ELEYTGTLDG MARIAAGDRQ GEGGEYDAVW FASNRYLNLD GDGRSAVHEE TPVMVSPVVL GVAADRAREL GWDGGAEVTW SDVHRAVVEE ELIYGMTNPG ASNSGFSALI GVASALADTG AALRSEDVER VGPELAEFFA GQEVTAGSSG WLTDAFVRRA ESGMPVDGLV NYESVILSLN ASGALEEPLT VVYPADGVVT ADYPLTLLSD PSEQALDGYE RLVGDLTSEE TQQQIADRTW RRPVTAGAEL SPPVPPLVEL PFPASREVVD GLVADYSASL RRPARTVYAL DVSGSMEGGR LAELQSALGA LTGADGGSLA RSTQAFQERE VVTLLPFSTW PADPRTFVVE PGSVDEVNAD LSAAVEGLEA EGDTAAYDAL VRAYELLESD TGSDGDPLMS VVLMTDGEVN RGVGLEGFRE SLAARSEPVA RVPVFTVLFG ESDVPEMTEL AELTGGRVFD AREQDLEQVF REIRGYQ
|
| |