Gene Ndas_0568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0568 
Symbol 
ID9244410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp703836 
End bp705602 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content73% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003678521 
Protein GI297559547 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCAGTG GACGCCACCG ACAGGCCTCA CCGACCGCCG CGAACACCCG CAGGGTGGTC 
GCGTCGCCGT TGGGGATCAC CGCGATCGCC GTCGCCCTGA TCGCGGTGGC CGCGGTCGCG
GTCCCGGTCG GACTCGACAT GCTCGGCTGC GGCGACACCC GCTACCTGCG CGTGTCGGCG
ACCCAGAGCA TCGCGCCCGT GCTCCGCGAG GCCGCGGGCG AGTTCAACGA CGAGCGCCCC
AGCTACGACG GGGAGTGCGT GTACGCGCAG GTGGACGAGA TCGCGCCGCA CCGCATCATG
ACCGCCCTGT CCGGCGGCCA GGCGGGCGAC TCCACCATCG CCCCGCACGT GTGGGTGCCC
GAGTCCTCGG CCTGGGTCGA ACTGACCCGC GTATCCGGGA GCGGCACGCA CCGCATCGAC
ACCGAACCGC CCTCGCTGGC CAGCTCACCG GTCGTCCTGG CCGCGCCGCG GGGCGGCGGG
GGCCTGCCCG AGCCCGACGA GGCCGGGTGG CCCCTGGTCC TGCCCGACGA ACGCGAACGC
CCCCTGGTGA TGGTGGACCC CAACCGGGGC GCGGACGGTA TGGCCGTCAT GCACGCGGTC
CGCCGACACC TGGGCACCGG CGACGGAGCC GACACCGCCA TGACCGACTT CGTGCGCGAC
GTGCAGCTCG ACAGCGCGTT CGGCGAGATC GACCTCGCCA CCTTCTACAG CTCCGGCCGT
ACCGGCGGCG GGAGCGGCGA AGGCGGCGGA GGGCGCGTCG ACCCGCTGAT CGCCGTCCCC
GAACAGGCCG TGGTGTCCTA CAACGCCGAC CGCGCCGAGT CCGCGCCGCC GCTGGAGGCC
CACTACCCCA CCGAGGGCAC CGTCAGCCTC GACTACCCCT ACGTCACCAC CACCGACACG
GCCTCGCTGC GGTCCGCGGC CGCCGACCTG CACGAGGTGC TGCGCCGGGA CTCCTACCGC
GCCCGGCTCC GGGAGCTCGG CTTCCGCGAC CCCGACGGCA CCCTGTCCGG TACGGCCGGT
GCGGACCCCG ACGGGTTCGG TGTGACCGCG GAGGAGCCGC CCACCCACGA CGACCTGACC
GGGGACGCCC TGCTGGCCTC GGTCACCGAC TGGAACCGGC TGTCGATGCC CAGCCGCACC
CTGGTGCTCG CCGACACCTC CGCGAACATG GCGGAGGACC TCGACGGGGG CCCGTCCCGG
ATGGAGGTCG CCCAGCAGGC GGCGCTCATG GGCCTGTCGC TGTTCCCCGA CGAGACCGAC
ATGGGTCTGT GGCTGATGTC GGACGAGAAC GCGAGCGGCC GCGTCGAGGC CGCGGACATG
CACCCCCTGG GCGGGGCGGA GCAGGGCGAC ACCGCCACCC GTCGCCGGGA ACTCATCGGG
GTGGCCGAGG AGATCGCGGT CCGCGGCGGC GGTTCGCGCC TGTACGACAA CATCCTGGCC
GCCTACGACC GGGTGCAGGA CGACTACGAC GAGGACAAGA TCAACAGCGT CATCCTGCTC
ACCGCCGGCC AGGACGAGGG GTCCAGCGAC ATCGCGCACG CGGACCTGGT GGCGGCGCTC
CAGGACCGCT TCGACCCCGA GCGGCCGGTC AGCATGTTCA TCATCGCCTT CGGGTCGCGT
GAGCAGCAGG TCGCGGAGGA GGAGCTGCGG CGGATCGCGG CCGCCACCAG CGGTTCGCTG
TTCGTCACCG ACGACCCCGA CGAGATCGGC GACATCTTCC TCAGCTCCAT CTCACGGCGT
CTTTGCGTGC CCGACTGCGA CAGCTGA
 
Protein sequence
MPSGRHRQAS PTAANTRRVV ASPLGITAIA VALIAVAAVA VPVGLDMLGC GDTRYLRVSA 
TQSIAPVLRE AAGEFNDERP SYDGECVYAQ VDEIAPHRIM TALSGGQAGD STIAPHVWVP
ESSAWVELTR VSGSGTHRID TEPPSLASSP VVLAAPRGGG GLPEPDEAGW PLVLPDERER
PLVMVDPNRG ADGMAVMHAV RRHLGTGDGA DTAMTDFVRD VQLDSAFGEI DLATFYSSGR
TGGGSGEGGG GRVDPLIAVP EQAVVSYNAD RAESAPPLEA HYPTEGTVSL DYPYVTTTDT
ASLRSAAADL HEVLRRDSYR ARLRELGFRD PDGTLSGTAG ADPDGFGVTA EEPPTHDDLT
GDALLASVTD WNRLSMPSRT LVLADTSANM AEDLDGGPSR MEVAQQAALM GLSLFPDETD
MGLWLMSDEN ASGRVEAADM HPLGGAEQGD TATRRRELIG VAEEIAVRGG GSRLYDNILA
AYDRVQDDYD EDKINSVILL TAGQDEGSSD IAHADLVAAL QDRFDPERPV SMFIIAFGSR
EQQVAEEELR RIAAATSGSL FVTDDPDEIG DIFLSSISRR LCVPDCDS