Gene Ndas_0759 details


Gene Information

Locus tag: Ndas_0759
Symbol:
ID: 9244601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 931038
End bp: 932681
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003678710
Protein GI: 297559736
COG category:
COG ID:
TIGRFAM ID:
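
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 931038
END_BP = 932681


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = gene_length_bp(START_BP, END_BP)
protein_length = protein_length_aa(gene_length)
print(gene_length, protein_length)  # 1644 547
```

Both results agree with the Gene Length (1644 bp) and Protein Length (547 aa) fields of the record.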


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.09187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCTG GAACGGGAAC CGGGAGGATC CGCGGGGGAC GCGGGGTCCG CGGCGCGTGC 
GCGGCGGTGC TCGCGGCGGC CCTGGCGGCG ACGGGCTGCT CCGCCCCGGG GGCGGACCCG
GACGTGACGC TGCGGATCCT GGCCGGGAGC GAGGTCGCCG ACCTGGAGCC GCTGCTGGAG
GAGGCCGGGG AGCGCACCGG GGTGACGGTG GAGCTGGAGT ACACCGGCAC GCTGGACGGC
ATGGCCCGGA TCGCCGCGGG TGACCGGCAG GGCGAGGGCG GGGAGTACGA CGCGGTGTGG
TTCGCCTCCA ACCGCTACCT CAACCTGGAC GGCGACGGCC GGTCGGCGGT GCACGAGGAG
ACGCCGGTCA TGGTCTCCCC CGTGGTGCTG GGCGTGGCCG CCGACCGCGC GCGGGAGCTG
GGCTGGGACG GGGGCGCGGA GGTGACCTGG TCGGACGTGC ACCGGGCGGT CGTAGAAGAA
GAGTTGATTT ACGGGATGAC CAACCCGGGC GCCTCCAACT CGGGTTTCTC CGCGCTGATC
GGGGTGGCCT CGGCGTTGGC CGACACGGGC GCGGCTCTGA GGTCGGAGGA CGTGGAGCGG
GTGGGGCCGG AGCTGGCGGA GTTCTTCGCG GGCCAGGAGG TGACGGCGGG GTCGTCGGGG
TGGCTCACGG ACGCGTTCGT GCGGCGCGCG GAGAGCGGCA TGCCGGTGGA CGGGCTGGTC
AACTACGAGT CGGTGATCCT GTCGCTGAAC GCCTCCGGCG CGCTGGAGGA GCCCCTGACG
GTCGTGTACC CGGCCGACGG CGTGGTGACG GCCGACTACC CGCTGACGCT GCTGTCGGAC
CCCTCCGAGC AGGCGTTGGA CGGGTACGAG CGGCTGGTGG GGGACCTGAC GTCGGAGGAG
ACGCAGCAGC AGATCGCCGA CCGGACGTGG CGGCGCCCGG TCACGGCCGG GGCCGAGCTG
TCCCCGCCGG TGCCCCCGCT GGTGGAGCTG CCGTTCCCGG CGAGCCGGGA GGTGGTGGAC
GGCCTGGTGG CGGACTACTC GGCCTCGCTG CGCCGTCCGG CGCGCACCGT GTACGCGCTG
GACGTGTCGG GGTCGATGGA GGGCGGCCGG CTCGCCGAGC TCCAGTCGGC CCTGGGCGCG
CTGACCGGCG CGGACGGCGG TTCGCTGGCC CGGAGCACGC AGGCCTTCCA GGAGCGGGAG
GTGGTGACGC TGCTGCCGTT CTCCACGTGG CCCGCCGACC CGCGGACCTT CGTGGTGGAG
CCGGGTTCGG TGGACGAGGT CAACGCGGAC CTGTCCGCGG CGGTGGAGGG GCTGGAGGCC
GAGGGCGACA CGGCCGCCTA CGACGCCCTG GTGCGGGCGT ACGAGCTGTT GGAGAGCGAC
ACGGGCTCGG ACGGCGACCC CCTGATGTCG GTGGTGCTGA TGACCGACGG CGAGGTGAAC
CGGGGCGTGG GGCTGGAGGG CTTCCGGGAG TCGCTGGCCG CGCGTTCGGA GCCGGTGGCG
CGGGTGCCGG TGTTCACGGT GCTGTTCGGC GAGTCGGACG TGCCGGAGAT GACCGAGCTG
GCGGAGCTGA CGGGCGGCCG GGTGTTCGAC GCCCGCGAGC AGGACCTGGA GCAGGTCTTC
CGGGAGATCC GGGGATACCA GTAG
 
Protein sequence
MAAGTGTGRI RGGRGVRGAC AAVLAAALAA TGCSAPGADP DVTLRILAGS EVADLEPLLE 
EAGERTGVTV ELEYTGTLDG MARIAAGDRQ GEGGEYDAVW FASNRYLNLD GDGRSAVHEE
TPVMVSPVVL GVAADRAREL GWDGGAEVTW SDVHRAVVEE ELIYGMTNPG ASNSGFSALI
GVASALADTG AALRSEDVER VGPELAEFFA GQEVTAGSSG WLTDAFVRRA ESGMPVDGLV
NYESVILSLN ASGALEEPLT VVYPADGVVT ADYPLTLLSD PSEQALDGYE RLVGDLTSEE
TQQQIADRTW RRPVTAGAEL SPPVPPLVEL PFPASREVVD GLVADYSASL RRPARTVYAL
DVSGSMEGGR LAELQSALGA LTGADGGSLA RSTQAFQERE VVTLLPFSTW PADPRTFVVE
PGSVDEVNAD LSAAVEGLEA EGDTAAYDAL VRAYELLESD TGSDGDPLMS VVLMTDGEVN
RGVGLEGFRE SLAARSEPVA RVPVFTVLFG ESDVPEMTEL AELTGGRVFD AREQDLEQVF
REIRGYQ
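
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below assumes the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle alternative start codons (this gene starts with ATG, so that case does not arise here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical. Only the first and last blocks of the gene sequence are used as input, to keep the example short.

```python
# Codon table in the conventional TCAG ordering; the amino acid
# assignments are those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_block = "ATGGCGGCTG GAACGGGAAC CGGGAGGATC"  # start of the gene sequence
last_block = "CGGGAGATCC GGGGATACCA GTAG"        # end of the gene sequence

print(translate(first_block))  # MAAGTGTGRI
print(translate(last_block))   # REIRGYQ
```

The two translated fragments match the first ten and last seven residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.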