Gene Ndas_3551 details

Gene Information

Locus tag: Ndas_3551
Symbol:
ID: 9247420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4261133
End bp: 4262650
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003681458
Protein GI: 297562484
COG category:
COG ID:
TIGRFAM ID:
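
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record; the function names are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS is unspliced and ends in a single stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4261133, 4262650)
print(length, protein_length_aa(length))  # prints: 1518 505
```

Both values match the Gene Length and Protein Length fields listed in this record.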


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.104304
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.664507
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCTGT CAGCGCTGTC GGACTTCGAC GCCGTCCCCC GGGACACCGA AGACGCCGTG 
TCCGTCCTGG TCGACATCAC CGCCCCCGAA CGCGAGGAGG AGACCGAACG CCCGCCCGCG
ACCCTCCAGG TCGTGCTGGA CCGCAGCGGG TCCATGGGCG GAGGACGCCT GGACGGCGCG
GTGCGCGCGC TGCTCTCCCT GGTGGAGCGG CTCGCGCCCT CCGACAACTT CGGACTCGTG
TCCTTCAACG ACCAGGCCCG GGTCGAGGTG CCCTGCGGGC CGTTGGAGGA CAAGGCGCGG
GTGCGCCGCC TGATCTCCGG GCTGCACGCG TCGGGCGGCA CCGACCTGTC CAGCGGACTG
CTGCGCGGCG TGCAGGAGGC CCGCCGCGCC GGAGCCGACA GGGGCGGCAC CCTGCTGCTG
ATCTCCGACG GCCACGCCAA CCAGGGCGTC ACCGACCACG ACCTGCTCCG ACAGGTGGCG
GCCGACGCCT ACGCCCACGG GGTCACCACC ACGTCCCTGG GGTACGGCCT GGGCTACGAC
GAGGAGCTGC TGGGCGCGGT GGCCGACGGC GGCGCGGGCA GCGCCCTGTT CGCCGAGGAC
CCCGACACCG CGGGCGGCCT CATCGCCCGG GAGGCCGAGT ACCTGCTGGC CAAGACGGCC
CAGGCCGTGT CCCTGCGGGT TCCGTCCGGC CCGCTCCTGC GCTCCGTCTC CGTGGTGGGC
GAGATGCCCT CCCACCGGCT CGCGGACGGA TCGGTGGTGG TCGAACTGGG CGACTTCCAC
TCCGGGGAGC GGCGCCGCCT GCTGCTGCGC CTGGAGGTCC GCGGGCTGTC CGCGCCGGGC
GCCGTCGCCG CGCTGGAGGT CGCCTACGCC GACCCGGCCA CGCTGGACAC CCGCACCGTG
TCCCTTTCCG TCGAACTCGA CGTGGTCGCG CGGGACGCGG CCGACGAGCG GGTGCCCCGG
CCCGAGGTGC GCGCGGAGGA GGTCCTGCAG CGGGCGCAGA CCGCCAAGAG GAGGGCCAGC
GAGGCCATGC GGCGGGGCGA CCGCTTCGGC GCCGCGGGTC TGCTGGAGGA GGCGCGGACG
GACCTGGCCG GCCACATGCC GGCGGCGCCC GCCGGGGCGG GTGCGGCGGC TCCGCCGCCG
GAGGTGCTCG CGCAGATGGA GGAACTGCGG CGAATGGCGG GGATGTCCCG GACCGGCGAC
GCCTCCCGGG TCTCCAAGTC GCTGTACGCG AGCCAGGCGG GTTACTCGCG CAAGAGCGGC
CGTCAGCGTC CGGGGGCCGC GCAGGACGGC GGGGAGCGGG CCGGAGGGGA CGGGGACCGG
GCGGAGGAGG ACCGGACCGG CGGCAACGGG AGCCGGGACG GTGCCGCCGG TCCCGGTCAG
GGCTCCGGCC CCGAGGTCAG GGGCGGCCCC GGCCGGGGCT CCGGTCGCGG TCGCCGCGGT
CGCCTCATCC GGGGCCAGCA GACCGACGGC GGACGACCCA CCCCCGACGA AGTCGCTCCT
CCCCCGCGGG AGTCCTGA
 
Protein sequence
MHLSALSDFD AVPRDTEDAV SVLVDITAPE REEETERPPA TLQVVLDRSG SMGGGRLDGA 
VRALLSLVER LAPSDNFGLV SFNDQARVEV PCGPLEDKAR VRRLISGLHA SGGTDLSSGL
LRGVQEARRA GADRGGTLLL ISDGHANQGV TDHDLLRQVA ADAYAHGVTT TSLGYGLGYD
EELLGAVADG GAGSALFAED PDTAGGLIAR EAEYLLAKTA QAVSLRVPSG PLLRSVSVVG
EMPSHRLADG SVVVELGDFH SGERRRLLLR LEVRGLSAPG AVAALEVAYA DPATLDTRTV
SLSVELDVVA RDAADERVPR PEVRAEEVLQ RAQTAKRRAS EAMRRGDRFG AAGLLEEART
DLAGHMPAAP AGAGAAAPPP EVLAQMEELR RMAGMSRTGD ASRVSKSLYA SQAGYSRKSG
RQRPGAAQDG GERAGGDGDR AEEDRTGGNG SRDGAAGPGQ GSGPEVRGGP GRGSGRGRRG
RLIRGQQTDG GRPTPDEVAP PPRES
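
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use, such as recomputing the GC content. The sketch below is illustrative only and not part of the original record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the GC fraction is computed simply as (G + C) / total bases, which may differ from how the database derived its GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks; return upper-case residues."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied verbatim from this record.
first_line = "ATGCACCTGT CAGCGCTGTC GGACTTCGAC GCCGTCCCCC GGGACACCGA AGACGCCGTG"
print(len(clean_sequence(first_line)))  # prints: 60
print(gc_fraction("ATGCACCTGT"))        # prints: 0.5
```

Passing the full gene sequence block to `gc_fraction` gives a value that can be compared with the GC content field of the record.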