Gene Ndas_0567 details


Gene Information

Locus tag: Ndas_0567
Symbol:
ID: 9244409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 701800
End bp: 703563
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003678520
Protein GI: 297559546
COG category:
COG ID:
TIGRFAM ID:
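
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 701800
END_BP = 703563


def gene_length(start_bp, end_bp):
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1764 587
```

Both results match the reported Gene Length (1764 bp) and Protein Length (587 aa).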


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACGTC ACCGCGGAAG ATACGCAGAA GAGACCTCCG GCCGGTCCAG GCGCCGTCGC 
GGCCGCGGCG GCGCCTTCGC CGCCCTGGCC GCCGCGCTGG TGATCGTGGT CGGCCTCGCC
GCGGTGGGCG TCTACGTGTT CGGCCGGTCC GACGGCTGCG GCGGTTCCGA CATCGCGCTG
GACGTCGCGG TCAGCCCCGA ACTCGCCCCG GCCCTGACCG ACGTCGCCTC CGACTTCAAC
GCGGAGGAGC ACCAGGTGGA CGGGAGCTGC GTACAGGTGC AGGTGCGCCA GGTCGACTCC
GCCAACGTCG CCTTCGGCAT CACCGGCGCG GGAGCCACCA TGGGCGACAC CGACTCCGAC
GTGTGGATCC CGGACTCCTC CCTGTGGCCG CGCCTGGTCC AGAGCCAGGC GGGCGACGCC
GTCATCACCG AGACCGGCAC CTCCGTGGCC CGTTCGCCTC TGGTCCTCGC CGAACTCACC
GAGTTCGCCG ACGAGAACTC CCCGAGCAGT TGGGCGGAGG TCGTGCCCAC CACCGCCCCG
GGCCAGGAGG CGGAGCGCAC CGTCCGCGTG GTCGACCCCG CCCGCAACGC CACGGGGCTG
GGCACCCTCT ACCTCCTGCA CGGAGCGCTG GAGGAGGCCA GCCCGGACAC CGCCACGTTC
AACGCGCGGA TGACCGCGGT CCTGCAGGGC CTGCACCGGG GCGCGTCCTC GGACGAGGAG
GCCGCCTTCC TCGCCCTCAG CGGCGGCGGC GCCGAGGCCC CGCCCGTGAT GGTGATGTCG
GAGCAGGCCG TGTGGCGCTA CAACGCCGCG CACGGGGACG CCCCCGCCCA GGTCGGCTAC
ATGGAGGGCG GCACCTACTA CCTCGACTAC CCCTACGTCG TGCGCAGCGA GGAGAGCGCC
GTCACCCGCG CCGCCGAGGA GTTCCGCGAG GCGGTGCGCG GGGAGGAGGC GCGGACCCGG
CTGCTCGCGG AGGGGTTCCG CGGCCCCGAG GGGCAGATCG ACGCCTCCGT GCTCACCGAG
GAGGTCGGCT TCGCCGCCGA GCCGCCCACC GAACTGCCCA CGCCCGCCGC GGACTCGATC
ACCGGGCTGA TCCGCACCTG GAACCAGCTC AAGATGGACT CCCGGGTGCT CGCGATCGTG
GACATCTCCG GCTCGATGCT GGCCGAGGTG CCCGGGACCG GGATGACCCG CATGCAGGTG
ACCAGCGCCG CCGCCACGCA GGGCCTGGAG ATGTTCACGC CCAGTTCCGA GCTGGGCCTG
TGGGAGTTCT CCACCAACGT CAACAACGAA CTGCACTACC AGGAGATCGC GCCGATCCGC
GAACTCCAGG CGGCCGCCGA CGACGGCACC GCGCACCGGG ACGTCCTGGC GGGCGCGCTG
GCCTCGCTCC AGCCCCTGCC GCAGGGGGAC ACGGCGCTGT ACGAGACCTA CCTGGCGGCC
TACCAGGAGA TGTCGCGCAC CTACCAGCCC GACCGGACCA ACGTCATCCT CATGCTGACC
GACGGCGACA ACGACAACCC CGGCGGCCTG GGGCTGGACG AGCTGATGTC CCAGATCGAG
TCCCTGGCGA GCCCGTCACG GCCGATCCCG ATCATCACGA TCGCGTTCGG GCCCGACGTG
CAGAACCTGG AGCCGCTCCA GGAGATCGCC GCCGCCACCG GCGGCGCCGC CTACATGACC
GAGGACCCGA CCGAGATCGG CGAGATCTTC CTCCAGGCGT TCTCCCTGCG CATCTCCGAG
GACTCCGAGG AGACCACGGA GTAG
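
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases; `gc_percent` is a hypothetical helper name, and the example input is only the first block of the sequence, not the whole gene.

```python
def gc_percent(text):
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# First ten bases of the gene sequence above.
print(gc_percent("GTGGGACGTC"))  # prints: 70.0
```

Passing the full gene sequence to the same function gives a figure that can be compared with the GC content reported in the Gene Information section.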
 
Protein sequence
MGRHRGRYAE ETSGRSRRRR GRGGAFAALA AALVIVVGLA AVGVYVFGRS DGCGGSDIAL 
DVAVSPELAP ALTDVASDFN AEEHQVDGSC VQVQVRQVDS ANVAFGITGA GATMGDTDSD
VWIPDSSLWP RLVQSQAGDA VITETGTSVA RSPLVLAELT EFADENSPSS WAEVVPTTAP
GQEAERTVRV VDPARNATGL GTLYLLHGAL EEASPDTATF NARMTAVLQG LHRGASSDEE
AAFLALSGGG AEAPPVMVMS EQAVWRYNAA HGDAPAQVGY MEGGTYYLDY PYVVRSEESA
VTRAAEEFRE AVRGEEARTR LLAEGFRGPE GQIDASVLTE EVGFAAEPPT ELPTPAADSI
TGLIRTWNQL KMDSRVLAIV DISGSMLAEV PGTGMTRMQV TSAAATQGLE MFTPSSELGL
WEFSTNVNNE LHYQEIAPIR ELQAAADDGT AHRDVLAGAL ASLQPLPQGD TALYETYLAA
YQEMSRTYQP DRTNVILMLT DGDNDNPGGL GLDELMSQIE SLASPSRPIP IITIAFGPDV
QNLEPLQEIA AATGGAAYMT EDPTEIGEIF LQAFSLRISE DSEETTE
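
The protein sequence can be related back to the gene sequence by translating it with translation table 11. The sketch below is a simplified illustration, not the database's own method. The codon-to-residue assignments are the standard ones, which table 11 shares. The start-codon set is deliberately reduced to ATG, GTG and TTG (an assumption; table 11 permits further alternatives), and a start codon at the first position is read as methionine. This gene begins with GTG, which is why the protein starts with M rather than V.

```python
from itertools import product

BASES = "TCAG"
# Residues for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplified start-codon set (assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, cds_start=True):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and cds_start and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("GTGGGACGTC ACCGCGGAAG ATACGCAGAA"))  # prints: MGRHRGRYAE
```

The printed residues match the first ten residues of the protein sequence. Translating the final 24 bases with `cds_start=False` yields `DSEETTE`, the end of the protein, with the terminal TAG read as the stop codon.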