Gene Ndas_1621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1621 
Symbol 
ID9245471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1985092 
End bp1987191 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content77% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003679556 
Protein GI297560582 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.921819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.480885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTGC GGTCACCGCA CTACCCCTTT TCCGCGATCG TCGGCTGCGA CGCCGAGGAA 
CTCGACGACC TGGGCCTGTC CCTCGTCCTC ACCAGCGTCT CGCCGGAGAT CGGCGGCGTC
CTGGTGCGCG GCGAGAAGGG CACCGCCAAG TCCACCGCGG TCCGGGCCCT GGCCTCCCTC
CTGCCGCCCG TCGACGTCTA CCAGGGCGAC CGGTTCTCCG TGGACCCCGC CGACCCGGCG
CAGCACTCCC CCGACGGGCC CTTCGGGTCC GGCACGGCCG TGGAGAGCCG CCCGGTGCGC
CTGGTCGAAC TGCCCGTCGG CGCCACCGAG GACCGCGTCC TGGGCTCCCT GCACCTGGAA
CAGGCCCTCA CCCACGGCAG GGTCGCCTAC GAACCCGGCC TGCTGGCCCG GGCCCACCGC
GGCATCCTCT ACGTCGACGA GGTCAACCTC CTGCACGACC ACCTGGTCGA CCTGCTGCTG
GACGCCGCCG CGACCGGCCG GGTCACCGTG GAGCGCGACG GGTTCTCCGT GGAGCACGCG
GCCCGGTTCC TGCTCATCGG CACCATGAAC CCCGAGGAGG GCGAGCTGCG CCCGCAGCTC
CTGGACCGGT TCGGACTCAC CGTCGAGGTC GCCGCGCCGT CCGAGCCCGC GATCCGCGCC
GAGGTGGTGC GCAGGCGCAT GTCCCACGAC GCCGACCCCG CCGCCTTCGC CGGGCGCTAC
CACGGGGCCG AGAAGGCGCT GGCCGAACGC ATCGCGGCGG CCCGGGAGGC ACTGGGCCGG
GTGCGCCTGT CCGAGGCCGC GCTGCTGAAG ATCGCCGAGG TGTGCGCCGC CTACGACGTG
GACGGCCTGC GCGCCGACAT CGTGACCGCG CGCACGGCGA TGGCGCACGC GGCCTGGTCG
GGCCGGACCT CGGTCACCCG GGCCGACATC CGCCGCGCAG CCACGCTCGC CCTGCCGCAC
CGGCGCCGAC GCAACCCCTT CGACGCGCCG GGACTCGACG AGGAGCTCCT GGACCGGATC
CTGGGCGACG AGGAACCGCC GCCCGACCCC CCGGAGCCGC CGGGCCCGCA GGGGACCGAC
GACGGCGACG ATTCCGAAAC CCCGTCAGAC ACACAGGACC CACAGGATCC CTCCGACAAC
GCCAGTCCCC CGGACAACGC CGGGGACACC GGGGAAGCCG AGACCTCCGG CGGCGAACAG
CCCGACCCGG AGCGCTCCCC CGCCTCAGCC GAGCACGCGC CCGAGGACGC CGAGGGCGAC
TCCCCCGAAC CCCGCCCCTC CGGCGCCTCC CCGACCACCG CCAGGGCCGC CGCCCCCTAC
CGGACCCGGC TGCTCACCGT GCGGGGCTCC GGCGAGGGCG CCGACGGCAG GCGCAGCCGG
GCCGTCGGCA CGCGGGGCCG GCGGATCGGC GCCGCCGAGC CCGGCCGGGG TGCGGGCAGC
GCGGTCCACC TGGTGGAGAC CGTGCGGGCC GCCGCGCTGC GGCCCCAGGG CGGCGGCCGA
CTGCGGCTGC GCCCCCGCGA CCTGCGCGTC GCGGTCCGCG AGGGTCAGGA GACCAACCTG
GTGCTGTTCT GCGTGGACGC CTCCGGCTCC ATGGCGGCGC GCAGGCGTAT GACCGAGGTC
AAGACCGCGA TCCTGTCCCT GCTCCTGGAC GCCTACCGGC GCCGCGACAA GGTCGGCCTG
GTCACCTTCC GGGGGCGCGA GGCCGAACTC ACGCTGCCGC CGACCCGTTC GGTGGACGTG
GCCGCGGCCC GCCTCGACGA CCTGCCCGCC GGGGGGCGCA CCCCGCTGGC CGAGGGCCTG
GAGGAGGCGG CCCGCGTCCT GCGCCGCGAG CGGCTGCGGG ACCCGAGGCT GCGTCCGCTC
CTGGTCGTGG TCACCGACGG CCGGGCCACC GGCGGCAAGG GGGCGGTGGG CCGCGCGATG
GCCGCCGCCG ACCACGTCGC CGGACTGGGC GTGACCACCG TCGTGGTGGA CGGGGAGTCC
GGGCCGCTGC GCCTGGGCCT GGCCGCCTCC CTGGCCGCCC GCCTGGGCGC CGACCACATG
CCCGTCAGCG AGGTCAGCGC CGACGCGCTG GGCACCGCCG TACGAGAGAG GGCCGCCTGA
 
Protein sequence
MPLRSPHYPF SAIVGCDAEE LDDLGLSLVL TSVSPEIGGV LVRGEKGTAK STAVRALASL 
LPPVDVYQGD RFSVDPADPA QHSPDGPFGS GTAVESRPVR LVELPVGATE DRVLGSLHLE
QALTHGRVAY EPGLLARAHR GILYVDEVNL LHDHLVDLLL DAAATGRVTV ERDGFSVEHA
ARFLLIGTMN PEEGELRPQL LDRFGLTVEV AAPSEPAIRA EVVRRRMSHD ADPAAFAGRY
HGAEKALAER IAAAREALGR VRLSEAALLK IAEVCAAYDV DGLRADIVTA RTAMAHAAWS
GRTSVTRADI RRAATLALPH RRRRNPFDAP GLDEELLDRI LGDEEPPPDP PEPPGPQGTD
DGDDSETPSD TQDPQDPSDN ASPPDNAGDT GEAETSGGEQ PDPERSPASA EHAPEDAEGD
SPEPRPSGAS PTTARAAAPY RTRLLTVRGS GEGADGRRSR AVGTRGRRIG AAEPGRGAGS
AVHLVETVRA AALRPQGGGR LRLRPRDLRV AVREGQETNL VLFCVDASGS MAARRRMTEV
KTAILSLLLD AYRRRDKVGL VTFRGREAEL TLPPTRSVDV AAARLDDLPA GGRTPLAEGL
EEAARVLRRE RLRDPRLRPL LVVVTDGRAT GGKGAVGRAM AAADHVAGLG VTTVVVDGES
GPLRLGLAAS LAARLGADHM PVSEVSADAL GTAVRERAA