Gene Ndas_5471 details

Gene Information

Locus tag: Ndas_5471
Symbol:
ID: 9249374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 661432
End bp: 663282
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: chaperone protein DnaK
Protein accession: YP_003683356
Protein GI: 297564383
COG category:
COG ID:
TIGRFAM ID:
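
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive start/end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(661432, 663282)
print(length)                     # 1851
print(protein_length_aa(length))  # 616
```

Both results match the Gene Length and Protein Length fields listed in the record.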


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGTG CGGTCGGAAT CGACCTCGGT ACGACGAACT CGTGTGTCGC GGTCCTGGAG 
GGCGGCGAGC CCACGGTCAT CGCCAACGCC GAGGGCGCCC GTACCACCCC GTCCGTCGTC
GCCTTCGCCA AGAACGGTGA GGTGCTCGTC GGCGAGGTCG CCAAGCGCCA GGCGGTCACC
AACGTCGACC GCACCATCCG CTCGGTCAAG CGCCACATCG GCACCGACTG GACGGTGAAG
ATCGACGACA AGACCTTCAA CCCCCAGCAG ATCAGCGCCT TCGTGCTCCA GAAGCTCAAG
CGCGACGCCG AGGCCTACCT GGGCGAGGAC GTGACCGACG CGGTCATCAC CGTCCCGGCC
TACTTCAGCG ACTCCCAGCG CCAGGCCACC AAGGAGGCCG GCACCATCGC GGGCCTCAAC
GTCCTGCGCA TCATCAACGA GCCGACCTCG GCCGCGCTGG CCTACCACCT GGAGAAGGAG
GACGAGGCCA CCATCCTGGT CTACGACCTC GGTGGCGGCA CCTTCGACGT CTCCCTCCTG
GAGGTCGGCG ACGGCGTCGT GGAGGTCAAG GCGACCAACG GCGACAACCA CCTGGGCGGC
GACGACTGGG ACCAGGCCAT CGTCGACTGG CTGGTCGAGC GCTTCAAGAA CTCCAACGGC
GTGGACCTGT CCAAGGACAA GATGGCCCTC CAGCGCCTGC GCGAGGCCGC GGAGAAGACC
AAGATCGAGC TGTCCAGCTC CAGCGAGTCG GCGATCAACC TGCCCTACAT CACGGCCTCG
GCCGAGGGCC CGCTGCACCT GGACGAGAAG CTCTCCCGCG CCGAGTTCCA GCGCCTGACC
GCCGACCTGG TCGAGCGGAC CAAGACCCCG TTCCAGCAGG TCCTCAAGGA CGCCGGGATC
AGCCTGGACC AGATCCACCA CGTGGTCATG GTCGGCGGCT CCACCCGTAT GCCCGCCATC
GTGGACCTGG TCAAGGAGAT GACCGGCAAG GACCCCAACA AGGGCGTCAA CCCGGACGAG
GTCGTGGCCA TCGGCGCCTC GCTCCAGGCC GGTGTGCTCA AGGGCGAGGT CAAGGACGTC
CTGCTGCTGG ACGTCACCCC GCTGTCGCTG GGCATCGAGA CCAAGGGCGG CGTGTTCACC
AAGCTCATCG AGCGCAACAC GACCATCCCG ACCAAGCGCT CCGAGATCTT CACGACGGCC
GACGACAACC AGCCGTCCGT GCAGATCCAG GTGTACCAGG GTGAGCGCGA CATCGCCCAG
TACAACAAGA AGCTGGGCGT CTTCGACCTG ACCGGTCTGC CCCCGGCGCC GCGCGGCGTC
CCGCAGATCG AGGTCGCCTT CGACATCGAC GCCAACGGCA TCGTCAGCGT CACCGCGAAG
GACCTGGGCA CCGGCAAGGA GCAGTCCGTC ACCATCTCCG GCGGCTCCGC GATGTCCAAG
GACGACATCG ACAAGATGGT CCGCGAGGCC GAGCAGTACG CGGAGGAGGA CCGCAAGCGC
CGCGAGGAGG CCGAGGTCCG CAACAACGCC GAGTCCCTCG TCTACCAGAC CGAGAAGGTC
ATCAAGGACA ACGAGGACAA GGTCCCGGCG GACGTGCGCT CCGAGACCGA GGCCGCCGTC
GCCGAGCTGA AGACCGCGCT GGAGGGCTCC GACGTGGAGG CCATCCGCAC CGCGAGCGAG
AAGGTCGCGC TGGCCAGCCA GAAGATCGGC TCCGCCATCT ACAGCCAGGG CCAGCAGGGC
GCCGAGGGCG ACGCCCAGGG CGCCCAGAGC TCCGCCGACG ACGCCGACGT CGTGGACGCC
GAGATCGTCG ACGAGGACAA CAAGGGCACC CAGGGCAACC AGCAGTCCTG A
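
The gene sequence is laid out in blocks of 10 bases, 60 per row, so the spaces and line breaks have to be removed before any analysis. A minimal sketch of that cleanup, together with one way the GC content field could be recomputed, is shown below. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGCACGTG CGGTCGGAAT CGACCTCGGT ACGACGAACT CGTGTGTCGC GGTCCTGGAG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Running the same two functions over all rows of the gene sequence would give a value to compare against the GC content reported in the record.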
 
Protein sequence
MARAVGIDLG TTNSCVAVLE GGEPTVIANA EGARTTPSVV AFAKNGEVLV GEVAKRQAVT 
NVDRTIRSVK RHIGTDWTVK IDDKTFNPQQ ISAFVLQKLK RDAEAYLGED VTDAVITVPA
YFSDSQRQAT KEAGTIAGLN VLRIINEPTS AALAYHLEKE DEATILVYDL GGGTFDVSLL
EVGDGVVEVK ATNGDNHLGG DDWDQAIVDW LVERFKNSNG VDLSKDKMAL QRLREAAEKT
KIELSSSSES AINLPYITAS AEGPLHLDEK LSRAEFQRLT ADLVERTKTP FQQVLKDAGI
SLDQIHHVVM VGGSTRMPAI VDLVKEMTGK DPNKGVNPDE VVAIGASLQA GVLKGEVKDV
LLLDVTPLSL GIETKGGVFT KLIERNTTIP TKRSEIFTTA DDNQPSVQIQ VYQGERDIAQ
YNKKLGVFDL TGLPPAPRGV PQIEVAFDID ANGIVSVTAK DLGTGKEQSV TISGGSAMSK
DDIDKMVREA EQYAEEDRKR REEAEVRNNA ESLVYQTEKV IKDNEDKVPA DVRSETEAAV
AELKTALEGS DVEAIRTASE KVALASQKIG SAIYSQGQQG AEGDAQGAQS SADDADVVDA
EIVDEDNKGT QGNQQS
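
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration rather than a reference implementation. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order, with '*' marking stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence from the record.
print(translate("ATGGCACGTG CGGTCGGAAT CGACCTCGGT"))
# MARAVGIDLG
print(translate("GAGATCGTCG ACGAGGACAA CAAGGGCACC CAGGGCAACC AGCAGTCCTG A"))
# EIVDEDNKGTQGNQQS
```

The two outputs match the first block and the last row of the protein sequence above.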