Gene Ndas_5566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5566 
Symbol 
ID9249469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp767419 
End bp770418 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content77% 
IMG OID 
ProductBeta-glucosidase 
Protein accessionYP_003683451 
Protein GI297564478 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.441417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCC AGCCGCACCT GCCGCCGTAC CGGGACCCCG CCCTGCCGCC CGAACGGCGC 
GTGGACGACC TGCTCGCCCG CCTCACCCCG GAGGAGCGGA TCGGCCTGCT CCACCAGCAC
CAGGCGCCCG TCGAGCGCCT GGGCGTCGGC CCCTTCCGGA CCGGGACCGA GGCGCTGCAC
GGACTGGCCT GGCTCGGACC GGCCACCGTG TACCCGCAGG CCGTCGGCCT GGCCAGCACC
TGGGACCCCG GCCTGCTCCG CCTCGTCGGC CGGGCCGTCG GCGACGAGGT GCTCGCCCGC
AGGGGCGCGG GCGCGTCGCC CAACGTCTGG GCGCCCGTGG TCAACCCGCT GCGCGACCCC
CGCTGGGGCC GCAACGAGGA GGGCTACTCC GAGGACGCGT GGCTGACCGG GCTCCTGGCC
GCCGCCTACG CCCGGGGGCT GGCCGGTGAC GGCGACACGC TGCGCCTCGC GCCCACCCTC
AAGCACTTCC TCGCCTACAA CAACGAGGAG GGCCGCTGCG AGACCTCCAG CGACCTGCCG
CCGAGGGTCC TGCGCGAGTA CGAGCTCCAG GCCTTCCTGC CCGCCCTGGC CGAGGGGGCG
GCCGTGGCCG TCATGCCCTC CTACAACCTG GTCAACGGGC GGCCCGCGCA CCTGAGCCCG
CTGATCGGCG ACGTGCTGCG CCCGGCCGCG CCCGACCGCC TGCTGGTGGT CAGCGACGCC
TACGCGCCCG CCAACCTCGT GGAGGCGCAG CACTTCCACC CCGACCTGCC CACGGCCTAC
GCGCACGCGA TCCGGGCCGG GCTCGACAGC TTCACCCAGG ACGACGACCG CACCGAGCCC
ACCCTCGGGC ACGTGCGCGA GGCGCTGCGC CGGGGGCTGC TCACCGCGGC CGACATCGAC
CGCGCCGCGC GCAACGTGCT CTCGGTCCGG ATCCGCCTGG GCGAGTTCGA CCCCGTGATG
GGCGGGGTTG ACGGGATCGC CGGGGTGGGC GGAGACCTGC TGGCCGGGGA CGGCCCGAAC
GGTGAGGTGT GGGGCGGGGG CGCGGGGGAC ACCGGGGGCG GCGGCACGGC GGGGGAGGCC
GGGCACGGGA CCGCCGGGGA CGCCGGGGCC TTCGACACCC CCGAGCACCG GGCGCTCGCC
CGGGAGGCGG CCACCCGCTC GATCGTGCTG CTCCGCAACG AGGGCGCGCT CCCGCTGCGC
GACGTGCGCC GGGTCGCCGT CGTCGGCCAG CTCGGCGACA CCGTGCTGGA GGACTGGTAC
AGCGGGACGC TGCCCTACGC CGTCACCGCG CGCGCCGGGA TCGCCGAGCG GGTGGAGACG
GTGTTCTGCG AGGGCGTGGA CCGCATCCGG CTGAGCACGT TCGGGCACGG ACTCGTGGTG
GACCCCGGCG ACGGGGGCCC CGCCCGGTTG GAGGCCCACG GCGTCTGCGG CGGGCACCGC
GGCGCGGACG GGCACGCGCC GACCGTGGAG GAGGAGGTCG AGGGCGCGGA CTTCGACCTG
CTGGACTGGG GCGGCGGCGC CTGCGCCCTG CGCTCGACGC GGACGGGGAG GTACCTCGAC
GTGGGACCCG GAGACGTCCT GGCCTTCACC GCGCCGGGGC CCGGCACCTG GGAGGTCACC
CAGACCTTCC GCCTGGAGGA GCGCAAGGGC CTGCACACCC TCGTCCACGT GCGCAGCGGC
CGCGGTGTCA CCTCGGATCC GCGCGACGGG AACCTGCGCC TGGCCGGAGC AGGGGAGGGG
GAGGGCCTGT TCATCGTGGA CGGGCTCGGC ACGGGAGCCG AGGAGGCCGC GCGGGTGGCC
TCCGCCGCCG ACGTCGCGGT CGTCGTCCTG GGCGACCACC CCATGGTCAA CGGGCGCGAG
ACCGAGGACC GGACCGACCT GGACCTGCCC GCCGCCCAGG AGCGGGTGCT GCGCGCCGTG
CGCGCGGCCA ACCCCGCCAC GGTCCTGGTG CTCAGCAGCG GCTGCCCGTT CGGGATCACC
TGGGCCGACG AGCACGTCCC GGCCGTCCTG TGGTCGGCGC ACGGCGGCCA GGAGTACGGC
CACGCGCTCG CGGACGTGCT CTTCGGCGAC GCCGAGCCCT CCGGACGGCT TCCGCAGACC
TGGTACCGCT CCGCCGACGA CCTGCCCGGG ATGCTCGACT ACGACATCGT CGGCGCGCGC
GGCACCTACC TGTACTTCGA GGGCGAGCCG CTCTACCCCT TCGGGCACGG CCTCGGCTAC
ACGTCGGTGG AGTACGCGGC CCCGGTGGTC CGCGTGGACG GCGGCCGGGT CACCGTCGCC
GTCATGGTCG CCAACACGGG CGCGCGGCCC GCCGACGAGG TGGTGCAGGT CTACACCCGC
CAGCTGGCCT CGCGTGTGCG GACCCCGCTG CGCGCGCTGC GCGGGTTCGC CCGGGTGCGC
GTGGAGCCGG GCCGCAGCGC GCTCGCCACG GTCTCCTTCG ACGTGGACCG GCTCGCGCTG
TGGGACACCA CCCGCGACCG CTTCGTGGTG GAGGACGCGC CGCGCCAGGT GCTGGTCGGG
CGTTCGGCGA CCGACATCCG CGCCACCGCG GCCCTGGACG TGCCGGGGGA GGCCATCCCG
CCCCGCGACG CGCGCACGCC GTGGTCGGCC GCGGGCTACG ACGACGCCCG GGGCACGTCC
CTGTGCCCGC TCACCCCCGA GCGGGGCGAC GCGGTGCGTT CCCGGGAGGG CGGCGCGTGG
GCCGCCTTCC GCGACGTGGA CCTCGGCGAC GGGGTCACCG GGTGCCGCCT CACGGTGAAC
GCCGAACACG CCAGCACGGT CCGCGTCCGC CTGGACGACC CCGAGGACGG ACCGACCGCC
GCGGTGGCCG ACGTCCCGGC GGGCAAGGAC GGTTACGACT TCACCGAGGT CGGCGCGGCC
TTCGAGGACG CGGGGGGAGC CGGGGGGATC CGTGACCTGT ACCTGGTCTT CGACCGGCCG
GGTACGGCGG TGGCCGACCT GGTCCTGACC GGCGCTGCTC CGGCACCGGA GGCGGACTGA
 
Protein sequence
MTLQPHLPPY RDPALPPERR VDDLLARLTP EERIGLLHQH QAPVERLGVG PFRTGTEALH 
GLAWLGPATV YPQAVGLAST WDPGLLRLVG RAVGDEVLAR RGAGASPNVW APVVNPLRDP
RWGRNEEGYS EDAWLTGLLA AAYARGLAGD GDTLRLAPTL KHFLAYNNEE GRCETSSDLP
PRVLREYELQ AFLPALAEGA AVAVMPSYNL VNGRPAHLSP LIGDVLRPAA PDRLLVVSDA
YAPANLVEAQ HFHPDLPTAY AHAIRAGLDS FTQDDDRTEP TLGHVREALR RGLLTAADID
RAARNVLSVR IRLGEFDPVM GGVDGIAGVG GDLLAGDGPN GEVWGGGAGD TGGGGTAGEA
GHGTAGDAGA FDTPEHRALA REAATRSIVL LRNEGALPLR DVRRVAVVGQ LGDTVLEDWY
SGTLPYAVTA RAGIAERVET VFCEGVDRIR LSTFGHGLVV DPGDGGPARL EAHGVCGGHR
GADGHAPTVE EEVEGADFDL LDWGGGACAL RSTRTGRYLD VGPGDVLAFT APGPGTWEVT
QTFRLEERKG LHTLVHVRSG RGVTSDPRDG NLRLAGAGEG EGLFIVDGLG TGAEEAARVA
SAADVAVVVL GDHPMVNGRE TEDRTDLDLP AAQERVLRAV RAANPATVLV LSSGCPFGIT
WADEHVPAVL WSAHGGQEYG HALADVLFGD AEPSGRLPQT WYRSADDLPG MLDYDIVGAR
GTYLYFEGEP LYPFGHGLGY TSVEYAAPVV RVDGGRVTVA VMVANTGARP ADEVVQVYTR
QLASRVRTPL RALRGFARVR VEPGRSALAT VSFDVDRLAL WDTTRDRFVV EDAPRQVLVG
RSATDIRATA ALDVPGEAIP PRDARTPWSA AGYDDARGTS LCPLTPERGD AVRSREGGAW
AAFRDVDLGD GVTGCRLTVN AEHASTVRVR LDDPEDGPTA AVADVPAGKD GYDFTEVGAA
FEDAGGAGGI RDLYLVFDRP GTAVADLVLT GAAPAPEAD