Gene Ndas_0656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0656 
Symbol 
ID9244498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp803587 
End bp805059 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content76% 
IMG OID 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003678607 
Protein GI297559633 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.335861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.754125 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTACG ACCCGACCCT GGCGCGTCTG GCCAACGCCA CGCTGCTGGT GCCCTTCGAG 
TCCTACCAAG CCCCGCGCTG GGTCCTGGAG GGTCTGGCGG ACGGGATCTC CGGTGTCTGT
CTGTTCCACA ACAACCTGGA CGGCCCCGAG CAGGTGACCG CGCTCAACGC GACCCTGGCC
GGGGTCACCG ACACCCCGCT GATCTCCCTG GACGAGGAGG GCGGCGACGT CACCCGCATC
GGCCAGGCTC AGGGCAGCGA CTACCCCGGC AACGCGGCCC TGGGCGCCGT GGACGACCCC
GGCCTGACCC GCGCCACCCT GCGCTCCCTG GGCGGACGCC TGGCCGAGCT GGGCTTCAAC
CTGGACCTGG CGCCCTCGGT GGACGTCAAC GTCGCCGACG ACAACCCGGT GATCGGCACC
CGCTCCTTCG GCTCCGACCC CGAACTGGTG GCCCGGCACG CGGCGGCCGC CGTGCTCGGC
CTCCAGGAGG CGGGCGTGGC CGCGTGCGCC AAGCACTTCC CCGGCCACGG CGCCACCTCC
CAGGACTCCC ACCACGTGCT GCCGCGCGTG GAGGCCGACG CCGACCTCCT GCGCCGCCGC
GAGCTGCTGC CCTTCCGCGC GGCCGTGGAG GCGGGCGTCC GGTCGATCCT GACCGCGCAC
ATCGAGATGC CCGGCCTGGG CGGCGACGGC CCCGCCACGC TCACCCCGCG CATCCTCAAC
GACCTGCTCC GCGGCGAGCT GGGCTTCACC GGCACCGTGG TCAGCGACGC CATGGACATG
CAGGGCGTCA GCGGCCGTAT CGGCATCCCC GAGGCCTGCG TGCGCGCGGT GGCCGCCGGG
GTGGACCTGC TGTGCCTGGG CCGGTTCGTC TACGCCGACC AGGTCGAGCT GATCCGCGCC
GCGCTGGTGG ACGCGGTCCG GGAGGGCCGC CTGCCCGGGG AGCGCCTGGA GGAGGCCGCC
GGGCGCAACG CCGAGCTGCG GACCTGGATC CGCGCGGCCC AGACCCGGCG CTCGGACGCG
CCCGCGGCCG ACGGGGTGGG CCTGGTCGGG GCCCGCCGCG CGGTGCGCGT GGACGGCGAC
CTGCCGCCGC TGGCGGACCC CTACGTGGTG GAGGTGGACG CCCCCTCCGG CATGGCGGTG
GGCGAGGTCC CCTGGGGCCT GTCCCCCTGG TTTCCGGGCA CCGAGCGGGT CTCCCCCGAC
GTCGCGCACG CCGACCGGCT GGCCGCCAGC GCCCGGGACC GCGACCTGGT CGTGGTGGTG
CGGGACGCGC ACCGCTACCC CTCCGCCCAG GCGCTCGTCA ACCGCCTGCT CAGCTCCCAC
CCCGACGCCG TGGTGGTGGA GATGGGGCTG CCCATCTGGC GGCCCGACTG CGGCGCGCAC
GTGAGCACCT ACGGCGCCGC GCACGTGAAC GGGCTGAGCG CGGCGGAGCT GCTGGGGGCA
CCGGTGGGCG CCCCCTCCCC CGGCGTGAAC TGA
 
Protein sequence
MPYDPTLARL ANATLLVPFE SYQAPRWVLE GLADGISGVC LFHNNLDGPE QVTALNATLA 
GVTDTPLISL DEEGGDVTRI GQAQGSDYPG NAALGAVDDP GLTRATLRSL GGRLAELGFN
LDLAPSVDVN VADDNPVIGT RSFGSDPELV ARHAAAAVLG LQEAGVAACA KHFPGHGATS
QDSHHVLPRV EADADLLRRR ELLPFRAAVE AGVRSILTAH IEMPGLGGDG PATLTPRILN
DLLRGELGFT GTVVSDAMDM QGVSGRIGIP EACVRAVAAG VDLLCLGRFV YADQVELIRA
ALVDAVREGR LPGERLEEAA GRNAELRTWI RAAQTRRSDA PAADGVGLVG ARRAVRVDGD
LPPLADPYVV EVDAPSGMAV GEVPWGLSPW FPGTERVSPD VAHADRLAAS ARDRDLVVVV
RDAHRYPSAQ ALVNRLLSSH PDAVVVEMGL PIWRPDCGAH VSTYGAAHVN GLSAAELLGA
PVGAPSPGVN