Gene Ndas_3302 details


Gene Information

Locus tag: Ndas_3302
Symbol:
ID: 9247164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3940960
End bp: 3943755
Gene Length: 2796 bp
Protein Length: 931 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: glycoside hydrolase family 81
Protein accession: YP_003681214
Protein GI: 297562240
COG category:
COG ID:
TIGRFAM ID:
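
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3940960, 3943755)
print(length)                     # 2796
print(protein_length_aa(length))  # 931
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of this record.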


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGTCC TGGCCGCCGC CACGGCCCTG GTCTGCGCCG CGGGCGTGGC CTCCGCCCCT 
CCGGCGGCCG TGGCCGCCGA GGTCCCCGTC GGCTCGGGCG GCTACAGCGA CACCCTCCCC
GCCGGAGCCC CCGGGCCCTC CGACCTGAAC GGCGCCCCGG TCGACCCCAA GGTGACCGAG
GACTTCGAGG GGCACGCCCC CACCAACGAG TGGTGGTCGT CGCTGATCTT CCAGCGCTAC
CCCGACAACC CCCACGGTGA GAACCTCTTC GCCCACCCCG CCAGCTTCCA CCCCCACGAG
GGCGGGCTGG AGATGGGCTA CCCGCGGGAC TCCCAGCTCG TCGCCGACGG CCTCAAGTAC
GAGCACCCCC ACACCGCCGA CCTGTCCCTG GGGGTCCAGG GCCTGTCCTC CCCCGACACC
CGGGTCGCCG ACCACGGCGA CTGGACCGTC ACCGCGCACT GGGAGGGCTC CGGCCGGACC
CTGCGCGTGA CCATCGGCCA GGGCCTGCCC TTCGCCTACG CCGACACCGC GGGCGGCGAC
GCCCAGGTGA CCTTCTCCGG CACCCCCGAG GTCTGGCACC AGGAGGGCTC CGTGGTCGGC
GCCAGCGTGG GCGGCCGCCA CTACGCGCTG TTCGCCCCCT CCGACGCCCC CTGGACGCGC
TCAGGCGACA CCTTCACCGC CCCGACCGGC GCCGAGGGCC ACTACTCCGT GGCGGTCCTG
CCCTCACCCG AGGACCTGGA CACCTTCGCC CCCTACGCCC ACTCCTTCGT CACCGGCTCC
CGCGTGGAGT ACGACTACGA CGAGGCCGCC GCCACCCTGA CCAGCACCTA CCGGGTGGAG
ACCGAGCCGC GCGAGGGAAG TGGGGAGGGC ACGCTCATGG CCCTCTACCC GCACCACTGG
CAGGAGGCCT CCACCCCCGT CACCGACCTG TCCTACTCCT CCCCCCGGGG CGAGATGCGC
GTGGCGCAGG GCGCGAGCTT CACCACCGAG CTGGCCGCCC AGGGCATCCT GCCCAGCCTG
CCGACCGTGG AAAGCGCCGA CCACGACCGC ATGCGCCAGC TCATCGACGA GGTCCTGGCC
GAGCAGGAGC ACTTCCCCGA GCCCGGCGAC ACCTACTGGG ACGGCAAGGC GCTGGGCAGG
CTGGCCCAGC TGGTGCCCAT CGCCGACTCC ATCGGCCACA CCGAGGGCCG CGACGCGCTG
CTGGACCTGG TCAGGGGCCG TCTGGAGGAC TGGTTCACGG CCGGGGGGAC CCGCCAGTTC
GCCTACGAGG CCGACTGGGG CACCATGCTC GGCTACCCGG ACAGCTTCGG CTCCGCCACC
GAGGTCAACG ACCACGACTT CCACTACGGC TACTTCGTCT CCGCGGCGGC CGTCGTGGCC
CGCTACGACA GCGCCTGGGC CGCCGAGGAC GCCTGGGGCG GCATGGTCCG CCTGCTGATC
CGCGACGTGG CCGAGACCGA CCCGAACAGC GACATGTTCC CGCGCCTGCG GTCCTTCTCG
CCCTACGCGG GGCACGGATG GGCCTCCGGG CACGCGGGGT TCGCCGCCGG GAACAACCAG
GAGTCCTCCT CCGAGGGCAT GCACTTCGCC GCCGCCACCG CACTGTTCGG CGCCCTGACC
GGGGACGAGG AGCTGCGCGA CCTGGGCGTG TACCTGCACA CCACGCAGGC CTCCGCCATC
ACCCGCTACT GGCAGGACCA CGGCGGCGAC ACCTTCCCCG AGGGCTTCGA ACACGACGTC
GTCGGCATGG TGTGGGGCGA CGGCGGCGAC TACCGCATCT GGTGGGACGG CGGCGACGAG
GAGCACTACG GCATCAACTA CCTGCCCATC ACGGCGAGTT CGCTGTACCT GGGGCACGAC
CCCGAACACG CCGGGGCGAT GTACGAGTCG CTGGTGGACC GCCTGGGCCG CGAACCGCAG
ATCTGGCGCG ACATCCACTG GGAGCACCGC GCCCTCTCCG ACGGCGACGA CGCCCTGCGC
ATGTTCGAGG AGCAGTGGAG CACCTACGAA CCGGAGGCGG GCGAGTCCAA GCCGCACACC
TACCAGTGGG TCTCGACGCT CGCCGAGGTC GGCACCGTGG ACACCTCCGT GACCGCCGAC
ACCGCGCACT ACGCCGTCTT CACCGACGGC GGGGAGCGCA CGCACGTCGC CTTCAACCCC
TCCGGCTCCC CGCTGACGGT CACCTTCTCC GACGGCGTCG AACTGGAGGT GGAGCCCCGC
TCGCTGGCCT CCACCACGGG CGAGGGCGGC GGCGGTGACC CCGGCGACCC GGGCGACCCG
GGCGACCCGG GCGACCCGGG TGAGGGCAAC GTCGGCGACG GCGGCCTGCA CCTGACCCCC
TCCGCGCTCT CCAGCGCCCC GCACGGCTCC GCCGGCGAGA TCACCGTGGC CTCCGCGGGG
GGACGCAACC AGGACGGCCT GCCGCCGGGC GACCGCGTGG TGCTGCGCGC CGACGACCTG
ACCGGCGCCC ACACCGGCGG CGCCACCGCG TTCTCGCTGC CCGTGGACTC GGGTAACGCG
GTCGGCAACG CGGTCCAGGT CCGCGTGGTC TACGACCTGG ACGGCGACGG CACCGACGAC
CGCACGGAGA CCTACCGCTA CTTCGCCACC AACGACCTGA ACGGCTGGGA GGAGTACACG
CAGGGCCGGG GCGCGGCGAG CGCCGACGGG TCCCTCGGCG ACCTGGAGGG CGGCTCCGTC
CGCGTCGAGC TGTGGAGCGC CCTGGGCAAC GCCTCCTCGC GGGTCCTGGT CGGCCCCGAC
GGCGGCGCCT CGGTCACGAT CCCCTTCACC GACTGA
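
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. As a minimal sketch (the function names are hypothetical and not part of this record), the following strips the layout whitespace and computes the G+C fraction, which can then be compared with the GC content field when the full sequence is pasted in.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGGCGTCC TGGCCGCCGC CACGGCCCTG GTCTGCGCCG CGGGCGTGGC CTCCGCCCCT"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_fraction(first_row), 2))
```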
 
Protein sequence
MGVLAAATAL VCAAGVASAP PAAVAAEVPV GSGGYSDTLP AGAPGPSDLN GAPVDPKVTE 
DFEGHAPTNE WWSSLIFQRY PDNPHGENLF AHPASFHPHE GGLEMGYPRD SQLVADGLKY
EHPHTADLSL GVQGLSSPDT RVADHGDWTV TAHWEGSGRT LRVTIGQGLP FAYADTAGGD
AQVTFSGTPE VWHQEGSVVG ASVGGRHYAL FAPSDAPWTR SGDTFTAPTG AEGHYSVAVL
PSPEDLDTFA PYAHSFVTGS RVEYDYDEAA ATLTSTYRVE TEPREGSGEG TLMALYPHHW
QEASTPVTDL SYSSPRGEMR VAQGASFTTE LAAQGILPSL PTVESADHDR MRQLIDEVLA
EQEHFPEPGD TYWDGKALGR LAQLVPIADS IGHTEGRDAL LDLVRGRLED WFTAGGTRQF
AYEADWGTML GYPDSFGSAT EVNDHDFHYG YFVSAAAVVA RYDSAWAAED AWGGMVRLLI
RDVAETDPNS DMFPRLRSFS PYAGHGWASG HAGFAAGNNQ ESSSEGMHFA AATALFGALT
GDEELRDLGV YLHTTQASAI TRYWQDHGGD TFPEGFEHDV VGMVWGDGGD YRIWWDGGDE
EHYGINYLPI TASSLYLGHD PEHAGAMYES LVDRLGREPQ IWRDIHWEHR ALSDGDDALR
MFEEQWSTYE PEAGESKPHT YQWVSTLAEV GTVDTSVTAD TAHYAVFTDG GERTHVAFNP
SGSPLTVTFS DGVELEVEPR SLASTTGEGG GGDPGDPGDP GDPGDPGEGN VGDGGLHLTP
SALSSAPHGS AGEITVASAG GRNQDGLPPG DRVVLRADDL TGAHTGGATA FSLPVDSGNA
VGNAVQVRVV YDLDGDGTDD RTETYRYFAT NDLNGWEEYT QGRGAASADG SLGDLEGGSV
RVELWSALGN ASSRVLVGPD GGASVTIPFT D
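
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a hedged illustration rather than the method used to produce this record: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that is not an issue here). The `translate` helper is a hypothetical name.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGGGCGTCC TGGCCGCCGC CACGGCCCTG"))  # MGVLAAATAL
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence should reproduce the whole protein if the record is internally consistent.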