Gene Ndas_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1086 
Symbol 
ID9244932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1334308 
End bp1336152 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content73% 
IMG OID 
Productglycoside hydrolase family 2 sugar binding protein 
Protein accessionYP_003679034 
Protein GI297560060 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.358579 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGACGC ACCAGCGCTC ACAGCGCGGG GACGGGCCCG CCACCCCGCC CCGCCCCGAG 
TACCCGCGCC CGCAGTTCAC CCGCCCCGAC TGGCTGTGCC TGAACGGCAC CTGGGACTTC
GAGATCGACC GCGGCGACAG CGGGCTCGAA CGGGGCCTGC GCGAGGCCGG ACTCTCCGGC
ACCATCACCG TCCCCTTCTG CCCGGAGTCG GAGCTGTCCG GCGTGGGCGA CACCGACTTC
ATGGAGGCGG TCTGGTACCG GCGCACCGTC CGCGTCCCCG ACGCCTGGGC GGGACGGCGC
GTCCTACTGC ACTTCCAGGC CGTCGATCAC GACACCACCG TGTGGGTCAA CGGCACGGAG
GTCGTCCGCC ACCGCGGCGG GTTCACCCCC TTCACCGCCG ACCTGTCCGG GGTCGCCGCT
CCCGGCGAGG AGGCCGTGGT GGTCGTGCGC GCCCGCGACA GCCGACACGG CTTCCAGGCC
CGCGGCAAGC AGGCCACCTG GTACGCCAAC ACCGGATGCC ACTACACGCG CACCACCGGG
ATCTGGCAGA CCGTGTGGAT GGAACCGGTC CCCGACACGC ACCTGCGCCG CCCCCGCATC
ACCCCGGACC TGGCCAACGG GGCCTTCCAC CTGCTCCTGC CGCTGTCCGG CAGCGGCGAG
GGCCTGCGGG TGCGCGCCGT CCTGGAGGAC GGGGACGGCG AGGTCACCGC CGCCGAGGCC
CGCGCCGACC TGGACACCGC GCCCCGGCTG ACACTGGCGG TGCCGGTGGA GCGGCGCCGC
GCCTGGTCGC CGCAGGACCC GCACCTGTAC ACGCTGCGCC TGGAACTGCT GGACGCCGAG
GGGAGGGTGG TGGACCGGGC CGGGTCCTAC GCGGGCCTGC GGTCGGTCTC CGTCCAGGGC
AAGGCGATCC TGATCAACGG CCGACGCGTG TTCCAGCGCC TGGTCCTGGA CCAGGGCTAC
TACCCCGACG GGCTGATGAC CGCGCCCGAC GACGCCGCCC TGGTGCGCGA CATCGAACTG
GGTCTGCGGG CCGGGTTCAA CGGGGCCCGC CTGCACCAGA AGGTCTTCGA GGAGCGCTTC
CTCTACCACG CCGACCGGCT CGGCTACCTG GTCTGGGGCG AGTTCGGCGA CTGGGGGTGC
GCGGCCCACG GCGGCCCCGC CGACGACAAC CAGCGGCCGG ACGCCTCCTA CGTGGCCCAG
TGGACCGAAG CCGTGGAACG CGACTACTCC CACCCCAGCG TCGTCGGGTG GTGCCCGCTC
AACGAGACCT TCCAACGGCT GCACGACCGC TTCACCGCGC TGGACGACGT GACCCGCGCG
ATGTTCCTGG CCACCAAGGC GATCGACCCC TCCCGCCCGG TGGTGGACGC CTCCGGGTAC
GCCCACCGGG TCCCCGAGAC CGACGTCTAT GACTCCCACA GCTACGAGCA GGACCCCGAG
GCGTTCCGCA AGCAGATGAG CGGCCTCGCC CAGGACGACC CCTACGTCAA CCGCGGCGCG
GACGGCCGCG ACTGGTCGGT GCCCTACCGC GGCCAGCCCT ACTTCTGCAG CGAGTTCGGC
GGGATCCGCT GGGACCCCGG CACCGACGGC GGCGAGCAGT CGTGGGGGTA CGGCGACGAC
CCCAGGACCC CGGAGGAGTT CCACACCCGC TTCGAGGGTC TGACGGGCGT GCTGCTGGAG
GACCCGGACA TGTTCGGTTA CTGCTACACG CAGCTCACCG ACGTGTTCCA GGAACGCAAC
GGCGTCTACC GGTTCGACCG CGGCGACAAG CTCGACACCG CCCGCATCGC GGCCGCCCAG
CGCAGGACCG CCGCGTACGA GAAGGCCGAT CGTCGGCCCG AGTGA
 
Protein sequence
MPTHQRSQRG DGPATPPRPE YPRPQFTRPD WLCLNGTWDF EIDRGDSGLE RGLREAGLSG 
TITVPFCPES ELSGVGDTDF MEAVWYRRTV RVPDAWAGRR VLLHFQAVDH DTTVWVNGTE
VVRHRGGFTP FTADLSGVAA PGEEAVVVVR ARDSRHGFQA RGKQATWYAN TGCHYTRTTG
IWQTVWMEPV PDTHLRRPRI TPDLANGAFH LLLPLSGSGE GLRVRAVLED GDGEVTAAEA
RADLDTAPRL TLAVPVERRR AWSPQDPHLY TLRLELLDAE GRVVDRAGSY AGLRSVSVQG
KAILINGRRV FQRLVLDQGY YPDGLMTAPD DAALVRDIEL GLRAGFNGAR LHQKVFEERF
LYHADRLGYL VWGEFGDWGC AAHGGPADDN QRPDASYVAQ WTEAVERDYS HPSVVGWCPL
NETFQRLHDR FTALDDVTRA MFLATKAIDP SRPVVDASGY AHRVPETDVY DSHSYEQDPE
AFRKQMSGLA QDDPYVNRGA DGRDWSVPYR GQPYFCSEFG GIRWDPGTDG GEQSWGYGDD
PRTPEEFHTR FEGLTGVLLE DPDMFGYCYT QLTDVFQERN GVYRFDRGDK LDTARIAAAQ
RRTAAYEKAD RRPE