Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1086 |
Symbol | |
ID | 9244932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1334308 |
End bp | 1336152 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glycoside hydrolase family 2 sugar binding protein |
Protein accession | YP_003679034 |
Protein GI | 297560060 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.358579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGACGC ACCAGCGCTC ACAGCGCGGG GACGGGCCCG CCACCCCGCC CCGCCCCGAG TACCCGCGCC CGCAGTTCAC CCGCCCCGAC TGGCTGTGCC TGAACGGCAC CTGGGACTTC GAGATCGACC GCGGCGACAG CGGGCTCGAA CGGGGCCTGC GCGAGGCCGG ACTCTCCGGC ACCATCACCG TCCCCTTCTG CCCGGAGTCG GAGCTGTCCG GCGTGGGCGA CACCGACTTC ATGGAGGCGG TCTGGTACCG GCGCACCGTC CGCGTCCCCG ACGCCTGGGC GGGACGGCGC GTCCTACTGC ACTTCCAGGC CGTCGATCAC GACACCACCG TGTGGGTCAA CGGCACGGAG GTCGTCCGCC ACCGCGGCGG GTTCACCCCC TTCACCGCCG ACCTGTCCGG GGTCGCCGCT CCCGGCGAGG AGGCCGTGGT GGTCGTGCGC GCCCGCGACA GCCGACACGG CTTCCAGGCC CGCGGCAAGC AGGCCACCTG GTACGCCAAC ACCGGATGCC ACTACACGCG CACCACCGGG ATCTGGCAGA CCGTGTGGAT GGAACCGGTC CCCGACACGC ACCTGCGCCG CCCCCGCATC ACCCCGGACC TGGCCAACGG GGCCTTCCAC CTGCTCCTGC CGCTGTCCGG CAGCGGCGAG GGCCTGCGGG TGCGCGCCGT CCTGGAGGAC GGGGACGGCG AGGTCACCGC CGCCGAGGCC CGCGCCGACC TGGACACCGC GCCCCGGCTG ACACTGGCGG TGCCGGTGGA GCGGCGCCGC GCCTGGTCGC CGCAGGACCC GCACCTGTAC ACGCTGCGCC TGGAACTGCT GGACGCCGAG GGGAGGGTGG TGGACCGGGC CGGGTCCTAC GCGGGCCTGC GGTCGGTCTC CGTCCAGGGC AAGGCGATCC TGATCAACGG CCGACGCGTG TTCCAGCGCC TGGTCCTGGA CCAGGGCTAC TACCCCGACG GGCTGATGAC CGCGCCCGAC GACGCCGCCC TGGTGCGCGA CATCGAACTG GGTCTGCGGG CCGGGTTCAA CGGGGCCCGC CTGCACCAGA AGGTCTTCGA GGAGCGCTTC CTCTACCACG CCGACCGGCT CGGCTACCTG GTCTGGGGCG AGTTCGGCGA CTGGGGGTGC GCGGCCCACG GCGGCCCCGC CGACGACAAC CAGCGGCCGG ACGCCTCCTA CGTGGCCCAG TGGACCGAAG CCGTGGAACG CGACTACTCC CACCCCAGCG TCGTCGGGTG GTGCCCGCTC AACGAGACCT TCCAACGGCT GCACGACCGC TTCACCGCGC TGGACGACGT GACCCGCGCG ATGTTCCTGG CCACCAAGGC GATCGACCCC TCCCGCCCGG TGGTGGACGC CTCCGGGTAC GCCCACCGGG TCCCCGAGAC CGACGTCTAT GACTCCCACA GCTACGAGCA GGACCCCGAG GCGTTCCGCA AGCAGATGAG CGGCCTCGCC CAGGACGACC CCTACGTCAA CCGCGGCGCG GACGGCCGCG ACTGGTCGGT GCCCTACCGC GGCCAGCCCT ACTTCTGCAG CGAGTTCGGC GGGATCCGCT GGGACCCCGG CACCGACGGC GGCGAGCAGT CGTGGGGGTA CGGCGACGAC CCCAGGACCC CGGAGGAGTT CCACACCCGC TTCGAGGGTC TGACGGGCGT GCTGCTGGAG GACCCGGACA TGTTCGGTTA CTGCTACACG CAGCTCACCG ACGTGTTCCA GGAACGCAAC GGCGTCTACC GGTTCGACCG CGGCGACAAG CTCGACACCG CCCGCATCGC GGCCGCCCAG CGCAGGACCG CCGCGTACGA GAAGGCCGAT CGTCGGCCCG AGTGA
|
Protein sequence | MPTHQRSQRG DGPATPPRPE YPRPQFTRPD WLCLNGTWDF EIDRGDSGLE RGLREAGLSG TITVPFCPES ELSGVGDTDF MEAVWYRRTV RVPDAWAGRR VLLHFQAVDH DTTVWVNGTE VVRHRGGFTP FTADLSGVAA PGEEAVVVVR ARDSRHGFQA RGKQATWYAN TGCHYTRTTG IWQTVWMEPV PDTHLRRPRI TPDLANGAFH LLLPLSGSGE GLRVRAVLED GDGEVTAAEA RADLDTAPRL TLAVPVERRR AWSPQDPHLY TLRLELLDAE GRVVDRAGSY AGLRSVSVQG KAILINGRRV FQRLVLDQGY YPDGLMTAPD DAALVRDIEL GLRAGFNGAR LHQKVFEERF LYHADRLGYL VWGEFGDWGC AAHGGPADDN QRPDASYVAQ WTEAVERDYS HPSVVGWCPL NETFQRLHDR FTALDDVTRA MFLATKAIDP SRPVVDASGY AHRVPETDVY DSHSYEQDPE AFRKQMSGLA QDDPYVNRGA DGRDWSVPYR GQPYFCSEFG GIRWDPGTDG GEQSWGYGDD PRTPEEFHTR FEGLTGVLLE DPDMFGYCYT QLTDVFQERN GVYRFDRGDK LDTARIAAAQ RRTAAYEKAD RRPE
|
| |