Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4558 |
Symbol | |
ID | 9248439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5400831 |
End bp | 5403425 |
Gene Length | 2595 bp |
Protein Length | 864 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003682451 |
Protein GI | 297563477 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.213075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTCCGG CACGAAGAAA CCGCGCGGCG GCGCTCGGCG CCGCAGCCGC ACTCGGCACG AGTCTCCTCG TCGCCTCCCC CGCCCAGGCC GACGACGAAC CGGTCGAGCA GATCACCAAC GGCGACTTCT CCGACGGCAC CACCGGCTGG TGGGCGACCG AGGGGATCGA CCTCGCCGTC AGCGAGGACG GAGCCCTGTG CGTCGCGGTC CCCGGCGGCA CCAGCGACCC GTGGGACCAG ATCGTCGGCC AGAACGACAT CCCCCTCGTG GCGGAGGAGT CCTACTCCCT CACGTTCACC GCCTCCGGCG CCGACGGCCT GCCCGTCCGC GCGCTGGTCC AGGAGCCCGT CGACCCCTGG CGCACCGAAC TGGACGAGCG CCCCGTGCTC ACGCCGACGG CGACCGGCTA CGAGTACGTC TTCACCGCGT CGGCCGAGAT GGCCGACGCC CAGCTCGCCT TCCAGATCGG CGGGGCCGAG GAGGACTGGA CGTTCTGCCT GGACGACGTG TCGCTGCTCG GCGGCGCCGA ACCACCGGTC TACGAACCCG ACACCGGCCC GCGCGTCCGC GTCAACCAGG TCGGCTACCT CCCGCAGGGC CCCAAGAACG CCACCGTGGT CACCGACGCC GAGGAGGCCC TGCCCTGGCG GCTGGCCGAC GCCGGGGGCG CGGTCGTGGC CGAGGGCGAC ACCGTGCCGT ACGGCCTGGA CGAGTCCTCC GGGCAGAACG TGCACACCGT CGACTTCAGC GGCTTCACCG GGACGGGCGA GGGCTACACC CTGACCGCCG ACGGCGAGAC CAGCCACCCC TTCGGCATCA CCGCCGACCC CTACGCCGCC CTGGCCACGG ACGCACTGGA CTTCTACTAC ACCCAGCGCA GCGGTATCGA GATCCTCGAC GAACTCGCCC CCGGGTACGG GCGCGAGGCC GGGCACGCGG GCGTGGAGCC CAACCAGGGC GACACCGACG TCACCTGCCA CCCCTCAGCC CCGTGCGACT ACTCGCTCGA CGTGTCCGGC GGCTGGTACG ACGCGGGCGA CCACGGCAAG TACGTGGTGA ACGGCGGCAT CTCCGTCCAC CAGCTGATGA GCGTCTACGA GCGCGCGCAC ACCGCGCCCA CCGGCGACCC CGCCCGGGTC GCCGACTCCA CCCTGTCCGT CCCCGAGCGG GACAACGGCG TGCCGGACGT CCTGGACGAG GCCCGCTGGG AGATGGAGTT CCTGCTGTCC ATGCGGGTCC CCGAGGGCGA GGAGCTCGCG GGGATGGCCC ACCACAAGGT CCACGACGAG CGCTGGACCG GCCTGCCCCT GCTGCCCTCC GAGGACCCCC AGCCCCGCTA CCTGCACCCG CCGACGACGG CGGCCACACT GAACCTGGCC GCGACCGCCG CCCAGTGCTC CCGCGTGTTC GCCCCCTACG ACGCCGACTT CGCCGCGACC TGCCTGGAGG CGGCCGAGAC CGCCTGGGAC GCCGCGGTGG CCGAACCCGA GCGCTACGCG ACCGTCGGCG GCGAGGGCGG CGGCGCCTAC GACGACGACA ACGTGGCCGA CGAGTTCTAC TGGGCCGCCG CGGAGCTGTA CCTGGCCACC GGGCACGGGG ACTACGAGGA GGCGGTGGTC TCGTCCGAGC TGCACACCGC CGACGTGTTC ACCTCCGCCG GGTTCGACTG GCGCTGGACC GCCCCCCTGG GCCGCCTCCA GCTCGCCACC GTGCCGAGCG GGCTCGACGG GCGCGACGAG GTCCGCGCCT CCGTCGTGGA GGGGGCCGAG CAGTACCTGG CCGACGTCTC GGAGCACCCC TACGGGCTGG CCTACGACCC CGAGGGCGGG GTCTTCGCCT GGGGCTCCAA CAACCTCGTG CTCAACAACA TGGTGGTGAT GGCCAGCGCC TACGACATCG GCGGCGACAC GCGCTTCCGC GACGCGGTCC TGGAGGGCAT GGACTACATC CTGGGACGCA ACGCCCTCAA CCAGTCGTAC GTGACCGGGT ACGGCGAGAA CGACGCGGTC AACCAGCACA GCCGCTGGTA CGCCCACCAG CTCGACCCGC GCCTGCCCAA CCCGCCCGAG GGCACCCTGT CCGGCGGACC CAACTCCGAC ACCGGCACCT GGGACCCGGT CGCCCAGTCC AACCTGGACG GGTGCGCCCC CCAGTTCTGC TACATCGACC ACATCGACTC GTGGGCGACC AACGAGCTGA CCATCAACTG GAACTCCACG CTGGCCTGGG TGTCCGGGTT CGTCGCCGAC CAGGGCGACG CCTCCCTGCC CGCCGGCTCC TCCTGCGAGG TGGACTACGT GGTCCACGGC GAGTGGCACG ACCGCTTCAA CACCCAGGTG ACCGTCCGCA ACACGGGTGA CGAGCCCGTG AACGGCTGGA AGCTGGTGTG GTCCTTCCCC GGCGGCCAGA CCGTGGAGCG CCACTGGAGC AGCGCGCTGG AGCAGGCCGG GCACGCGGTG ACCGCCGTGA ACGCGGACTG GAACGGGACC ATCGAGCCCG GCGACGAGGT GACGTTCGGC TTCATCGGCA CCCTGGCGTC CGGGGCGAAC GCCGTGCCCG CGAGGTTCGC CCTCAACGGT TCGGTCTGTA GCTGA
|
Protein sequence | MSPARRNRAA ALGAAAALGT SLLVASPAQA DDEPVEQITN GDFSDGTTGW WATEGIDLAV SEDGALCVAV PGGTSDPWDQ IVGQNDIPLV AEESYSLTFT ASGADGLPVR ALVQEPVDPW RTELDERPVL TPTATGYEYV FTASAEMADA QLAFQIGGAE EDWTFCLDDV SLLGGAEPPV YEPDTGPRVR VNQVGYLPQG PKNATVVTDA EEALPWRLAD AGGAVVAEGD TVPYGLDESS GQNVHTVDFS GFTGTGEGYT LTADGETSHP FGITADPYAA LATDALDFYY TQRSGIEILD ELAPGYGREA GHAGVEPNQG DTDVTCHPSA PCDYSLDVSG GWYDAGDHGK YVVNGGISVH QLMSVYERAH TAPTGDPARV ADSTLSVPER DNGVPDVLDE ARWEMEFLLS MRVPEGEELA GMAHHKVHDE RWTGLPLLPS EDPQPRYLHP PTTAATLNLA ATAAQCSRVF APYDADFAAT CLEAAETAWD AAVAEPERYA TVGGEGGGAY DDDNVADEFY WAAAELYLAT GHGDYEEAVV SSELHTADVF TSAGFDWRWT APLGRLQLAT VPSGLDGRDE VRASVVEGAE QYLADVSEHP YGLAYDPEGG VFAWGSNNLV LNNMVVMASA YDIGGDTRFR DAVLEGMDYI LGRNALNQSY VTGYGENDAV NQHSRWYAHQ LDPRLPNPPE GTLSGGPNSD TGTWDPVAQS NLDGCAPQFC YIDHIDSWAT NELTINWNST LAWVSGFVAD QGDASLPAGS SCEVDYVVHG EWHDRFNTQV TVRNTGDEPV NGWKLVWSFP GGQTVERHWS SALEQAGHAV TAVNADWNGT IEPGDEVTFG FIGTLASGAN AVPARFALNG SVCS
|
| |