Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5562 |
Symbol | |
ID | 9249465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 760720 |
End bp | 763440 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | cellulose-binding family II |
Protein accession | YP_003683447 |
Protein GI | 297564474 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.655446 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCCG AACAGCACCG GCCCCCCAGA CGCGGGCGGG GCTTACCGGC GCTCGCCGTC GCGGCGGCCA CCGCACTGGC CGCCTCCACC GTCGTCGCCA CCACCGCGGC CGGGGCCGCT CCCGCCGAGC CGACCGCGGC ACAGGACGCC TACGAGTGGG ACAACGTCGA GATCGTCGGG GGCGGCTTCG TCCCCGGCAT CGTCTTCAGC GAGACCGAAC CGGGCCTGGC CTACGCCCGC ACCGACATCG GCGGCGCCTA CCGCTGGAAC CCCGACACCG AACGCTGGAT CCCCCTGCTC GACTGGGTCG GCTGGGACGA GTGGGGCTAC ACGGGCGTCG TCAGCATCGC CACCGACCCC GTGGACCCCG ATCGCGTGTA CGCGGCCGTG GGCACCTACA CCAACGGCTG GGACCCCAAC AACGGGGCCA TCCTCAGCTC CGACGACCGC GGCGAGACCT GGGACGTCGC CGAGCTGCCC TTCAAGCTCG GCGGCAACAT GCCCGGCCGC GGCCTCGGCG AGCGCCTGGC CGTGGACCCC AACGACAACA GCGTCGTCTA CTTCGGCGCG AGCGGCGGCA ACGGCCTGTG GCGCAGCACC GACCACGGCG CCACCTGGGC CGAGGTCGAG GCCTTCCCCA ACCCCGGCGA CTACGTCCAG GACCCGGGCG ACGAGACCGG CATGATGTCC GACATCACCG GCGTGACCTG GGTGGACTTC GACCCCAGGA CCGGGTCCGA GGGCTCGGTC ACCCAGGACG TCTACGTCGG CGTCGCCGAC CTGGACGACC CGGTCTACCG CAGTCAGGAC GGCGGACAGA CCTGGGAGCC CGTCCCCGGC GCGCCCACGG GGCACCTGCC CGCGCACTCG GTCGTGGACC ACGAGGGCGG CCAGCTCTAC ATGGCCACCA CCAGCACCCC CGGCCCCTAC GACGGCGACT CCGGCGACGT GTGGCGCATG GACCTGGCGA CGGGGGAGTG GACCGACATC AGCCCCGTCC CCTCCGGCTC CGAGGACAAC TACTTCGGCT ACGGCGGCCT GACCATCGAC CGGCAGGACC CGGACACGCT GATGGTCGCC ACCCAGATCT CCTGGTGGCC CGACATCCAG ATCTACCGCA GCACCGACCG CGGCCAGACC TGGACCCAGG CGTGGGACTG GGGCGCCTAC CCCGAGCGCA CCACGCGCTA CGAGATGGAC ATCTCCGGAG CGCCCTGGCT GGACTTCGGC GGTACCGGGA CGCCCCCGGA GACCCAGCCC AAACTCGGCT GGATGACGCA GGCGATGGCG ATCGACCCGT TCGACTCCGA CCGTTTCATG TACGGCACGG GCGCCACGGT CTACGGCAGC GACAACCTCA CCGACTGGGA CGCGGGCACC ACCTTCGACA TCGGGGTCAG GGCGCACGGC ATCGAGGAGA CGGCCGTGAA CGACCTGATC AGCCCGCCCG AGGGCGCGCC GCTGCACTCG GCCCTGCTCG ACATCGGCGG CTTCACCCAC CAGGACCTGG AGACCGTCCC CGACCAGATG TACCAGCAGC CCTACTGGGG CCACGGGACC AGCCTGGACT TCGCCGAACT CCAGCCCGCG ACCATCGCGC GGGTCGGCGG CAGCGACGCC GAGGCCGCCA TCGGCCTGTC CACCGACGGC GGCGAGAGCT GGTGGGCCGG GCAGGAGCCC GGCGGCGTGA CCGGCGGCGG CACGGTCGCG GTGAACGCGG ACGGCTCGTC CGTCGTGTGG AGCCCCGACG GCACAGGCGT CCACGTCTCC ACCACCCTGG GCTCGTCGTG GACCGCCTCC ACCGGCGTTC CGGCGGGCGC GAGGGTGGAG GCCGACCGGG TGGACCCGGA CGTGTTCTAC GCCGTCTCCG GCGGCACCTT CTACACCAGC ACCGACGGCG GGGCCACCTT CACGGCGGGC TTCGACGGGC TCCCGGCCGA GGGCAACATC CGCTTCGGCG CGGTGCCCGG CCACACCGGT GACGTGTGGG TCGCCGGAGG CACCGGTGAC CACTACGGCA TGTGGCGGAG CACGGACGCC GGGGCCTCCT TCGAGCAGGT CGAGGCCGTG GACGAGGGCG ACGCGGTCGG CTTCGGCGCG CCCGCGCCGG GCTCGGACTA CCCGGCGGTG TACACCAGCT CCAGGATCGA CGGCGTGCGC GGGATCTTCC GCTCCGACGA CGCGGGCGAG AGCTGGGTGC GGATCAACGA CGACCAGCAC CAGTGGGCCT GGACGGGGGC GACCATCACC GGCGACCCCA ACGTCTACGG CCGGGTCTAC GTCGGCACCA ACGGCCGGGG GATCGTCTAC GGCGACCTGG CGGGCGGGGG CGGCGACCCC GAGCCCACGC CGGAGCCCAC GCCGGACCCC GAACCCACCC CGGAGCCGTC CCCGGACCCC GAGCCCGGGG ACTGCGCGGT CGAGTACACC GTCACCAACA CCTGGAGCGG CGGGTTCCAG GCCGGGGTGA CGGTCACCAA CGACGGCGAC GAGGCACTGG AGGGCTGGGA GGTCGGCTGG GAGTTCACGG CCGGTGAGGA GGTCACGAGC CTCTGGAACG GCGCGTACAC CCAGGACGGC GCGTCCGTGC GGGTCACCGA CGCCGGGTGG AACGCCCGGA TCGCCCCGGG GAGCTCGGTG ACGGTCGGCT TCAACGGGAC CGTGGACGGC GAGCCCGCGC AGCCGACCGG GCTCACCCTC GACGGGGAGG CCTGCGGCTG A
|
Protein sequence | MPPEQHRPPR RGRGLPALAV AAATALAAST VVATTAAGAA PAEPTAAQDA YEWDNVEIVG GGFVPGIVFS ETEPGLAYAR TDIGGAYRWN PDTERWIPLL DWVGWDEWGY TGVVSIATDP VDPDRVYAAV GTYTNGWDPN NGAILSSDDR GETWDVAELP FKLGGNMPGR GLGERLAVDP NDNSVVYFGA SGGNGLWRST DHGATWAEVE AFPNPGDYVQ DPGDETGMMS DITGVTWVDF DPRTGSEGSV TQDVYVGVAD LDDPVYRSQD GGQTWEPVPG APTGHLPAHS VVDHEGGQLY MATTSTPGPY DGDSGDVWRM DLATGEWTDI SPVPSGSEDN YFGYGGLTID RQDPDTLMVA TQISWWPDIQ IYRSTDRGQT WTQAWDWGAY PERTTRYEMD ISGAPWLDFG GTGTPPETQP KLGWMTQAMA IDPFDSDRFM YGTGATVYGS DNLTDWDAGT TFDIGVRAHG IEETAVNDLI SPPEGAPLHS ALLDIGGFTH QDLETVPDQM YQQPYWGHGT SLDFAELQPA TIARVGGSDA EAAIGLSTDG GESWWAGQEP GGVTGGGTVA VNADGSSVVW SPDGTGVHVS TTLGSSWTAS TGVPAGARVE ADRVDPDVFY AVSGGTFYTS TDGGATFTAG FDGLPAEGNI RFGAVPGHTG DVWVAGGTGD HYGMWRSTDA GASFEQVEAV DEGDAVGFGA PAPGSDYPAV YTSSRIDGVR GIFRSDDAGE SWVRINDDQH QWAWTGATIT GDPNVYGRVY VGTNGRGIVY GDLAGGGGDP EPTPEPTPDP EPTPEPSPDP EPGDCAVEYT VTNTWSGGFQ AGVTVTNDGD EALEGWEVGW EFTAGEEVTS LWNGAYTQDG ASVRVTDAGW NARIAPGSSV TVGFNGTVDG EPAQPTGLTL DGEACG
|
| |