Gene Ndas_5562 details


Gene Information

Locus tag: Ndas_5562
Symbol:
ID: 9249465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 760720
End bp: 763440
Gene Length: 2721 bp
Protein Length: 906 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: cellulose-binding family II
Protein accession: YP_003683447
Protein GI: 297564474
COG category:
COG ID:
TIGRFAM ID:
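
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 760720
END_BP = 763440
GENE_LENGTH_BP = 2721
PROTEIN_LENGTH_AA = 906


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 763440 - 760720 + 1 gives 2721 bp, and 2721 / 3 gives 907 codons, one of which is the stop codon, leaving 906 residues.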


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.655446
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCCCG AACAGCACCG GCCCCCCAGA CGCGGGCGGG GCTTACCGGC GCTCGCCGTC 
GCGGCGGCCA CCGCACTGGC CGCCTCCACC GTCGTCGCCA CCACCGCGGC CGGGGCCGCT
CCCGCCGAGC CGACCGCGGC ACAGGACGCC TACGAGTGGG ACAACGTCGA GATCGTCGGG
GGCGGCTTCG TCCCCGGCAT CGTCTTCAGC GAGACCGAAC CGGGCCTGGC CTACGCCCGC
ACCGACATCG GCGGCGCCTA CCGCTGGAAC CCCGACACCG AACGCTGGAT CCCCCTGCTC
GACTGGGTCG GCTGGGACGA GTGGGGCTAC ACGGGCGTCG TCAGCATCGC CACCGACCCC
GTGGACCCCG ATCGCGTGTA CGCGGCCGTG GGCACCTACA CCAACGGCTG GGACCCCAAC
AACGGGGCCA TCCTCAGCTC CGACGACCGC GGCGAGACCT GGGACGTCGC CGAGCTGCCC
TTCAAGCTCG GCGGCAACAT GCCCGGCCGC GGCCTCGGCG AGCGCCTGGC CGTGGACCCC
AACGACAACA GCGTCGTCTA CTTCGGCGCG AGCGGCGGCA ACGGCCTGTG GCGCAGCACC
GACCACGGCG CCACCTGGGC CGAGGTCGAG GCCTTCCCCA ACCCCGGCGA CTACGTCCAG
GACCCGGGCG ACGAGACCGG CATGATGTCC GACATCACCG GCGTGACCTG GGTGGACTTC
GACCCCAGGA CCGGGTCCGA GGGCTCGGTC ACCCAGGACG TCTACGTCGG CGTCGCCGAC
CTGGACGACC CGGTCTACCG CAGTCAGGAC GGCGGACAGA CCTGGGAGCC CGTCCCCGGC
GCGCCCACGG GGCACCTGCC CGCGCACTCG GTCGTGGACC ACGAGGGCGG CCAGCTCTAC
ATGGCCACCA CCAGCACCCC CGGCCCCTAC GACGGCGACT CCGGCGACGT GTGGCGCATG
GACCTGGCGA CGGGGGAGTG GACCGACATC AGCCCCGTCC CCTCCGGCTC CGAGGACAAC
TACTTCGGCT ACGGCGGCCT GACCATCGAC CGGCAGGACC CGGACACGCT GATGGTCGCC
ACCCAGATCT CCTGGTGGCC CGACATCCAG ATCTACCGCA GCACCGACCG CGGCCAGACC
TGGACCCAGG CGTGGGACTG GGGCGCCTAC CCCGAGCGCA CCACGCGCTA CGAGATGGAC
ATCTCCGGAG CGCCCTGGCT GGACTTCGGC GGTACCGGGA CGCCCCCGGA GACCCAGCCC
AAACTCGGCT GGATGACGCA GGCGATGGCG ATCGACCCGT TCGACTCCGA CCGTTTCATG
TACGGCACGG GCGCCACGGT CTACGGCAGC GACAACCTCA CCGACTGGGA CGCGGGCACC
ACCTTCGACA TCGGGGTCAG GGCGCACGGC ATCGAGGAGA CGGCCGTGAA CGACCTGATC
AGCCCGCCCG AGGGCGCGCC GCTGCACTCG GCCCTGCTCG ACATCGGCGG CTTCACCCAC
CAGGACCTGG AGACCGTCCC CGACCAGATG TACCAGCAGC CCTACTGGGG CCACGGGACC
AGCCTGGACT TCGCCGAACT CCAGCCCGCG ACCATCGCGC GGGTCGGCGG CAGCGACGCC
GAGGCCGCCA TCGGCCTGTC CACCGACGGC GGCGAGAGCT GGTGGGCCGG GCAGGAGCCC
GGCGGCGTGA CCGGCGGCGG CACGGTCGCG GTGAACGCGG ACGGCTCGTC CGTCGTGTGG
AGCCCCGACG GCACAGGCGT CCACGTCTCC ACCACCCTGG GCTCGTCGTG GACCGCCTCC
ACCGGCGTTC CGGCGGGCGC GAGGGTGGAG GCCGACCGGG TGGACCCGGA CGTGTTCTAC
GCCGTCTCCG GCGGCACCTT CTACACCAGC ACCGACGGCG GGGCCACCTT CACGGCGGGC
TTCGACGGGC TCCCGGCCGA GGGCAACATC CGCTTCGGCG CGGTGCCCGG CCACACCGGT
GACGTGTGGG TCGCCGGAGG CACCGGTGAC CACTACGGCA TGTGGCGGAG CACGGACGCC
GGGGCCTCCT TCGAGCAGGT CGAGGCCGTG GACGAGGGCG ACGCGGTCGG CTTCGGCGCG
CCCGCGCCGG GCTCGGACTA CCCGGCGGTG TACACCAGCT CCAGGATCGA CGGCGTGCGC
GGGATCTTCC GCTCCGACGA CGCGGGCGAG AGCTGGGTGC GGATCAACGA CGACCAGCAC
CAGTGGGCCT GGACGGGGGC GACCATCACC GGCGACCCCA ACGTCTACGG CCGGGTCTAC
GTCGGCACCA ACGGCCGGGG GATCGTCTAC GGCGACCTGG CGGGCGGGGG CGGCGACCCC
GAGCCCACGC CGGAGCCCAC GCCGGACCCC GAACCCACCC CGGAGCCGTC CCCGGACCCC
GAGCCCGGGG ACTGCGCGGT CGAGTACACC GTCACCAACA CCTGGAGCGG CGGGTTCCAG
GCCGGGGTGA CGGTCACCAA CGACGGCGAC GAGGCACTGG AGGGCTGGGA GGTCGGCTGG
GAGTTCACGG CCGGTGAGGA GGTCACGAGC CTCTGGAACG GCGCGTACAC CCAGGACGGC
GCGTCCGTGC GGGTCACCGA CGCCGGGTGG AACGCCCGGA TCGCCCCGGG GAGCTCGGTG
ACGGTCGGCT TCAACGGGAC CGTGGACGGC GAGCCCGCGC AGCCGACCGG GCTCACCCTC
GACGGGGAGG CCTGCGGCTG A
 
Protein sequence
MPPEQHRPPR RGRGLPALAV AAATALAAST VVATTAAGAA PAEPTAAQDA YEWDNVEIVG 
GGFVPGIVFS ETEPGLAYAR TDIGGAYRWN PDTERWIPLL DWVGWDEWGY TGVVSIATDP
VDPDRVYAAV GTYTNGWDPN NGAILSSDDR GETWDVAELP FKLGGNMPGR GLGERLAVDP
NDNSVVYFGA SGGNGLWRST DHGATWAEVE AFPNPGDYVQ DPGDETGMMS DITGVTWVDF
DPRTGSEGSV TQDVYVGVAD LDDPVYRSQD GGQTWEPVPG APTGHLPAHS VVDHEGGQLY
MATTSTPGPY DGDSGDVWRM DLATGEWTDI SPVPSGSEDN YFGYGGLTID RQDPDTLMVA
TQISWWPDIQ IYRSTDRGQT WTQAWDWGAY PERTTRYEMD ISGAPWLDFG GTGTPPETQP
KLGWMTQAMA IDPFDSDRFM YGTGATVYGS DNLTDWDAGT TFDIGVRAHG IEETAVNDLI
SPPEGAPLHS ALLDIGGFTH QDLETVPDQM YQQPYWGHGT SLDFAELQPA TIARVGGSDA
EAAIGLSTDG GESWWAGQEP GGVTGGGTVA VNADGSSVVW SPDGTGVHVS TTLGSSWTAS
TGVPAGARVE ADRVDPDVFY AVSGGTFYTS TDGGATFTAG FDGLPAEGNI RFGAVPGHTG
DVWVAGGTGD HYGMWRSTDA GASFEQVEAV DEGDAVGFGA PAPGSDYPAV YTSSRIDGVR
GIFRSDDAGE SWVRINDDQH QWAWTGATIT GDPNVYGRVY VGTNGRGIVY GDLAGGGGDP
EPTPEPTPDP EPTPEPSPDP EPGDCAVEYT VTNTWSGGFQ AGVTVTNDGD EALEGWEVGW
EFTAGEEVTS LWNGAYTQDG ASVRVTDAGW NARIAPGSSV TVGFNGTVDG EPAQPTGLTL
DGEACG
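
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to check that the nucleotide sequence translates to the listed protein, using only the first line of each sequence as sample input. The helper names are illustrative. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display blocking."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and the matching start of the protein.
dna = clean("ATGCCTCCCG AACAGCACCG GCCCCCCAGA CGCGGGCGGG GCTTACCGGC GCTCGCCGTC")
protein_start = clean("MPPEQHRPPR RGRGLPALAV")

print(translate(dna) == protein_start)
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence through `clean` and `gc_fraction` would allow a comparison with the GC content field reported in the Gene Information section.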