Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3519 |
Symbol | |
ID | 9247388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4226866 |
End bp | 4228632 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | 1, 4-beta cellobiohydrolase |
Protein accession | YP_003681426 |
Protein GI | 297562452 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAC CCCCCCGTGC CGCGAACCTA CGTTCGTGGA CACGACGCGG CCTGGCCGCG ACCTCCGCCC TGGTGCTCGG CAGCACCCTG GCGGTCGCCT CCTCCGTACC CGCCAGCGCG GCGGCCGGTT GCGAGGTCGA CTACCAACTC AACGACTGGG GCTCCGGGTT CACCGCGAGC GTGGAGATCA CCAACCTCGG CGACGCGGTC AACGGCTGGA CCCTGGAATG GGACTTCGCC GGAAACCAGC GGATCACCAA CTCCTGGAAC GGCACCGTCA CCCAGAGCGG ACAGAGCGTC TCGGTCACCA ACGCCGGGCA CAACGCCTCA CTCTCCACCG ACGGCACCGC CAGCTTCGGC TTCCAGGGAA GCTACACCGG AAGCAACGCC GCGCCCACGG CCTTCGAACT GAACGGCGTG CTGTGCAGCG GTGACGTCGA GGAGCCGGAG GAACCGGAGG AACCGGAAGA GCCCGGGGAG CCCGAGGAGC CCGGCAGCAA CGGTCGGGTG GACAACCCGT ACGTGGGCGC TGAGGTGTAC GTCAACCCGA TCTGGTCGGC CAACGCCGCC GCCGAGCCCG GCGGGGACGC CGTGGCCGAC GAGCCCACCG GGGTGTGGCT GGACCGCATC AGCGCCATCG AGGGCAACGA CAGCCCCACC ACCGGCAGCA TGGGACTGCG CGATCACCTG GACGAGGCCC TGGCCCAGGC CAACGGTGAA CCCCTGGTGT TCCAGGTGGT CATCTACAAC CTGCCCGGCC GTGACTGCGC CGCTTTGGCC TCCAACGGCG AGCTGGGCCC GGACGAGATC GACCGGTACA AGAACGACTA CATCGATCCC ATCGCCCAGA TCCTGGCCGA TTACGAGGAC ACCGAGCTGC GGGTGGTGAC CACGGTCGAG ATCGACTCGC TGCCCAACCT GGTCACCAAC GTCTCCCCGC GCGAGACCGC GACCGAGAAC TGCGACGAGA TGCTGGCCAA CGGCAACTAC GTCGAGGGGG TGGGCTACGC GCTGGCGCAG CTGGGCGCGA TCGACAACGT CTACAACTAC GTCGACGCCG GCCACCACGG GTGGATCGGT TGGCAGGACA ACTTCACCGC CTCGGCGGCG CTGTTCTTCG AGGCGGCCAA CGCCGCCGGT GCCAGCCCCG ACGATGTGCA CGGGTTCATC GCCAACACCG CGAACTACTC GGCTCTGGTG GAGGAACACT TCTCCGTCGA CCAGACCATC GCCGGTACGC CGGTGCGCCA GTCGGAGTGG GTGGACTGGA ACCAGTTCAC CGACGAGTTG TCCTTCGCCC AGGCTCTGCG TGAGGAGCTG GTGGGTCAGG GGTTCGATTC GGGGATCGGG ATGCTCATCG ACACCTCCCG TAACGGGTGG GGTGGGCCGG ACCGTCCGGA CGGGCCGGGG CCGAGCACGG ATGTGAACGC CTACGTGGAC GGGGGCCGCT ACGACCGCCG TCTCCAGTCG GGGAACTGGT GCAACCAGTC CGGTGCGGGG CTGGGTGAGC GCCCCCAGGC CGCTCCGGAG GCGGGGATCG ACGCCTATGT GTGGATGAAG CCGCCGGGTG AGTCCGACGG GTCCAGTGAG TTCATCGAGA ACCCCGAGGG CAAGGGGTTC GACCGGATGT GCGATCCGAC CTATGAGGGC AACCCGCGCA ACAACTACAA CATGAGTGGC GCGCTGCCCA ACGCGCCGAT CTCGGGGCAC TGGTTCTCCG CGCAGTTCCA GGAGCTTTTG GCCAACGCCT ACCCGCCCAT CCAGTAG
|
Protein sequence | MSTPPRAANL RSWTRRGLAA TSALVLGSTL AVASSVPASA AAGCEVDYQL NDWGSGFTAS VEITNLGDAV NGWTLEWDFA GNQRITNSWN GTVTQSGQSV SVTNAGHNAS LSTDGTASFG FQGSYTGSNA APTAFELNGV LCSGDVEEPE EPEEPEEPGE PEEPGSNGRV DNPYVGAEVY VNPIWSANAA AEPGGDAVAD EPTGVWLDRI SAIEGNDSPT TGSMGLRDHL DEALAQANGE PLVFQVVIYN LPGRDCAALA SNGELGPDEI DRYKNDYIDP IAQILADYED TELRVVTTVE IDSLPNLVTN VSPRETATEN CDEMLANGNY VEGVGYALAQ LGAIDNVYNY VDAGHHGWIG WQDNFTASAA LFFEAANAAG ASPDDVHGFI ANTANYSALV EEHFSVDQTI AGTPVRQSEW VDWNQFTDEL SFAQALREEL VGQGFDSGIG MLIDTSRNGW GGPDRPDGPG PSTDVNAYVD GGRYDRRLQS GNWCNQSGAG LGERPQAAPE AGIDAYVWMK PPGESDGSSE FIENPEGKGF DRMCDPTYEG NPRNNYNMSG ALPNAPISGH WFSAQFQELL ANAYPPIQ
|
| |