Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2449 |
Symbol | |
ID | 9246299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2905630 |
End bp | 2907402 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | 1, 4-beta cellobiohydrolase |
Protein accession | YP_003680375 |
Protein GI | 297561401 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAA TCCTGCGCAC TCCGGACAGA CGTCCGGGCG CACGGCGCGG CCTCGCCGCG ACATCGGCCC TGATCGTGGG CGCCGCACTG GCCACGGCCG TGCCCGCCCC CGCCAGCGCG GCGGCCGGTT GCGAGGTCGA CTACCAACTC AACGACTGGG GCTCCGGGTT CACCGCGAGC GTGGAGATCA CCAACCTCGG CGACGCGGTC AACGGCTGGA CCCTGGAATG GGACTTCGCC GGAAACCAGC GGATCACCAA CTCCTGGAAC GGCACCGTCA CCCAGAGCGG ACAGAGCGTC TCGGCCACCG ACGCCGGGTA CAACGGCGCG ATCGCGACGG ACGGCACGGC CACCTTCGGC TTCCAGGCCA CGTACTCGGG CGCCAACGCC GTCCCCGCCG AGTTCACCCT GAACGGTGTG GCCTGTGAGG GCCTGGTCGA CCCCGGCCCC GACCCCGATC CCGACCCTGA CCCCGACCCC GATCCGGACC CGGACCCCGG TACGGGCGAG CGGGTGGACA ACCCGTACGT GGGCGCTGAG GTGTACGTCA ACCCGATCTG GTCGGCCAAC GCCGCCGCCG AGCCCGGCGG GGACGCCGTG GCCGACGAGC CCACCGGGGT GTGGCTGGAC CGCATCAGCG CCATCGAGGG CAACGACAGC CCCACCACCG GCAGCATGGG ACTGCGCGAC CACCTGGACG AGGCCCTGGC CCAGGCCAAC GGTGAACCCC TGGTGTTCCA GGTGGTCATC TACAACCTGC CCGGCCGTGA CTGCGCCGCT TTGGCCTCCA ACGGCGAGCT GGGCCCGGAC GAGATCGACC GGTACAAGAA CGACTACATC GATCCCATCG CCCAGATCCT GGCCGATTAC GAGGACACCG AGCTGCGGGT GGTGACCACG GTCGAGATCG ACTCGCTGCC CAACCTGGTC ACCAACGTCT CCCCGCGCGA GACCGCGACC GAGAACTGCG ACGAGATGCT GGCCAACGGC AACTACGTCG AGGGGGTGGG CTACGCGCTG GCGCAGCTGG GCGCGATCGA CAACGTCTAC AACTACGTCG ACGCCGGCCA CCACGGGTGG ATCGGTTGGC AGGACAACTT CACCGCCTCG GCGGCGCTGT TCTTCGAGGC GGCCAACGCC GCCGGTGCCA GCCCCGACGA TGTGCACGGG TTCATCGCCA ACACCGCGAA CTACTCGGCT CTGGTGGAGG ACAACTTCTC CGTCGACCAG ACCATCGCCG GTACGCCGGT GCGCCAGTCG GAGTGGGTGG ACTGGAACCA GTTCACCGAC GAGTTGTCCT TCGCCCAGGC TCTGCGTGAG GAGCTGGTGG GTCAGGGGTT CGATTCGGGG ATCGGGATGC TCATCGACAC CTCCCGTAAC GGGTGGGGTG GGCCGGACCG CCCGGACGGG CCGGGGCCGA GCACGGATGT GAACGCCTAC GTGGACGGGG GCCGCTACGA CCGCCGTCTC CAGTCGGGGA ACTGGTGCAA CCAGTCCGGT GCGGGGCTGG GTGAGCGCCC CCAGGCCGCT CCGGAGGCGG GGATCGACGC CTATGTGTGG ATGAAGCCGC CGGGTGAGTC CGACGGGTCC AGTGAGTTCA TCGAGAACCC CGAGGGCAAG GGGTTCGACC GGATGTGCGA TCCGACCTAT GAGGGCAACC CGCGCAACAA CTACAACATG AGCGGTGCGC TGCCCAACGC GCCGATCTCG GGGCACTGGT TCTCCGCGCA GTTCCAGGAG CTTTTGGCCA ACGCCTACCC GCCCATCCAG TAG
|
Protein sequence | MSRILRTPDR RPGARRGLAA TSALIVGAAL ATAVPAPASA AAGCEVDYQL NDWGSGFTAS VEITNLGDAV NGWTLEWDFA GNQRITNSWN GTVTQSGQSV SATDAGYNGA IATDGTATFG FQATYSGANA VPAEFTLNGV ACEGLVDPGP DPDPDPDPDP DPDPDPGTGE RVDNPYVGAE VYVNPIWSAN AAAEPGGDAV ADEPTGVWLD RISAIEGNDS PTTGSMGLRD HLDEALAQAN GEPLVFQVVI YNLPGRDCAA LASNGELGPD EIDRYKNDYI DPIAQILADY EDTELRVVTT VEIDSLPNLV TNVSPRETAT ENCDEMLANG NYVEGVGYAL AQLGAIDNVY NYVDAGHHGW IGWQDNFTAS AALFFEAANA AGASPDDVHG FIANTANYSA LVEDNFSVDQ TIAGTPVRQS EWVDWNQFTD ELSFAQALRE ELVGQGFDSG IGMLIDTSRN GWGGPDRPDG PGPSTDVNAY VDGGRYDRRL QSGNWCNQSG AGLGERPQAA PEAGIDAYVW MKPPGESDGS SEFIENPEGK GFDRMCDPTY EGNPRNNYNM SGALPNAPIS GHWFSAQFQE LLANAYPPIQ
|
| |