Gene Information |
Locus tag | Ndas_2363 |
Symbol | |
ID | 9246213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2809443 |
End bp | 2811026 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003680291 |
Protein GI | 297561317 |
COG category | |
COG ID | |
TIGRFAM ID | |
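The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2809443
END_BP = 2811026


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1584
print(protein_length(length_bp))  # 527
```

Both values agree with the Gene Length (1584 bp) and Protein Length (527 aa) fields of the record.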
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.14021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.334984 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACG ACGCGAGCCG ACCCGCGGTC ATCCCGGTGC CGACGCTGAC GGAACCGCGT CCGGGCGCCC TCCGGCTGGA CCGCGGACTC TCCCTGCGCG TCGAACCGGA GGCCGCGCTG GCGGGCGCGC TGTTACGCGA GGTCCTGGCC CCGCTGACCG GACCTGGGGC CGGGGGCGGT CCGGGCGGCG GTCCTGCCCC CGTCACCGTG CGCGTGGACC CCCGACGGGC CCGCGAACAC GGCTACACGC TGACCGTGGA CACCGACGGC GTCGAGATCG TCGGCGCGGA CCCGGTCGGG GCCTTCCACG GCGCGCAGAC CCTGCGCCAG CTCCTGCCCG CGCGGCTGTG GCGGACGGGC GCGTCCGCGC GGGGCGCGGG CATCCCCTGC ACGCGCGTGG AGGACCACCC CCGGCTGCGC TGGCGCGGCC TCATGCTGGA CGTGGCCCGG CACTTCCTGC CCAAGGACTT CGTCCTGCGC CTGATCGACC TGCTCGCCCT GCACAAGCTC AACGTCCTGC ACCTGCACCT GACCGACGAC CAGGGCTGGC GGATCGAGGT GCCCGGCTAC CCCCGGCTGA CGGAGGTCGG CGCGTGGCGG GCGCGCACCG TGGTCGGCGA CGACCTCGGC ACGAGCCCCG ATCCGGTCTA CGACCAGACC CCGCACGGCG GCTACTACAC CGAGGCCGAC CTGCGCGAGA TCGTGGCGCA CGCCCGCGAC CGGTGCGTCG AGGTGGTCCC GGAGATCGAC ATGCCCGGCC ACATGCAGGC GGCCATCGCC GCCTACCCCG AACTCGGCAA CACCGGCGCC CCGACGATCG TGCGCGAGCG CTGGGGGGTG GGCACCCGCC TGCTCAACGT GGACGAGACC GCGCTGCGGT TCTGCCGCGA CGTCCTGACG CACGTGATGG ACGTTTTCCC CGGCCGGTAC GTCCACTGCG GCGGGGACGA GGTCCCCAAG GACGAGTGGC GCGACAGCCC CCGCGTGCAG GAGCACATCC GCCGACTGGG GCTCGCGGAC GAGGAGGAGC TCCAGGGCTG GTTCACCGGG TACATCGCCG CGTTCCTGGC CGAGCACGGC CGCACGCTGG TGGGCTGGGA CGAGGTCCTG GACGGGGGAG CGCTCCCCGC CGACGTGGCG GTGATGGCCT GGCGCGAGCA GGAGAGGGGC GAGGCCGCGG CCCGGCTCGG CCACGAGACG GTCATGGCGC CCACCAGCCA CGTGTACCTC GACTACTACC AGGACGAGGA CACCGCGGCC GAGCCCCTCG CCTTCGGCGG CGGCTTCGTG CCGCTGTCCA CCGTCTTCGA CTTCGAACCG GTGCCCGCCG CGCTGGACGG GGAGGCCGCC CGGCGCGTCG TCGGCGCCGA GGCCCAGATG TGGACCGAGT ACGTGGCCAC CGAGGAGCAC ACCGAGTACA TGCTCTTCCC CCGCCTGTGC GCCTTCGCCG AGGCCGTCTG GCGCCAGCCG GGCGAGCGCG GCCACGACCG CGCCGACTTC CTCGCCCGGC TGCGCGCGCA CCTGGAGCGC CTGGACGCCC TCGGGGTCGG CTACCGGCCG CTGGACGGAA GGAACGGCCC GTGA
|
Protein sequence | MPDDASRPAV IPVPTLTEPR PGALRLDRGL SLRVEPEAAL AGALLREVLA PLTGPGAGGG PGGGPAPVTV RVDPRRAREH GYTLTVDTDG VEIVGADPVG AFHGAQTLRQ LLPARLWRTG ASARGAGIPC TRVEDHPRLR WRGLMLDVAR HFLPKDFVLR LIDLLALHKL NVLHLHLTDD QGWRIEVPGY PRLTEVGAWR ARTVVGDDLG TSPDPVYDQT PHGGYYTEAD LREIVAHARD RCVEVVPEID MPGHMQAAIA AYPELGNTGA PTIVRERWGV GTRLLNVDET ALRFCRDVLT HVMDVFPGRY VHCGGDEVPK DEWRDSPRVQ EHIRRLGLAD EEELQGWFTG YIAAFLAEHG RTLVGWDEVL DGGALPADVA VMAWREQERG EAAARLGHET VMAPTSHVYL DYYQDEDTAA EPLAFGGGFV PLSTVFDFEP VPAALDGEAA RRVVGAEAQM WTEYVATEEH TEYMLFPRLC AFAEAVWRQP GERGHDRADF LARLRAHLER LDALGVGYRP LDGRNGP
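The two sequences can be checked against each other with a short script. This is a sketch only, with several assumptions: the gene sequence appears to be listed in coding orientation (it begins with ATG and ends with TGA) although the gene lies on the minus strand, so no reverse complement is applied; the codon table is built from the standard amino acid assignments, which translation table 11 shares for elongation codons; and the helper names `clean`, `gc_fraction` and `translate` are hypothetical, not part of any database tool.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGCCCGACG ACGCGAGCCG ACCCGCGGTC"
print(translate(first_blocks))  # MPDDASRPAV
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.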