Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4194 |
Symbol | |
ID | 9248068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5008669 |
End bp | 5009982 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | 1, 4-beta cellobiohydrolase |
Protein accession | YP_003682093 |
Protein GI | 297563119 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.28585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCA CACGCATAGC CGTCGGAGCA GCAGTGAGCT CCGTCTCGGC ACTGGCCCTG GGAACGGCGC TGCTCGCCAC GGCGCCCGCC TCGGCCGCCG ACTCCGAGTT CTACGTCAAC CCCAACACGT CGGCGGCCGT CTGGGTCGAG GAGAACCCGA ACGACCCCCG GGCCGACGTC ATCCGCGACC GCATCGCCTC GGTCGCCCAG GCCACCTGGT TCACCCAGTA CAACCCCGCC GAGGTCCGCG ACGACGTGGA CGCGGTGGTC AGCGCCGCCG ACGCCCAGGG CCAGACCCCC ATCCTGGTGG TCTACAACAT CCCCGGCCGC GACTGCGGCA ACCACAGCGG CGGCGGGGCG CCCAGCCACG ACGCCTACCG CGCCTGGGTC GACGAGGTCG CCGCGGGGCT GGAGGGCCGG TCCGCCACCA TCGTCCTGGA GCCCGACGCC CTGCCGCTGG TGAGCGGCTG CAGCGACCCG TCCGAGCTCC TGGACTCCAT GGCCTACGCG GGCAAGGCGC TCATGGAGGG CTCCTCCGAG GCCAGGGTCT ACTTCGACAT CGGCAACTCG GCCTGGCTGG ACCCGCAGGA GGCCGCCGGC CTGCTCAACG GCGCGGACGT CGCGAACAGC GCGCACGGCG TCGCCACCAA CACCTCCAAC TACAACTGGA CCCACGACGA GGTCGCCTTC GCGGAGGCCG TCATCGCCGC GACGGGCGTG CCCGGCCTCG GCGCCGTGAT CGACACCAGC CGCAACGGCA ACGGCCCCGC CCCCCAGAAC GAGTGGTGCG ACCCGCCGGG GCGGATGATC GGCCGCCCCA GCACCACCGA CACCGGGAAC CCGCTGATCG ACGCCTTCAT CTGGACCAAG CTGCCCGGTG AGGCCGACGG CTGCATCGCG CCCGCGGGGC AGTTCGTGCC CCAGGCCGCC TACGACATGG CGGTGAACGC CCCCGAGTAC CCCACCGACC CCGGCGAGCC GACCGACCCC GAGGAGCCCA CCGACCCGCC CGAGGGCGAG GGCTGCACGG CCGACTACAG GGTCGTCAGC GAGTGGGGCA ACGGCTTCCA GGCGGCGGTG ACGGTCACCG CCGAGGACTC CCTCAGCGGC TGGACCGTGA CGTGGACCTA CGCCGACGGG CAGCGGTTCA GCCAGGGCTG GAACGCCGAG TTCTCCAGCA GCGGGTCGCG GGTCACCGCC TCCGACCTCG GCTGGAACGG CACGCTCAGC GCCGGCGGCA GCACCGAGTT CGGGTTCACC GGGACCCACG GCGGTAGCAA CGGCGTGCCC GAGGTGACGT GCTCCGCGGC CTGA
|
Protein sequence | MSRTRIAVGA AVSSVSALAL GTALLATAPA SAADSEFYVN PNTSAAVWVE ENPNDPRADV IRDRIASVAQ ATWFTQYNPA EVRDDVDAVV SAADAQGQTP ILVVYNIPGR DCGNHSGGGA PSHDAYRAWV DEVAAGLEGR SATIVLEPDA LPLVSGCSDP SELLDSMAYA GKALMEGSSE ARVYFDIGNS AWLDPQEAAG LLNGADVANS AHGVATNTSN YNWTHDEVAF AEAVIAATGV PGLGAVIDTS RNGNGPAPQN EWCDPPGRMI GRPSTTDTGN PLIDAFIWTK LPGEADGCIA PAGQFVPQAA YDMAVNAPEY PTDPGEPTDP EEPTDPPEGE GCTADYRVVS EWGNGFQAAV TVTAEDSLSG WTVTWTYADG QRFSQGWNAE FSSSGSRVTA SDLGWNGTLS AGGSTEFGFT GTHGGSNGVP EVTCSAA
|
| |