Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0656 |
Symbol | |
ID | 9244498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 803587 |
End bp | 805059 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003678607 |
Protein GI | 297559633 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.335861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.754125 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTACG ACCCGACCCT GGCGCGTCTG GCCAACGCCA CGCTGCTGGT GCCCTTCGAG TCCTACCAAG CCCCGCGCTG GGTCCTGGAG GGTCTGGCGG ACGGGATCTC CGGTGTCTGT CTGTTCCACA ACAACCTGGA CGGCCCCGAG CAGGTGACCG CGCTCAACGC GACCCTGGCC GGGGTCACCG ACACCCCGCT GATCTCCCTG GACGAGGAGG GCGGCGACGT CACCCGCATC GGCCAGGCTC AGGGCAGCGA CTACCCCGGC AACGCGGCCC TGGGCGCCGT GGACGACCCC GGCCTGACCC GCGCCACCCT GCGCTCCCTG GGCGGACGCC TGGCCGAGCT GGGCTTCAAC CTGGACCTGG CGCCCTCGGT GGACGTCAAC GTCGCCGACG ACAACCCGGT GATCGGCACC CGCTCCTTCG GCTCCGACCC CGAACTGGTG GCCCGGCACG CGGCGGCCGC CGTGCTCGGC CTCCAGGAGG CGGGCGTGGC CGCGTGCGCC AAGCACTTCC CCGGCCACGG CGCCACCTCC CAGGACTCCC ACCACGTGCT GCCGCGCGTG GAGGCCGACG CCGACCTCCT GCGCCGCCGC GAGCTGCTGC CCTTCCGCGC GGCCGTGGAG GCGGGCGTCC GGTCGATCCT GACCGCGCAC ATCGAGATGC CCGGCCTGGG CGGCGACGGC CCCGCCACGC TCACCCCGCG CATCCTCAAC GACCTGCTCC GCGGCGAGCT GGGCTTCACC GGCACCGTGG TCAGCGACGC CATGGACATG CAGGGCGTCA GCGGCCGTAT CGGCATCCCC GAGGCCTGCG TGCGCGCGGT GGCCGCCGGG GTGGACCTGC TGTGCCTGGG CCGGTTCGTC TACGCCGACC AGGTCGAGCT GATCCGCGCC GCGCTGGTGG ACGCGGTCCG GGAGGGCCGC CTGCCCGGGG AGCGCCTGGA GGAGGCCGCC GGGCGCAACG CCGAGCTGCG GACCTGGATC CGCGCGGCCC AGACCCGGCG CTCGGACGCG CCCGCGGCCG ACGGGGTGGG CCTGGTCGGG GCCCGCCGCG CGGTGCGCGT GGACGGCGAC CTGCCGCCGC TGGCGGACCC CTACGTGGTG GAGGTGGACG CCCCCTCCGG CATGGCGGTG GGCGAGGTCC CCTGGGGCCT GTCCCCCTGG TTTCCGGGCA CCGAGCGGGT CTCCCCCGAC GTCGCGCACG CCGACCGGCT GGCCGCCAGC GCCCGGGACC GCGACCTGGT CGTGGTGGTG CGGGACGCGC ACCGCTACCC CTCCGCCCAG GCGCTCGTCA ACCGCCTGCT CAGCTCCCAC CCCGACGCCG TGGTGGTGGA GATGGGGCTG CCCATCTGGC GGCCCGACTG CGGCGCGCAC GTGAGCACCT ACGGCGCCGC GCACGTGAAC GGGCTGAGCG CGGCGGAGCT GCTGGGGGCA CCGGTGGGCG CCCCCTCCCC CGGCGTGAAC TGA
|
Protein sequence | MPYDPTLARL ANATLLVPFE SYQAPRWVLE GLADGISGVC LFHNNLDGPE QVTALNATLA GVTDTPLISL DEEGGDVTRI GQAQGSDYPG NAALGAVDDP GLTRATLRSL GGRLAELGFN LDLAPSVDVN VADDNPVIGT RSFGSDPELV ARHAAAAVLG LQEAGVAACA KHFPGHGATS QDSHHVLPRV EADADLLRRR ELLPFRAAVE AGVRSILTAH IEMPGLGGDG PATLTPRILN DLLRGELGFT GTVVSDAMDM QGVSGRIGIP EACVRAVAAG VDLLCLGRFV YADQVELIRA ALVDAVREGR LPGERLEEAA GRNAELRTWI RAAQTRRSDA PAADGVGLVG ARRAVRVDGD LPPLADPYVV EVDAPSGMAV GEVPWGLSPW FPGTERVSPD VAHADRLAAS ARDRDLVVVV RDAHRYPSAQ ALVNRLLSSH PDAVVVEMGL PIWRPDCGAH VSTYGAAHVN GLSAAELLGA PVGAPSPGVN
|
| |