Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5261 |
Symbol | |
ID | 9249159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 425041 |
End bp | 427461 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003683147 |
Protein GI | 297564174 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGAC CCTCGGAACA CGTCGCGGAC CGGGAGGCCG CCGACGACGG TTGGCGCGAC CCCTCCCTCC CCGACGCGTC GCGCGTCGAG CGACTGCTGG GATCGATGAC CCTGGAGGAG AAGGTCGCCC AGCTTCACGG TGTGTGGGTG AGCGCGGACG CCTCCGGCGA GGCCGTCGCA CCGCACCAGC ACGACCTCAC CCAGGAACCC CCCGCCTGGG AGAAGGTCAT CGAGCACGGT CTCGGCCAGC TCACCCGGCC CTTCGGCACC GCCCCGGTCG ACCCCGCCGC CGGCCGGGCG TCGCTGGCCC GCAGCCAGCG GGAGATCATG GCCGCCAACC GGTTCGGCGT CCCGGCCCTG GCCCACGAGG AGTGCCTGGC CGGGTTCGCC GCCTGGACCG CGACGATCTA CCCGGTCCCC CTGGCCTGGG GCGCCAGCTT CGACCCCGAC CTGGTCGAGC GGATGGCCGC GCAGATCGGC GCGAGCATGC GCCGGGTCGG CGTCCACCAG GGCCTGGCCC CCGTGCTGGA CGTGTCCCGC GACCCGCGCT GGGGCCGCAC CGAGGAGACC ATCGGCGAGG ACCCCCACCT CGTGGCGACC GTGGGCACCG CCTACGTGCG CGGCCTCCAG TCCGCCGGGA TCGTGGCCAC CCTCAAGCAC TTCACGGGCT ACTCGGCCTC CCGCGGCGGC CGCAACCTCG CGCCGGTCTC GATCGGTCCG CGCGAGTTCG CCGACGTCCT GCTGCCCCCC TTCGAGATGG CCGTGCGCGA CGGAGGCGCC GGGTCGGTCA TGTCCGCCTA CAACGACAAC GACGGAGTGC CCGCCGCCGC GGACACGCGC CTGCTCACCG GCCTGCTGCG GGACCAGTGG GGCTTCGAGG GCACCGTGGT GGCCGACTAC TTCGGCGTCG CCTTCCTCCA GACCCTGCAC CGCGTCGCCG ACTCCGCCGA GCGGGCCGGC GCCCTGGCCC TGACCGCGGG CGTGGACGTC GAACTGCCCA CCGTGCACTG CTACGGCGAC CGGCTCACCG CCCTGGTCCG CTCGGGCGAG GTGCCCGAGG AGCTGGTCGA CCGGGCCGCC CGGCGCGTGC TGACGCAGAA GTGCCAGCTG GGCCTGCTCG ACGCCGGCTG GTCCCCGGAG CCCGAGGACC CCGCCGTCCC CGTGGACCTG GACCCCGCCG AACACCGCGC GCTGGCCCGC GAGCTCGCCG AGCGCTCGGT GGTCCTGCTC TCCAACACCG ACGACGCGCT CCCCCTGGCC GACACCGGGG ACCTGGCCCT GGTCGGCCCG CTCGCCGACA CCGCCGACGC CGTGCTCGGC TGCTACTCCT TCCCCGCCCA CGTGGGCAGG CGCCACCCCG GCACCGCCGT CGGCGTGGAG ATACCCACCC TGCTGGAGTC CCTGGGCGCC GAACTGCCCG GCGTCCGCGT CGAGCACCGC GCCGGGTGCT CCGTGGACGG CGACTCCACC GAAGGCTTCG CCGAGGCCGT GTCGGCGGCC GCACGCGCCC GGGTGTGCGT GGCCGTGGTG GGCGACCGCT CCGACCTGTT CGGCAGGGGT ACCTCCGGCG AGGGCTGCGA CGTCGAGGAC CTGCGCCTGC CCGGGGTCCA GCAGGAGCTC CTGGAGGCCC TGGCCGACAC CGGCACCCCC GTGGTCGCGG TGGTGGTGTC CGGACGGCCC TACGCGCTGG GCCCGGTCGC CGACCGGCTG GCCGCCGTCG TCCAGGCCTT CCTGCCCGGC GAGGAGGGCA TGCCCGCCGT GGCGGGGGTG CTCTCGGGCC GGGTCAACCC CAGCGGGCGC CTGCCGGTGT CCGTGCCGCG TTCCTCCGGC GGCCAGCCCG TCACCTACCT CGGCCCCGAC CTGGCCCACC GCAGCGAGGT CAGCTCGGTG GACCCGACCC CGCGCCACCC CTTCGGCCAC GGCCTGTCCT ACACCCGGTT CGTCTGGGAG GACCCGCGCG TGGACGCGGG CGCCGTCCGC CCGGAGGAGG CCACCCGGGT GGGCACCGAC GGCGAGGTCA CCGTCGGCTG CACCGTCCGC AACGTCGGCG GCTCCGCCGG GACCGAGGTC GTCCAGCTCT ACCTGCACGA CCCCGTCGCC CAGGTGGCCA GGCCCCGCAG ACAGCTGGTC GGCTACGCGC GGGTGCACCT GGAGTCCGGG GAGGCGCGGG CGGTGGACTT CTCCGTCCAC GCCGACCTGG CCTCCTACAC CGGTCCGGAC GGGCGGCGCG TCGTCGAGCC CGGCCGCCTG GAGCTGCTGC TGTCCGCGTC CAGCGAGGAC GTCCGGCACA CCGTGCCGGT CCTGCTCACC GGGCCCACGC GCGTCGTGGA CCACACCCGG CGCCTGGCCT GCGGGGTCCA GCTCGACCCC GTCGACCAGT CCGGCGGGGC CGACCGGGAG GAGGTCGCCG CAGGCGGCTG A
|
Protein sequence | MTGPSEHVAD REAADDGWRD PSLPDASRVE RLLGSMTLEE KVAQLHGVWV SADASGEAVA PHQHDLTQEP PAWEKVIEHG LGQLTRPFGT APVDPAAGRA SLARSQREIM AANRFGVPAL AHEECLAGFA AWTATIYPVP LAWGASFDPD LVERMAAQIG ASMRRVGVHQ GLAPVLDVSR DPRWGRTEET IGEDPHLVAT VGTAYVRGLQ SAGIVATLKH FTGYSASRGG RNLAPVSIGP REFADVLLPP FEMAVRDGGA GSVMSAYNDN DGVPAAADTR LLTGLLRDQW GFEGTVVADY FGVAFLQTLH RVADSAERAG ALALTAGVDV ELPTVHCYGD RLTALVRSGE VPEELVDRAA RRVLTQKCQL GLLDAGWSPE PEDPAVPVDL DPAEHRALAR ELAERSVVLL SNTDDALPLA DTGDLALVGP LADTADAVLG CYSFPAHVGR RHPGTAVGVE IPTLLESLGA ELPGVRVEHR AGCSVDGDST EGFAEAVSAA ARARVCVAVV GDRSDLFGRG TSGEGCDVED LRLPGVQQEL LEALADTGTP VVAVVVSGRP YALGPVADRL AAVVQAFLPG EEGMPAVAGV LSGRVNPSGR LPVSVPRSSG GQPVTYLGPD LAHRSEVSSV DPTPRHPFGH GLSYTRFVWE DPRVDAGAVR PEEATRVGTD GEVTVGCTVR NVGGSAGTEV VQLYLHDPVA QVARPRRQLV GYARVHLESG EARAVDFSVH ADLASYTGPD GRRVVEPGRL ELLLSASSED VRHTVPVLLT GPTRVVDHTR RLACGVQLDP VDQSGGADRE EVAAGG
|
| |