Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5373 |
Symbol | |
ID | 9249276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 552843 |
End bp | 555203 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_003683259 |
Protein GI | 297564286 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.534575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGTCA CCCAAGCGGT CGAAGCGGTT CCGGGCATGC CGGGGGACAC CGCTCCCGCG CTCATGCCCG TGGACGTCCG GCAACGCGTC TCCGCCCTCG CCAGAACCTC ACGGCTCCTG GTGGCCTGCG ACTACGACGG CGCCCTCTCC ACGGTCGACC CTGCCGACCG GCGTCCGCTG CCCGAGGCCC TGCGCGCGCT GCGCGACCTG GCCGACCTGC CGGGCACCGT GTGCGCGGTC ATCTCCGGCC GCCCCCTGCG CGACCTGGCC GCGCTCTCGC GCCTGCCCGC CGAGGTGCGC CTCGTGGGCG CCCACGGCAC CGAGTTCGAC ACCGACGTGA CCGTCGACCC CGACCACCGC AACGCCGACT CGCCCTCGGA CAAGGCGGCC GCCCTGGAGC TGCTGCGCGA CCAGGTCGAG GCCACCGCCA TCCTCTACAT CGGCGGCGGC GAGGGCGAGG AGCCCGTCTT CATGCGGCTG ACCGGCGCCG ACGCGGGCGT GCGGGTCGGC GAGGGGCCCA CCGTGGCCTC CCACCGCGTG GCCGACACCC CGACCGCCGC GGCCCTGCTG TCCGCGCTGG CAGCCGAGCG CCGCTTCTGG GTCTTCGGCG AGCGCCCCAC GCCCATCGAG CGCATGTCGA TGCTCTCCAA CCAGGGCTCC GTGGCCCTGG TCGGCCCGGA CGCGCGCCTG CTGTGGTTCT GCCACCCCGA GCCCGACTCG AACGCCGTGT TCGCCGAGGT CCTGGGCGGC CGCCAGGCGG GCGTGTTCGC CGTCGCCCCC GCCCACGGAG GCCGTCCGCT GGGCCAGCGC TACCTGCCCG GCACCATGAC CGTGCGCACC CGCTGGTCGC GCATGGACGT CACCGACTAC CTCGCACACG GCACCCCCCA GGGGCGCACC GACCTGATCC GGGTGATCAG CGGCGTCACC CCGGCCACGG TGGAGTTCGC CCCCCGTCCC GACTTCGGCC GCGCGCCGGT GCGCATCACG CCGCAGGAGA ACGGCCTGCT GGTCGAGGGC GCCGACTTCC CGATGGCCCT GTACTCGCCC GGCGTGGAGT GGGAGATCGA CCACGACGGC GTCTCCGACC TCGCCCGCGC GGTCGTCCAC CCGCGCTCGG AGCAGCCCGT GGTGCTGGAG CTGCGCTGCG GCACCGACTC CCTCGTGCAC GGCACGCTGC CCGAGTCCGA GCGTCAGCGG ACCTCCGGCG AACACTGGTC GCGCTGGCTC GACGGGCTCA CCCTGCCCGA CACCGCCCAG CAGCTGGCGG CGCGCTCCGC GCTCACCCTG CGCGGGCTGT GCACCCCCTC CGGGGGCGTG ATGGCCGCCG CGACCACGTC GCTGCCCGAG GAGATCGGCG GCGTACGCAA CTGGGACTAC CGGTACTGCT GGCTGCGCGA CGGCGCCCTG ACCGTGCAGT CGCTGGTCTC CCTGGGCTCG ACCGCCGAGG CCGAGGAGTT CCTGGACTGG GTGCACCGGG TCGTGGACTC GCTGCCCGGC CCGGAGATGC TGCGCCCGCT GTACTCGCTG CGGGGCACGA ACCTGGGCCC GGAGGAGGTC ATCGAGTCGC TGTCCGGGTA CGCGGGCTCG CGGCCGGTGC GGATCGGCAA CCTGGCCGAC CACCAGGTGC AGCTGGACGT GTTCGGCCCC GTCGTGGAGC TGATCGAGAA GCTGTCCTCG GTGCGCGGCA CCCTGGCCGA CCGCGACTGG GACCTGGTGC GGTCGATGGC CGAGGCGGTC GCCCGCCGCT GGCACGAGCC CGACCACGGC ATCTGGGAGG AGCGCGACGA GCCCCGCCAG CGCGTCTACT CCAAGGTGAT GTGCTGGGTG ACCCTGGACC GCGCCGTCTC CCTGGCCCGT GCCTACGGCC GCGAGGTGGA CCCGTCCTGG CCGGACCTGC GCGACGGCAT CGCCGCCGAG GTGCTGGACA AGGGCTGGAA CGAGGAGGCG CAGGCCTTCA CCACCGCCTA CGACGGCACC GACCTGGACG CCGCGTCCCT GCACATCGGC CTGTCCGGGC TGATCGACCC GGCCGACGAG CGCTTCCAGG CCACCGTGAC CGCCGTGGAG GCGGAGCTGC GCAGCGGCCC GACCGTGTAC CGGTACCACC GCGACGACGG CCTGCCGGGC GGCGAGGGCG GCTTCCACCT GTGCACCGCG TGGCTGATCG AGGCGTACCT GCTGACCGGC CGCCGCGCGG AGGCGGACGA GCTGTTCAAG CACCTGGTGG ACTGCGCCGG ACCGACCGGG CTCATCCCGG AGGAGTTCGA CCCGGTCACC GAGCGGGCGC TGGGCAACCA CCCGCAGGCG TACTCGCACC TGGGCCTGAT CCGCTGCGCC CAGCTGCTCG ACCGCCTCTG A
|
Protein sequence | MTVTQAVEAV PGMPGDTAPA LMPVDVRQRV SALARTSRLL VACDYDGALS TVDPADRRPL PEALRALRDL ADLPGTVCAV ISGRPLRDLA ALSRLPAEVR LVGAHGTEFD TDVTVDPDHR NADSPSDKAA ALELLRDQVE ATAILYIGGG EGEEPVFMRL TGADAGVRVG EGPTVASHRV ADTPTAAALL SALAAERRFW VFGERPTPIE RMSMLSNQGS VALVGPDARL LWFCHPEPDS NAVFAEVLGG RQAGVFAVAP AHGGRPLGQR YLPGTMTVRT RWSRMDVTDY LAHGTPQGRT DLIRVISGVT PATVEFAPRP DFGRAPVRIT PQENGLLVEG ADFPMALYSP GVEWEIDHDG VSDLARAVVH PRSEQPVVLE LRCGTDSLVH GTLPESERQR TSGEHWSRWL DGLTLPDTAQ QLAARSALTL RGLCTPSGGV MAAATTSLPE EIGGVRNWDY RYCWLRDGAL TVQSLVSLGS TAEAEEFLDW VHRVVDSLPG PEMLRPLYSL RGTNLGPEEV IESLSGYAGS RPVRIGNLAD HQVQLDVFGP VVELIEKLSS VRGTLADRDW DLVRSMAEAV ARRWHEPDHG IWEERDEPRQ RVYSKVMCWV TLDRAVSLAR AYGREVDPSW PDLRDGIAAE VLDKGWNEEA QAFTTAYDGT DLDAASLHIG LSGLIDPADE RFQATVTAVE AELRSGPTVY RYHRDDGLPG GEGGFHLCTA WLIEAYLLTG RRAEADELFK HLVDCAGPTG LIPEEFDPVT ERALGNHPQA YSHLGLIRCA QLLDRL
|
| |