Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4927 |
Symbol | |
ID | 9248814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 61356 |
End bp | 63689 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | glycoside hydrolase family 31 |
Protein accession | YP_003682816 |
Protein GI | 297563843 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.606594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCA CCGACGGCTT CTGGCAGATG AGGGAGGGCG TGCGCGCCAA CCGCGCGCGC GAGGCCCGCG ACGTGCGGGT GCACGACGAC CGGTTCACCC TCTACGCCCC GGTGCGCCCG ATCGGGCACC GGGGGGACAC GCTCAACACA CCGCTGATCA CGGTGGACTG CTGGTCGCCC GCGCCCGGTG TGATCGGTGT GCGGAGCACG CACCTGGCGG GCTCGGTCCG GCGCGCGCCG GAGTTCGACG TGCGCAGCGA CCCCGGCGCC GCCCCCTCGG TGGCGCGGGA CGGCTCCTCG GTGGAGCTGA CCAGCGGGGA GCTGTCGCTG CGGGTGGCCA CCGAGGGCCC CTGGCGGATG GAGTTCCGCG CGGGCGGCGC CGTGCTGACC GGCTCCGACG CCAAGGGCAC CGCGTTCATG GAGGACGCGG ACGGCTCCCA CCACATGCTC GGGCAGCTGT CGCTGGGGAT CCGCGAGCTG GTGTACGGCA TGGGCGAGCG GTTCACGCCG TTCACGCGCA ACGGCCAGAC CGTGGACATC TGGCAGGCCG ACGGCGGCAC GAGCAGCGAG CAGGCCTACA AGAACGTGCC GTTCTACCTG ACCAACCGGG GCTACGGGGT GTTCGTGGCG CACTCGGGGC CGGTGTCGTT CGAGGTCGGC TCGGAGTCGG TGGGCCGCGT GCAGTTCAGC GTGGAGGACC ACGCCCTGAC CTACTACGTG ATCCACGGGG AGAGCCCCAA GGAGATCCTG GCCCGGTACA CCGCGCTCAC CGGCCGCCCC GCGCTGCCGC CGCGGTGGTC GTTCGGGCTG TGGCTGTCCA CGTCGTTCAC CACCTCCTAC GACGAGGACA CGGTCAACCG CTTCATCGAC GGCATGGCGG AGCGGGGCGT GCCGCTCAGC GTGTTCCACT TCGACTGCTT CTGGATGCGC GAGTTCCACT GGTGCGACTT CGAGTGGGAC CCGGAGCTGT TCCCCGATCC CGTGGGCATG CTCTCCCGGC TCAAGGGGCG CGGGCTGCGG ACCTGCGTGT GGATCAACCC CTACATCGCC CAGCGGTCGG CGCTGTTCGA GGAGGGCTCC CGGCTGGGGC ACCTGGTCCG GCGGCCGGAC GGGACCGTGT GGCAGTGGGA CATGTGGCAG GCGGGGATGG CGCTGGTGGA CTTCACGTCC GCCGACGCCC GCGCGTGGTA CGCCGGAAAG CTCAAGGTCC TGCTCGACAT GGGTGTGGAC TGCTTCAAGA CCGACTTCGG CGAGCGGGTG CCGACCGACG TGGTGTGGTC GGACGGCTCC GACCCGCAGG CCATGCACAA CTACTACACG CACCTGTACA ACGAGACGGT GTTCGACCTG CTGAAGCGCG AGCGCGGTGA GGGCGAGGCC GTCCTGTTCG CGCGCTCGGC CACGGCGGGC GGGCAGAGCT TCCCGGTGCA CTGGGGCGGC GACTGCGCCT CGACGTTCGA GGCGATGGCG GAGAGCCTGC GCGGCGGCCT GTCCCTGGGG CTGTCGGGGT TCGGGTTCTG GAGCCACGAC ATCGGCGGCT TCGAGGGCAC CCCCGACGCC GCGGTGTTCA AGCGCTGGCT CGCGTTCGGC CTGCTCTCCT CGCACAGCAG GCTGCACGGC AGCCGCTCCT ACCGGGTGCC GTGGGACTTC GACGAGGAGT CCACCGAGGT GGCCCGCGTG TTCACCCGGC TCAAGTGCGC GCTCATGCCC TACCTGTTCG GCGCGGCCGT GCAGGCCCAC CGGGAGGGGA CGCCCGTGAT GCGCGCGATG CTGCTGGAGT TCCCCGACGA CCCGACCTGC CACCACCTGG ACACGCAGTA CATGCTGGGT GAGGACCTGC TGGTCGCCCC GGTGCTGAGC GCGGACGGCT CCGTGGAGTA CTACGTCCCC GAGGGCGTGT GGACCCACCT GATCACGGGC GAGACGGTGC GGGGCCCGGT CTGGCGCCGC GAGACCCACG GGTTCGACTC CCTGCCCCTG CTGGTGCGGC CGAACGCGGT CCTGCCGGTC GGCGCGGTGG ACGACCGGCC CGACTACGAC TACACGGACG GGCTGACCCT GCGCGTGTAC GGTGCCGGGG AGGCGGCCAC GGCGACCACC ACGGTCGTGC CGTCCGCCGA CGGCTCGGCC GCCGCGGTCT TCCGGACCGA GCGCTCGGGG GGCGGGGTCA CCGTGGAGGC CGGGGCCGCT CCGGCCCACG GGTGGCGGGT GCTGCTGGTC GGCACTGGCG GGGCCGAGAC CGATGGCACC GGGGCGGAGG TCACCGTGAC CGACGACGGC ACCCTGGTGT CGGTTCCGGC AGGGACCGCC CGCGTGGACC TGCGCCTGTC CTGA
|
Protein sequence | MKFTDGFWQM REGVRANRAR EARDVRVHDD RFTLYAPVRP IGHRGDTLNT PLITVDCWSP APGVIGVRST HLAGSVRRAP EFDVRSDPGA APSVARDGSS VELTSGELSL RVATEGPWRM EFRAGGAVLT GSDAKGTAFM EDADGSHHML GQLSLGIREL VYGMGERFTP FTRNGQTVDI WQADGGTSSE QAYKNVPFYL TNRGYGVFVA HSGPVSFEVG SESVGRVQFS VEDHALTYYV IHGESPKEIL ARYTALTGRP ALPPRWSFGL WLSTSFTTSY DEDTVNRFID GMAERGVPLS VFHFDCFWMR EFHWCDFEWD PELFPDPVGM LSRLKGRGLR TCVWINPYIA QRSALFEEGS RLGHLVRRPD GTVWQWDMWQ AGMALVDFTS ADARAWYAGK LKVLLDMGVD CFKTDFGERV PTDVVWSDGS DPQAMHNYYT HLYNETVFDL LKRERGEGEA VLFARSATAG GQSFPVHWGG DCASTFEAMA ESLRGGLSLG LSGFGFWSHD IGGFEGTPDA AVFKRWLAFG LLSSHSRLHG SRSYRVPWDF DEESTEVARV FTRLKCALMP YLFGAAVQAH REGTPVMRAM LLEFPDDPTC HHLDTQYMLG EDLLVAPVLS ADGSVEYYVP EGVWTHLITG ETVRGPVWRR ETHGFDSLPL LVRPNAVLPV GAVDDRPDYD YTDGLTLRVY GAGEAATATT TVVPSADGSA AAVFRTERSG GGVTVEAGAA PAHGWRVLLV GTGGAETDGT GAEVTVTDDG TLVSVPAGTA RVDLRLS
|
| |