Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4565 |
Symbol | |
ID | 9248446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5409324 |
End bp | 5411195 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_003682458 |
Protein GI | 297563484 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGG GCACCGCGAG CATCGGCGAC CACGGCTTCC TCTCGGACTG CCACACCGCC GCGCTCACCA CACCCGACGG CACCGTCGAC TGGCTGTGCG TTCCCCGGTT CGACGGGCCC GCCCTCGTCT CGGGGATCCT GGACCCGCGC GGAGGCGGAT GGACCCTGGA GGTCGAGGGG GCGAGCCCGG CCGGACGCGC CTACGTCGAC GACACGCTCG TCCTGGAGAC GCTCTGGCGG GGTACGGACA CCGAGGTGGC CGTCCGCGAC CTGCTCGCGG TGCGAAGGCC GGAGGAGGGC GGGGCGGGCC TGTACCGGCA GGGTCTCCTC CTGCGGGTCG TCGAGTGCCG CTCCGGGAGC ACGTCCGTGC GCTCGCGGCT CGACGCCAGA CCCGACTTCG CGCGTGCCGA ACCCGTGTGG GAGCGGGTGG ACGGCGGACT GCGCGAGGCC TCGGGGCCGA TGCTGTCGGG TTCGCCCGCG CCCGCGCTCG CGCGGGACGG CGTGCCGGAG TACCGCGTGG AGCTGGCCGA GGGGGACACC GCCGTGTTCG CCCTGGACTA CCTGGAGGGC GGGCGCCGCG TCGGACTGGG GGAGGGCCGG GCGCTGCTGC GGGAGACCCT GGACGCCTGG CGGGAGTGGT CCGGCCGGAC CGACTACGAC GGGGTCGGCG CCACGCACGT GCGCCGCAGC GCCCTCACCC TGCGCGGTCT GCTCCACGAG GAGAGCGGCG CCCTGATCGC CGCACCCACC ACGTCCCTGC CCGAGTGGCC GGGAGGCCCG CGCAACTGGG ACTACCGCTA CGTCTGGCAC CGCGACGCCG CGCTCGTCGT CCTGGCCTTC CTTCGGCTCG GGCACGCCGA GGAGGCGGGG CACTACCTGC GCTTCCTGCT GCGCATGTGC GGTCAGCCGA TCGACTGGGT CCCCCCGGTG CAGGCGGTCG ACGAACAGCC GCCGCCTGAG GAGGAGACCC TGGACCACCT CGCCGGACAC GCCGGGTCCA GACCGGTCCG CGTCGGCAAC GACGCCTACT CACAGCACCA GCTGGACGTG TACGGGCACG TGCTCGACGC CGCGCTGTCC TACGAGGAGG CCACCGGCGG GCTCGGGCGC GGGGACGTCG AACAGCTCTC CTCGATGGTC GACGCGGCGT GCCGGGTCTG GCGCGAGCCG GACGAGGGCA TGTGGGAGGT GCGGTCGCGG CCGCGGCACT GGACGAGTTC CAAGGTCTAC GCCTGGGTGT GCCTGGACCG CGGGATCCAG CTGGCCACCG AGTCCGGCAA GGCGGGCGGG GACGTCCCGC TGGACAAGTG GCGCAAGGAG CTGGACGCCG TGCGCCAGGA GGTCCTGGAC CGGGGCTACG ACGCGGAGGC CGGGACCTTC ACGCAGTCCT ACTGCTCGTC CCACGTGGAC GGGTCGCTGC TGAGGATCCC GCTCCTGGGC TTCCTGGAGG GGACCGACCC GCGCGTGCTC GCGACCCTGG AACGGGTGGA CGCGGAGCTG GGCGGGGAGG GCGGGCTCGT CCACAGGTAC GACCCCGGGA CGACCGACGA CGGACTGGGC ACCCCGGAGG GCGCCTTCCT CCTCTGCTCC TTCGACATGG TCTCCGCCCT GGTGCTCGCC GGGCGGACCG AGGAGGCGCG GCGGAGGTTC GAGGAACTGT GCGGGAGCTC GGGAGAGCTC GGCCTGCACG CGGAGGAGAT GGCCGCCGAC GGCACCATGC TGGGCAACTT CCCCCAGGCC TTCACCCACC TCGCGCTGAT CGAGGCGGCC GTCAACCTCG ACCAGGCGGG GGACGGGGAG GCGCTGCACT CGTGGGTGCG CGACAGGTCG AGCGGCGCGA CACGACGAAG GAGGACTGGA GCAGATGGCT GA
|
Protein sequence | MAQGTASIGD HGFLSDCHTA ALTTPDGTVD WLCVPRFDGP ALVSGILDPR GGGWTLEVEG ASPAGRAYVD DTLVLETLWR GTDTEVAVRD LLAVRRPEEG GAGLYRQGLL LRVVECRSGS TSVRSRLDAR PDFARAEPVW ERVDGGLREA SGPMLSGSPA PALARDGVPE YRVELAEGDT AVFALDYLEG GRRVGLGEGR ALLRETLDAW REWSGRTDYD GVGATHVRRS ALTLRGLLHE ESGALIAAPT TSLPEWPGGP RNWDYRYVWH RDAALVVLAF LRLGHAEEAG HYLRFLLRMC GQPIDWVPPV QAVDEQPPPE EETLDHLAGH AGSRPVRVGN DAYSQHQLDV YGHVLDAALS YEEATGGLGR GDVEQLSSMV DAACRVWREP DEGMWEVRSR PRHWTSSKVY AWVCLDRGIQ LATESGKAGG DVPLDKWRKE LDAVRQEVLD RGYDAEAGTF TQSYCSSHVD GSLLRIPLLG FLEGTDPRVL ATLERVDAEL GGEGGLVHRY DPGTTDDGLG TPEGAFLLCS FDMVSALVLA GRTEEARRRF EELCGSSGEL GLHAEEMAAD GTMLGNFPQA FTHLALIEAA VNLDQAGDGE ALHSWVRDRS SGATRRRRTG ADG
|
| |