Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1538 |
Symbol | |
ID | 9245388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1886825 |
End bp | 1888096 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | glycoside hydrolase family 16 |
Protein accession | YP_003679473 |
Protein GI | 297560499 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCG CGCACCGGCC CCGCAGGTCC CTCGCCTCCG CGGCCCTGTC CTTCCTCACC GCAGCCGCGC TCGTGATCCC GGTCGGGGCC GCCTCGGCCG CCCCCGTCCC CGAGGACACG GCGGCCCCGG CGTCCGCCGA AGCCGCCCAG GCGGCGGCCC TGGTGTGGTC CGACGAGTTC AACGGCGCCG CCGGAAGCGC CCCCAACCCC GCCAACTGGA ACCACGAGAC CGGCGCCCAC GGCTGGGGCA ACAACGAACT CCAGAACTAC ACCAGCAGCC GCGCCAACTC CGCCCTCGAC GGCAACGGCA ACCTCGTCAT CACCGCACGC AGGGGAGCCA ACGGCGGCTA CACCTCCGCC CGCATGACCA CCCAGAACAA GGTCGAACAC GCCTACGGCC GCATCGAGGC CCGCATCAAG ATCCCCCGCG GCCAGGGCAT CTGGCCCGCC TTCTGGATGC TCGGCGCCGA CTTCCCCGAC ACCCCCTGGC CCGACTCCGG CGAGATCGAC ATCATGGAGA ACATCGGCCG CGAACCCCAC CTGGTCCACG GCACCCTCCA CGGCCCCGGC TACTCCGGGG GCAACCCCCT GACCGGCTCC TACATGCACC CGCAGGGCTG GTCCTTCGCC GACGACTTCC ACACCTTCGC CGTCGACTGG AGCCCCGGCT CCATCACCTG GTCCGTGGAC GGCAACGCCT ACCAGACCTA CACCCCGGCC GACACGCGCG GAAACCCCTG GGTCTACGAC CAGCCCTTCT TCATGATCCT CAACATCGCC GTGGGCGGTA ACTGGCCCGG CTACCCCGAC GGCACCACCC AGTTCCCCCA GCAGATGCTC GTGGACTACG TCCGGATCTA CTCGGACGGC GGCGGCCCCG GCGGCGGCAC CGGGACGATC ACCGCCTCCA ACGGCATCTG CCTCGACGTC GCCGGGGCCC AGACGGGCGA CGGCACCCCG ATCCAGCTGG CGCACTGCAA CGGCAACCAG GCCCAGCAGT GGACCGAGGG CTCCGACGGC ACGTTCCGGG CGTTCAACAA GTGCCTGGAC GTGGCGGGCG GCGCCACCGC CGCGGGCACC CCCGTACAGC TGTGGACCTG CAACGGGACC GGCGCGCAGC GGTGGACCCA CGACAGCGGG ACGCAGGCCC TGCGCAACCC GCAGTCGGGC CGCTGCCTGC AACCCCAGGG CCGGGCGCAG AGCGACGGCA CCCGGATGGT GATCGCCGAC TGCGACGGCA GCGCCGTCCA GCGCTGGAGC CTGAACGGCT GA
|
Protein sequence | MRIAHRPRRS LASAALSFLT AAALVIPVGA ASAAPVPEDT AAPASAEAAQ AAALVWSDEF NGAAGSAPNP ANWNHETGAH GWGNNELQNY TSSRANSALD GNGNLVITAR RGANGGYTSA RMTTQNKVEH AYGRIEARIK IPRGQGIWPA FWMLGADFPD TPWPDSGEID IMENIGREPH LVHGTLHGPG YSGGNPLTGS YMHPQGWSFA DDFHTFAVDW SPGSITWSVD GNAYQTYTPA DTRGNPWVYD QPFFMILNIA VGGNWPGYPD GTTQFPQQML VDYVRIYSDG GGPGGGTGTI TASNGICLDV AGAQTGDGTP IQLAHCNGNQ AQQWTEGSDG TFRAFNKCLD VAGGATAAGT PVQLWTCNGT GAQRWTHDSG TQALRNPQSG RCLQPQGRAQ SDGTRMVIAD CDGSAVQRWS LNG
|
| |