Gene Information |
Locus tag | Ndas_1022 |
Symbol | |
ID | 9244868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1248674 |
End bp | 1250950 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003678971 |
Protein GI | 297559997 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
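The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends with a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record
length = gene_length_bp(1248674, 1250950)
print(length, protein_length_aa(length))  # prints: 2277 758
```

Under these assumptions the result agrees with the listed Gene Length (2277 bp) and Protein Length (758 aa).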
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.534913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0223661 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACG ACCCAGCCGA CCTGAGCGGG CTCACACTGG CGGAGAAGGC CTCGCTGAGC AGCGGCGCCG ACTTCTGGAC GACCAAGGCC GTGGGGGACG TCCCCTCGCT CGTGCTCACC GACGGGCCGC ACGGCGTACG CCGACAGGCC GGTGCCACCG ACCACCTGGG CCTGGCGGAG AGCCTGCCCG CGACCTGCTT CCCGCCCGCC GTCGGCCTGT CCCAGAGCTG GGACCCCGAC CTGGTCGGGC GGGTGGGCGC GGCCCTGGGC GCCGAGGCCG GGGCGCTGGG CGTGGACGTG CTCCTGGGCC CGGGGATCAA CATCAAGCGC GACCCGCGCG GCGGGCGCAA CTTCGAGTAC TTCTCCGAGG ACCCCCTCCT CACCGGCGTC CTGGGCGGCG CCTGGGTGAG CGGCCTCCAG AGCACCGGTG TGGGCGCCTC GCTCAAGCAC TTCGCCGCCA ACAACGCCGA GCACGACCGG ATGCGCTCCA GCTCCAACGT CGACCCGCGC ACGCTGCGCG AGATCTACCT GCGCGCCTTC CAGAGGGTCG TGCGCACGTC CGCCCCGTGG ACCGTGATGT GCTCCTACAA CCGCATCAAC GGCGTCTACG CCTCGGAGGA CCACTGGCTG CTCACCACGG TGCTGCGCCA GGAGTGGGGC TTCGACGGCG TGGTCGTGAG CGACTGGGGC GCGGTGCAGG ACCGGCCCGC CGCGGTCGCC GCCGGGTTGG ACCTGGAGAT GCCCGGTGAC GGCGGCGCGA GCGACGCGCG GCTCCTCGCG GCCGTGGAGG CCGGGGCCTG CTCGACGGAG GACGTGGACA CGGCCGCCTC CCGCGTGGCC GCCCTGGCCG CCAGGGCGCG CGGCTCCCGC CGGGACCGCG CGGCGGATCC CGAACGCGCC CCGGAGGTCG GCGCGCGGGA GATCGAGGAC CACCACGCCC TCGCCCGCGA GGTGGCCGCG CGGTGCGTCG TCCTGCTCAA GAACGACGGC GCGCTCCTGC CGCTCGCGCC CGACGGGTCC GTCGCGGTGA TCGGCGGCTT CGCCCGGAAC CCGCGTTACC AAGGCGGCGG CAGCTCCCAC GTGAACCCCA CGCGCGTGGA CGTCCCGCTG GAGGAGATCC GCGCCCTGGC CGGTTCCGGC GCGGTCGCCT TCGCCCCCGG CTACGCCGAC CGGGCGCAGG ACGGTACGGA ACCGGGCGAC GCGGCGGCCC TGCGTGAGGA GGCGGTGCGC GCCGCGTCCG AGGCGGACAC CGCCGTGGTC TTCCTGGGCC TGGCCGAGCA CCAGGAGTCC GAGGGCTTCG ACCGGGAGCA CATCGAACTC CCCGCCGAAC AGCTCGACCT GCTCGCGGCC GTCGTGCGGG CGCAGCCCCG CACGGTCGTC GTGCTCTCCC ACGGCGGCGT GCTGCGGCTC GCCCCCGTCG CCGAGCTGGC CCCCGCGGTC CTGGACGGGG CCCTCCTGGG CCAGGCGGGC GGCGGCGCGA TCGCCGACGT GCTGTTCGGC CGGGTCAACC CGTCGGGGCG GCTCGGCGAG ACGGTGCCGG TCCGGTTGCA GGACACCCCC GCCTACCTCA ACTTCCCCGG GGAGAACTCC GCGGTGCTCT ACGGCGAGGG CCTGTACGTG GGCTACCGCT GGTACGACGC CCGCGACATG GAGGTGGCCT TCCCCTTCGG GCACGGCCTG TCGTACACGA GCTTCGCCTA CTCCGACCTC GAACTGGAGC AGGACGGCGC CGGGATCACG GCGCACCTGA CCGTGACCAA CACCGGGGAC CGGTCCGGGC GCGAGGTCGT GCAGTTCTAC 
GTCGCCAAGC CCGACTCGGC CGTGTCGCGG GCGCCGCGTG AGCTGAAGGG CTTCACGGGC GTGACCCTGG AACCCGGTCG CAGCGAGCGC GTCAGCGTCC CGCTGTCCCG GGAGGACCTG GCGTACTGGG AGGTGCGGGC CGACCGCTGG GTGGTCGAGG GCGGCGAGTA CGTGGTGTCG GCGGGCGCTT CGAGCCGCGA CCTGCGGGTC ACGGCGGCGA TCGCCGTCGA GGGCGACGCG CTGCGCCTGC CGGTGACGCT CGACTCCACG CTGGGCGAGG TCATGGCCGA CCCCGGGGCG GCGGCCCTCC TCGCCCAGGC CTTCACGCCT CCGGCGGCCG AGGGCTCCGA CAACGCCATG GGGATGGACA TGGCGCGGAT GATGGCCTCG ATCCCGATCC GCCGCCTGGC GAGCTTCGGC CCGACGGGAC TGGACGAGCT CGAAGCACTC GTCGCGAGGA TCAACGCCGA GGACTGA
|
Protein sequence | MTHDPADLSG LTLAEKASLS SGADFWTTKA VGDVPSLVLT DGPHGVRRQA GATDHLGLAE SLPATCFPPA VGLSQSWDPD LVGRVGAALG AEAGALGVDV LLGPGINIKR DPRGGRNFEY FSEDPLLTGV LGGAWVSGLQ STGVGASLKH FAANNAEHDR MRSSSNVDPR TLREIYLRAF QRVVRTSAPW TVMCSYNRIN GVYASEDHWL LTTVLRQEWG FDGVVVSDWG AVQDRPAAVA AGLDLEMPGD GGASDARLLA AVEAGACSTE DVDTAASRVA ALAARARGSR RDRAADPERA PEVGAREIED HHALAREVAA RCVVLLKNDG ALLPLAPDGS VAVIGGFARN PRYQGGGSSH VNPTRVDVPL EEIRALAGSG AVAFAPGYAD RAQDGTEPGD AAALREEAVR AASEADTAVV FLGLAEHQES EGFDREHIEL PAEQLDLLAA VVRAQPRTVV VLSHGGVLRL APVAELAPAV LDGALLGQAG GGAIADVLFG RVNPSGRLGE TVPVRLQDTP AYLNFPGENS AVLYGEGLYV GYRWYDARDM EVAFPFGHGL SYTSFAYSDL ELEQDGAGIT AHLTVTNTGD RSGREVVQFY VAKPDSAVSR APRELKGFTG VTLEPGRSER VSVPLSREDL AYWEVRADRW VVEGGEYVVS AGASSRDLRV TAAIAVEGDA LRLPVTLDST LGEVMADPGA AALLAQAFTP PAAEGSDNAM GMDMARMMAS IPIRRLASFG PTGLDELEAL VARINAED
|
| |
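The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a blocked gene sequence might be cleaned and sanity-checked; the helper names are hypothetical, and the start-codon set is a simplifying assumption (only the most common bacterial starts) rather than the full set allowed by translation table 11.

```python
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial start codons only
STOP_CODONS = {"TAA", "TAG", "TGA"}   # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, a recognised start codon, and a stop codon at the end."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First three blocks of the gene sequence from this record
prefix = clean_sequence("ATGACCCACG ACCCAGCCGA CCTGAGCGGG")
print(len(prefix), round(gc_fraction(prefix), 2))
```

Applied to the full gene sequence, `gc_fraction` gives a value that can be compared with the GC content field, and `looks_like_complete_cds` checks that the sequence runs from a start codon to a stop codon in whole codons.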