Gene Information |
Locus tag | Ndas_0355 |
Symbol | |
ID | 9244190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 429931 |
End bp | 431226 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | putative cellulose-binding protein |
Protein accession | YP_003678309 |
Protein GI | 297559335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
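The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the reported values are consistent with that convention) and that the protein length excludes a single terminal stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(429931, 431226)
print(length, protein_length_aa(length))  # 1296 431
```

The strand does not affect this arithmetic: a minus-strand gene spans the same interval, only read in the opposite direction.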
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.647615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTCAA ACAACATCGA CACCCAGCTC AACAACATCT TCGACGAGGA CAACACCCCG CACGAATTCG ACGTGGTGCT GCGTGGGTAT GACCGCACCC AGGTGGATGA CTACGCGGCC AGCCTGCGCA ACGAGCTCAA GCAGTACTCC AAGCAGGTCG AGAAGTTCAA GGCCGAGCTC AACGCCAAGA ACCGCCAGCT CCAGGAGCGC GAGCGCCCCT CCTACTCCGG TCTGGGCTCG CGTATCGAGG AGCTGCTGCG CCTGGCCGAG GAGCAGGCCA ACGAGCTGGT CCAGAGCGCG CAGATCGACG CCAACGACAT CCGCTCGGCC GCCAAGATCG AGGCCGCCGA CATGCGCGCG GCCGCCGAGT CCGAGGCCAC CGAGGTGCGC GCCCTCGCCC AGCGCGAGGC CGACGAGACC CGTCAGACCG CCGAGTCCGA GGCGGAGGAG ATCTCCACCA CCGCCCGCCG CGAGGCCGAC GAGCTCACCT CCACCACCGA GCGCGAGGTG CAGAAGAAGC GCTCCGCGGT CGACCACGAG ATCGCCGAGA AGCGGGCGAC CTTCGAGGGC GAGATCGCCA AGCTGCGCAC CACCACCGAG CGCGAGTGCG CCCAGGCCCG CGCCGCGGCC AAGCGCGAGC GCGACGAGAC CATCCAGTCG GCCAAGAGCC AGGCCGAGGA GCTGCGCAAG AACGCCGAGC GCGCCTACGC CGAGTCCGAG GCCCGGCGCA CCGAGGCCGA GGACCAGTTC GAGCTCCAGC TGGCCGACCG CCGCGCCGAG GCCGAGCGCC AGGACGCCGA GCGCCTCGCC GCCGCCCAGG CCGCCACGCA GAAGATGGTC AACGAGGCCG AGGAGCGGGC CGCCAGCGCC GAGCAGCGCG CCACCAAGGC GAGCCAGCAG GCCGAGCAGA CCCGCCGCGA CGCCGAGAAC CACGCCAAGC AGCTGGTCGG CAACGCCAAG AAGAACGCCG CCCAGATCGA GGCCGAGGCC AAGTCCAAGG CCGAGCACCA GCTCGGGGAC GCCAAGTCCG AGGCCAACCG GATCATGACG GCCGCCAAGA AGGAGGTCGA CGAGCTCAAC CGCCAGCGCG ACAGCATCCA GTCGCACCTC CAGCAGCTGC GCCAGCTGCT CGGCGGTGGC GGCCCGGCCG CCCCGGCCCC GGTTCCCGCC GCGCCGGTCG CCCCCGCTCC GGCCCCGGCC GCGATCCCGC AGGAGCCCGC TCCGGCCGCC GAGGAGACCC GGCAGCCGGT GCACTCCGGC AAGGGCGCCG ACGACGAGGA CTGGTGGCAG GAGTAG
|
Protein sequence | MPSNNIDTQL NNIFDEDNTP HEFDVVLRGY DRTQVDDYAA SLRNELKQYS KQVEKFKAEL NAKNRQLQER ERPSYSGLGS RIEELLRLAE EQANELVQSA QIDANDIRSA AKIEAADMRA AAESEATEVR ALAQREADET RQTAESEAEE ISTTARREAD ELTSTTEREV QKKRSAVDHE IAEKRATFEG EIAKLRTTTE RECAQARAAA KRERDETIQS AKSQAEELRK NAERAYAESE ARRTEAEDQF ELQLADRRAE AERQDAERLA AAQAATQKMV NEAEERAASA EQRATKASQQ AEQTRRDAEN HAKQLVGNAK KNAAQIEAEA KSKAEHQLGD AKSEANRIMT AAKKEVDELN RQRDSIQSHL QQLRQLLGGG GPAAPAPVPA APVAPAPAPA AIPQEPAPAA EETRQPVHSG KGADDEDWWQ E
|
| |
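Although the gene lies on the minus strand, the gene sequence above is written in coding orientation: it begins with ATG and ends with the stop codon TAG. As a rough sketch of how the two sequence fields relate, the snippet below translates the first 30 nucleotides and the final 6. It assumes that the sense-codon assignments of translation table 11 match the standard genetic code (the tables differ in their permitted start codons, not in elongation). `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard codon table in TCAG order; assumed valid for elongation
# under translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


first_30 = "ATGCCGTCAA ACAACATCGA CACCCAGCTC"  # first three blocks of the gene
last_6 = "GAGTAG"                              # final block of the gene

print(translate(first_30))  # MPSNNIDTQL
print(translate(last_6))    # E*
```

The first ten residues match the start of the protein sequence, and the last two codons give the terminal E followed by the stop, which is not counted in the 431 aa protein length.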