Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1932 |
Symbol | |
ID | 9245782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2354482 |
End bp | 2356050 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | polysaccharide deacetylase |
Protein accession | YP_003679865 |
Protein GI | 297560891 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.461546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.430114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCCCC TCGTGGCTGT CACCTCACTA CGACGCAAAC TCTCCGTCAG CGCCGCCGTG GGTCTGCTCC TGGTCCCCGG CTGCCGTGCC GTGGAACAGG AGCTTCCCTA CGCCGAACCG GTCCGCTCCT CGCCCGACGA CCTGGCGGGG ATGGAGGCGA GCACCGTGTC GGTGACCACC GACGACAGCG TGGTGAGCTA CCGCTACCCG CGGTTGGGCG ACAACCACCC GCTGACGGTG GAGGCGCGCA CCGCCATGGC CGCGCGCCAG ACCGCCTTCC TGGAGGACCT GCCGGAGGGG GGGTCCCCCG AGCTGCACCA GGACACGGTC ATCCTCGCCG CCTCCCCTGA GGTGGTGGGC GCGCGGCTGA CCGCGACCGC CTCCGCGGGC GGGGACGAGA CCTTCGAGGC CGACACCCTG TGGTACGACG CGGGCAGCGA GGAGGTGCTG CCCTGGACCT CGCTGTTCCG GGACGAGGAG GCGATCGAGC AGGCCCACCT GGCCCTGGCC GACGTCCTGG AGGAGGGGTA CGACCTGCCC GAGCACCAGC TGCCCGGCCT GGTCGGCGAG GTCGCCTCGG GCGAGCGGAC CGCGCCGGAG GGCGGCGCGT CCGCTGACGG CCGGGACGGG ACGGCCGACG GTGGGGCGGA CGGGGGTTCC GAGCCGCTCG ACCTGTCCGA GCCCGGCCAG GCCCGCAAGG CCGCCGAGCG CTGGGCGGGC TCGCCGCTGG GCGACCTGGC ATTCAGCACC GCGGGCGGCC TGGCCGTGCG CATGGACCCC GACGCGGTGC CCGGCGCGGG CCGGGTCGGC GAGGTGCTGG TTCCGGTGGA GGCCGCCGAG GTCGAGGGGC TGCTGTCCGA GCTGGGCTAC CTGGCCCGGG AGGCGGCCCT GAGCGGGGAC GGCCTGGGCG ACGACCTCTC GCAGGACGGG GGCGGCCTGA GCGCCCAGGG GCACACCCTG GACTGCGAGC GCCTCAAGTG CGTGGCGCTG ACCTTCGACG ACGGGCCGGG GGAGCACACC GACCGGTTGC TGGACAGCCT GGCCGAGTAC GACGCCCACG CCACGTTCTA CGTGCTGGGT TCGCTCGTGG ACGACTTCCC CGCACCGGTG GAGCGCATGG CCGAGGAGGG CCACGAGCTG GGCAACCACA CCTGGAAGCA CGACGACCTG GCGAAGATGT CGGCCGACGG GATCCGCAAG GACATCGAAC GCACCAACGC GGCCGTGCGC GAGGTGACCG GGGTGGAACC GCCGACCATC CGGCCGCCCT ACGGGTCGCT GAACGGGACG GTGCGCAAGA CGGTGGAACA GCCCCTGGTC CTGTGGGACG TGGACACGCT GGACTGGAGG AGCCGCGACA CCGAGGACGT CAGTGAGGCG GCTCTGGACA ACACCGTGCC GGGGAGCGTG GTGCTCTTCC ACGACATCCA CGAGACCTCC GTGAAGGCGA TCCCGGACGT GCTGGCGGGG CTGCACCGGC AGGGCTACCA CTTCGTGACG GTCACCGACA TCTTCGGTTA CCAGGGCATG GAGTCGGGCG ACGTGTACAC CGACGCGCGG CTCTCGTAG
|
Protein sequence | MVPLVAVTSL RRKLSVSAAV GLLLVPGCRA VEQELPYAEP VRSSPDDLAG MEASTVSVTT DDSVVSYRYP RLGDNHPLTV EARTAMAARQ TAFLEDLPEG GSPELHQDTV ILAASPEVVG ARLTATASAG GDETFEADTL WYDAGSEEVL PWTSLFRDEE AIEQAHLALA DVLEEGYDLP EHQLPGLVGE VASGERTAPE GGASADGRDG TADGGADGGS EPLDLSEPGQ ARKAAERWAG SPLGDLAFST AGGLAVRMDP DAVPGAGRVG EVLVPVEAAE VEGLLSELGY LAREAALSGD GLGDDLSQDG GGLSAQGHTL DCERLKCVAL TFDDGPGEHT DRLLDSLAEY DAHATFYVLG SLVDDFPAPV ERMAEEGHEL GNHTWKHDDL AKMSADGIRK DIERTNAAVR EVTGVEPPTI RPPYGSLNGT VRKTVEQPLV LWDVDTLDWR SRDTEDVSEA ALDNTVPGSV VLFHDIHETS VKAIPDVLAG LHRQGYHFVT VTDIFGYQGM ESGDVYTDAR LS
|
| |