Gene Information |
Locus tag | Ndas_4906 |
Symbol | |
ID | 9248793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 35970 |
End bp | 37286 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003682795 |
Protein GI | 297563822 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
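The length fields in the gene information above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the record states neither explicitly). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(35970, 37286)
print(length)                  # 1317
print(protein_length(length))  # 438
```

Under these assumptions the computed values agree with the reported gene length (1317 bp) and protein length (438 aa).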
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.507884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCACG GACCACCGGG ATCGTCGGGA CCGCCCGGCC CGCCGGGCCT GAGGCGCCGC ACCCTGCTGC GCGGTCTGGC CGGCGCCGGA ACGCTGCTGG CCCTGCCCTC CCTGGCGGGC TGCGGAGCGG GCCGGCGCGG GGACCCGAAC CTGGTCACCC TCGCCTCCAA CCGGGCCAAC CCCGCCCAGC GCGAGGCCGT CGCCGAGAGC GTCGGGCTCT TCGAGGCGGA CTCCGGGCTG ACCGTGGAGG TCAACACCTT CCAGTCCACC GCCTTCCAGG AGAGCGTCAA CAACTACCTC CAGGGCACCC CGGACGACGT CATCGGCTGG TTCGCTGGCT ACCGGACGCG CTTCTTCGCC GAACGCGGCC TCATCAGCGA CGTCTCCGAG GTCTGGGACC GCCACTTCGG GGACGTCTTC ACCGACCAGG TGCGCGGGCT GTGCACGGCC GACGACGGCA AGCAGTACAT CGTCCCCGAC TCCACCGCGC CCTGGGCGGT CTTCCACCGC AGGAGCGTGT TCGAGGAGCA CGGCTACGAG GTCCCCGCCA CCCGCGCGGA GTTCGAGGAG CTGTGCGTGC GCATGCGCGC GGACGGGCTC GAACCCCTCG CCTCGGGCAT CCGCGAGGGC TGGCCCGCGA TGGGCATGTT CGACCACCTC AACCTGCGGC TGAACGGCCC CGAGTTCCAC CTGGAGCTGC TCAACGGCGA CCACTCCTGG GACTCCGCCG AGGTCAGGAG CGTCTTCGGC ACCTGGGCCG AGCTGCTGCC CCACCACCAG CCCGACCCCC TGGGCCGGGG GATCAACGAG GCCCAGACCG CCCTGGTCCG GCGCGAGGCC GGGATGATGC TCTGCGGCAT GTTCATCACC CACGTCTTCC CCGAGGGAGA GGACCTGGAC GACCTGGACT GCTTCGCCTT CCCCGAGTTC GACCCCGCGA TCGGCGCCGA CGCCGTCGAG GCGCCCATCG ACGGGTTCAT GCTCTCCGGG GACCCGCGCA ACCCCGACGG GGCCCGCGAA CTCGTGGGGC ACCTGGGCAC CCTCCGGGCC CAGGAGATCT ACACCGCCAT CGACCCGCAG GCGCTCCCCA CCCACCTGGA CGCCGACACC AGCGGGTTCA GCGCCCTCGA CGCCAAGGTC AACGACATGG TCGCCAACGC GGGAGCGCTC ACCCAGTACA TGGACCGGGA CACCCGTCCC GACTTCGCGT CCGTCGTCAT GATCCCCGCG CTCCAGCGCT TCCTCGAACA GCCGGACGAC ATCGCCGACC TGACCGCCAG CATCCAGCGC CAGAAGGTCT CCATCTTCGG CGGCTGA
|
Protein sequence | MHHGPPGSSG PPGPPGLRRR TLLRGLAGAG TLLALPSLAG CGAGRRGDPN LVTLASNRAN PAQREAVAES VGLFEADSGL TVEVNTFQST AFQESVNNYL QGTPDDVIGW FAGYRTRFFA ERGLISDVSE VWDRHFGDVF TDQVRGLCTA DDGKQYIVPD STAPWAVFHR RSVFEEHGYE VPATRAEFEE LCVRMRADGL EPLASGIREG WPAMGMFDHL NLRLNGPEFH LELLNGDHSW DSAEVRSVFG TWAELLPHHQ PDPLGRGINE AQTALVRREA GMMLCGMFIT HVFPEGEDLD DLDCFAFPEF DPAIGADAVE APIDGFMLSG DPRNPDGARE LVGHLGTLRA QEIYTAIDPQ ALPTHLDADT SGFSALDAKV NDMVANAGAL TQYMDRDTRP DFASVVMIPA LQRFLEQPDD IADLTASIQR QKVSIFGG
|
| |
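The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The sketch below, in Python, shows one way to strip the spacing, compute GC content, and translate a CDS. It relies on the fact that translation table 11 uses the same codon assignments as the standard code (the tables differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 assigns the same amino acids to these codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Ambiguous bases (e.g. N) are not handled and raise KeyError.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_bases = "ATGCACCACG GACCACCGGG ATCGTCGGGA"  # first 30 bases of the gene sequence
print(translate(first_bases))             # MHHGPPGSSG
print(round(gc_percent(first_bases), 1))  # 66.7
```

The translated prefix matches the first block of the listed protein sequence. Applying `gc_percent` to the full gene sequence would give a figure to compare with the reported GC content.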