Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3638 |
Symbol | |
ID | 9247507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4362559 |
End bp | 4364226 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003681543 |
Protein GI | 297562569 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.339986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.415005 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAGGA AGCCTTTCCG AAAGCCGCTG GCCCTCCTGG CCTCCGCGGC CGCGCTCACG CTGGTGGCCA CGGCGTGCGC CGAGAGTAAC CGCGAGGGCG GGGGGACCGA CGCGTCGGAA CCCTTCGTCT TCGCCTCCGC GGGCGACATC AAAACCCTCG ACCCCTTCCT CACCAGTGAC GGTGAGACCT TCCGTTACAG CAGGCAGGTA TTCGAAACCC TTCTCGAACA CGAATCGGGT GGGACCGAAA TCGTCGGCGG ACTCGCCGAG GACTGGGAGC AGTCCGAGGA CGGCACCGTC TGGACCTTCC ACCTGCGCGA CGGCGTCCTG TTCCACGACG GCGACGAGTT CAACGCCGAG GCGGTCTGCG CCAACTTCGA CCGCTGGTAC AACCTCACCG GCGGTTTCCA GAGCTCGAAC AACTCCTATT ACTGGCAGTC GATCTTCGGC GGCTTCGCGG AGAACGAGAG CGAGGACCTC GCCGAGTCCC GGTACGTCTC CTGCGAGGCC ACCGACGAGC TGACCGCGGT CATCACGATC GACGAGTACT CCTCGATCTT CCCCGGCGGC TTCAGCCTCG CCTCGTTCGG CATCATGAGC CCCAGCACGC TGGAGGCCAT CGCCGACGCC GAGATCACCG GCGAGGAGGG CAACTTCACC CTCCCCGAGT ACACCCAGAC GGCCGGAACC GTCGCGGGCA CCGGGCCCTT CACCGTCCAG GAGTGGGACC ACGACCAGGC CGAGGTGACC CTCCAGCGCT TCGACGACTA CTGGGGCGAG GCCGCGGGCT TCGAGACGAT GATCCTGCGC GCGATCCCCG ACGAGACCGC CCGCCGCCAG GCCCTGGAGG CGGGTGACAT CCACGGCTAC GACCTGGTCG CCCCCGCCGA CGTCGCCCCC CTGTCCGAGG CCGGGTTCCA GGTGCCCACC CGCGGCGTGT TCAACGTCCT GTACATGGCC TACCAGCAGG AGGCCAGCGA GGCGCTCGCC GACCTTGAGG TGCGCCAGGC CCTCGCCCAC GCCGTGGACC GCCAGCGCAT CGTCGACACG ATCCTGCCCG AGGGCGGCGA GGTCGCGAGC CAGTTCCACC CCGACACCCT CGACGGCTGG TCCCCGGACG TGCAGACCTA CGAGTACGAC CCCGAACTGG CCAGGGAGAT GCTGGCGGAC GCCGGGCAGG AGGACCTGAC CCTGGAGTTC TGCTACCCGA CCGACGTCAC CCGCCCCTAC ATGCCCGCGC CGCGCGACAT CTTCGACGTC ATCGCCGCGG ACCTGGAGGC GGTCGGCGTC ACCGTGGAGC CGGTCACCTA CGAGTGGACC GAGTACGTGC CGCGCACCAA CTCGGGTGAG TGCCCGCTGT ACCTGCTCGG CTGGACCGGC GACTACAACG ACGCCTACAA CTTCATCGGC ACCTGGTTCT CCCAGTACAA CAGCGAGTTC GGCTTCCGTG ACGAGGACCT GTTCGAGGCC ATGGAGGAGG CGAGCACCAA CCCGAACCAG GAGGAGCGCG TCGCCGCCTA CCAGGACCTG AACAACCAGA TCATGGACAT CCTGCCGGGG CTGCCCATCT CCAGCTCCCC GCCGTCCATC GCCTTCTCCG CGAACGTCAA CCCGCCCAAC GTCAGCCCGC TGACCCAGGA GCAGTTCGCC GAGGCCTCCT GGAAGTAG
|
Protein sequence | MFRKPFRKPL ALLASAAALT LVATACAESN REGGGTDASE PFVFASAGDI KTLDPFLTSD GETFRYSRQV FETLLEHESG GTEIVGGLAE DWEQSEDGTV WTFHLRDGVL FHDGDEFNAE AVCANFDRWY NLTGGFQSSN NSYYWQSIFG GFAENESEDL AESRYVSCEA TDELTAVITI DEYSSIFPGG FSLASFGIMS PSTLEAIADA EITGEEGNFT LPEYTQTAGT VAGTGPFTVQ EWDHDQAEVT LQRFDDYWGE AAGFETMILR AIPDETARRQ ALEAGDIHGY DLVAPADVAP LSEAGFQVPT RGVFNVLYMA YQQEASEALA DLEVRQALAH AVDRQRIVDT ILPEGGEVAS QFHPDTLDGW SPDVQTYEYD PELAREMLAD AGQEDLTLEF CYPTDVTRPY MPAPRDIFDV IAADLEAVGV TVEPVTYEWT EYVPRTNSGE CPLYLLGWTG DYNDAYNFIG TWFSQYNSEF GFRDEDLFEA MEEASTNPNQ EERVAAYQDL NNQIMDILPG LPISSSPPSI AFSANVNPPN VSPLTQEQFA EASWK
|
| |