Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1789 |
Symbol | |
ID | 9245639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2186931 |
End bp | 2188469 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679723 |
Protein GI | 297560749 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.407024 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCATG CCCCTGGCGG AGCGACAGCG CCCGGTGACG CCGCTCCTCC CCTACGGCCG AACGTGATCG TCGCCGTCCT GGCCTTCGGC GGGATCGTCG TCTCGCTCAT GCAGACCCTG GTCATCCCGC TCGTGCCCGT CCTGCCCGAC CTGCTGGGCG CCACGCCCGG GGACACCGCG TGGGCGATCA CCGCCACGCT GCTCGCGGCC GCGGTCGCCA CCCCGACGGT CGGCCGCCTG GGCGACATGT ACGGCAAGCG CCGCATGCTG CTGTTCAGCC TCGCCGTCCT CGTGGCCGGC TCGGTGCTGT GCGCCCTGGC CCACAGCCTG GTGCCGATGG TCGTCGGCCG CGCCCTCCAG GGCCTGGCGG CCGGGGTCAT CCCCCTGGGC ATCAGCATCA TGCGCGACGT GCTGCCCCCC GAACGGCTCG GCGGGGCGAC CGCGCTGATG AGCGCCTCGC TCGGGGTGGG CGGCGCGCTC GGGCTGCCCG CCGCCGCGCT CGTGGTGCAG GAGGCGGACT GGCACGTGCT GTTCTGGGTC GCCGCCGGGC TCGCGACCGG GGCCGCCGTG CTGGTGCGCG CCCTGGTCCC CGCGTCCGGG GTGCGCGCGG GCGGCAGGTT CGACCTGCCC GGGTCGGCGG GCCTGTCCGT GGCGCTGCTC CTGCTGCTGC TCGCGGTCTC CAAGGGCTCG GACTGGGGCT GGGGCAGCGG TGTCCCCGCC GCCATGCTCG CGGTCGCGGT CGCGGTGCTC CTGGTGTGGG GCTGGTGGGA GCTGCGTACG CCGCACCCGC TGGTCGATCT GCGCGTCAGC GCGCGGCGCC AGGTCCTGCT CACCAACACC GCGTCCCTGG TGTTCGGCTT CTCCATGTTC GCCATGTCCC TGGTCGTCCC GCAGCTGCTC CAGATGCCCG AGGTCACCGG CTACGGCTTC GGACAGACGA TCCTGGTCGC GGGTCTGGTG ATGGCGCCCA ACGGGCTGGT GATGATGGCG ATGTCCCCGG TCTCGGCGCG TATCTCCCGG GCGAGGGGTC CCAAGACGAC CCTGATGGTC GGCGCACTGC TGGTCGCGCT CGGCTACGGC CTGAGCCTGG TGTCCATGTC CGCCCTCTGG CAGCTCGTGA TCGCCTCCAC CGTCATCGGC GCGGGCGTGG GCCTGGCCTA CGGGGCCATG CCCGCCCTGG TCATGGCGGC CGTGCCCCGG ACCGAGACGG CGGCGGCCAA CAGCCTCAAC ACCCTGATGC GCTCCATCGG CACCTCCGTG GCCAGCGCCG TCGCGGGCGT GGTCATCGCC AACGTGACCA TGACCGCGGG CGCGGAGGTC CTCCCCGCCC GGGAGGCCTT CACCCTGCTC CTGGGCCTGG GCGCCCTCGC CGCGCTCGCC GCCTTCGCCG TCGCCGCCTT CCTGCCCGGG CGGGGCAGGC CGGCCGACCT CGACCGGCCC GAACCCCCGC CCGCGGCCCC CGACCAGCGC GCGGACCCCT CGGAGGACGA AGGGACCGGA CGCGACCAGG CGCGGGAGCA CCCGAGCGGG CTCCGCTGA
|
Protein sequence | MTHAPGGATA PGDAAPPLRP NVIVAVLAFG GIVVSLMQTL VIPLVPVLPD LLGATPGDTA WAITATLLAA AVATPTVGRL GDMYGKRRML LFSLAVLVAG SVLCALAHSL VPMVVGRALQ GLAAGVIPLG ISIMRDVLPP ERLGGATALM SASLGVGGAL GLPAAALVVQ EADWHVLFWV AAGLATGAAV LVRALVPASG VRAGGRFDLP GSAGLSVALL LLLLAVSKGS DWGWGSGVPA AMLAVAVAVL LVWGWWELRT PHPLVDLRVS ARRQVLLTNT ASLVFGFSMF AMSLVVPQLL QMPEVTGYGF GQTILVAGLV MAPNGLVMMA MSPVSARISR ARGPKTTLMV GALLVALGYG LSLVSMSALW QLVIASTVIG AGVGLAYGAM PALVMAAVPR TETAAANSLN TLMRSIGTSV ASAVAGVVIA NVTMTAGAEV LPAREAFTLL LGLGALAALA AFAVAAFLPG RGRPADLDRP EPPPAAPDQR ADPSEDEGTG RDQAREHPSG LR
|
| |