Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5018 |
Symbol | |
ID | 9248907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 160147 |
End bp | 161415 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003682905 |
Protein GI | 297563932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGAGCG TCGGCGGGCT GGTCGCCGCC CAGGCCCTGT GCTACACCGC CACGCGGCTG GCGATGATCG CCATCCCCTG GTTCGTCCTG GAGGCCACCG GCAGCCCGGC GTCCATGGGG GCGGTGGCCT TCTTCGAGAT CGGCTCCTAC ACCCTGGCCC GACTGCTGGG CGGCCCGCTG CTGGACCGGA TGGGTCAGCG TGCCGTGAGC GTGCGCGCCG ACCTCGTCGC CGCGGCGGCG GTGGCCTGCG TCCCGCTGCT GCACACGGCC GGGCTGCTGT CCTTCCCCGT GCTCCTGGCG CTGGTGACCG TGATCGGCCT GGCCACCGGC CCCGCCGAGG CCGCCAAGGT CTCCATGGCC CCGGCCGTCG CCGAACGCAC GGGGACGCGC CTCGAACGGG TCACCGGACT CACCGGAACC GTGGACCGGC TGTCCACGAC CGTCGGACCG GTGGCCGCGG GCGGACTGGT GTCCCTGCTC GGCGCGCTCC CCGCCCTGTA CACGAACGCC GCGCTGCTCG CCGCGGCGGC GGTGGTACTG GCCGCCACCC AGCCCGGGGA GCGCCCCCGC CCGGGCGGGG ACCCCGAGGC GGACGCGGGC TACCCGACCA GGCTGCGCAC CGGCTGGCGG GCGGTCTGGG GCGACGCCAC CCTGCGCGTT TTGGTGGTCG TGCTCGCGGT CACCAACATG ATCGACATGT CCGTGGCCTC GGTACTGCTG CCGGTGTGGG TGGACGACAA CGGCATGGGT CCCGCGGTGG TCGGCCTCCT GGGCGGCGTG ATGGGCGCGG CCTCGGTGGT GGGCTCGCTC GCGGCCACGG CGGTCGGGCA CCTGCTGCCC CGGAGGGCGG TGTTCTTCGC GGGGCTGGTG CTGGCCGGGC CGCCCCGGCT GGTCGTGCTG GCGCTGGACG TGCCGCTGTG GGCGGTCCTG GCGGTGTGGG GCCTGTGCGG GCTCGCGGGC GGGGTACTCA ACCCGATCCT GGGCGCGGTG CTGTTCGAGC GCCTGCCCCG CCGAGCCGTG GGGCGGGGCA CGGCGACGAT CGGCGCGCTG ACCCGGATGG CGGCGCCGTT GGGCGCGCCC GTCGCGGGCG CGGCGGCCGG ACTGCTCGGA GCGGCGCCGG TGCTACTGGC GTGCGCGGCC CTCTACCTGG CCGCGGTCCT GCCGCCGCTC GTGGGCCGCG CGGCCGAGGG CATCGACAGG CACGGGACGG AGGCGCGCCG GTCAGCGGGC GGGGCGGGCG GGGCGGACGC GGGCACGGGC ACAGGGTAG
|
Protein sequence | MRSVGGLVAA QALCYTATRL AMIAIPWFVL EATGSPASMG AVAFFEIGSY TLARLLGGPL LDRMGQRAVS VRADLVAAAA VACVPLLHTA GLLSFPVLLA LVTVIGLATG PAEAAKVSMA PAVAERTGTR LERVTGLTGT VDRLSTTVGP VAAGGLVSLL GALPALYTNA ALLAAAAVVL AATQPGERPR PGGDPEADAG YPTRLRTGWR AVWGDATLRV LVVVLAVTNM IDMSVASVLL PVWVDDNGMG PAVVGLLGGV MGAASVVGSL AATAVGHLLP RRAVFFAGLV LAGPPRLVVL ALDVPLWAVL AVWGLCGLAG GVLNPILGAV LFERLPRRAV GRGTATIGAL TRMAAPLGAP VAGAAAGLLG AAPVLLACAA LYLAAVLPPL VGRAAEGIDR HGTEARRSAG GAGGADAGTG TG
|
| |