Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3224 |
Symbol | |
ID | 9247081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3853996 |
End bp | 3855297 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003681136 |
Protein GI | 297562162 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAGGTA GGTCACACGT GCGCCCACCC CCCGCGCAGC CCTCCCCCTC CCCGCGAACG CCCGCCGCTC TCGCCGAACC CACCAGACCC GTGCGCCGCC TGTGGGTGGC CGGGATCAGC CTGGCCAACC TCGGCATGTG GATGGCCTTC TTCGGCCCGC TCCAGGTCCT GCTGCCCGAA CAGGTCGGCC TGCTCGCCCC CGACGCCAAG GAGACCGCCC TGGCCTGGGT GACCGGGGCG GGCGCGGCCT GCTCCACCCT GGGCACCCCG CTGGCCGGGG CCCTGTCCGA CCGGACCACC GGCCGGTTCG GGCGCCGACG CCCCTGGATC CTGGCCGGTG CGCTCCTGGG CGGTCTGGGA CTGGTGGTGC TCGGCCGACA GGACGGCGTC CTCGGCGTCC TGGTGGGGTG GTCCTTCGTC CAGGCCGCGC TGTCCTGCCT GAACGCGACC CTGCTCGCGG CCGTGCCCGA CCACGTGCCG GTCCGCCAGC GCGGCGTGGT CTCGGGGTGG ATCGGCGTAC CGCAGTCGGC GGGCGTGGTG GTGGCGGTGC TCCTGGTGAC CGTGGTGGTC ACGGGGATCG CCCCCGGCTA CGCCCTCCTC GGCGCGCTGA CCGTCCTGTG CGTCCTGCCG TTCGCGCTGC TGGCCCCCGA CCCGCCGCTG CCCCGCGAGG CGCGCCCCTC CTGGAGCGGG TTCGCGCGCG GGCTGTGGGT GTCCCCGCGC CGCCACCCCG ACTTCGGCTG GGCGTGGCTG ACCCGTTTCC TCATGCAGAC GGGCAACGCC ATGTTCACCC TCTACCTGCT CTACTTCCTC ACCGACGCGG TGGGCTACGA GGAGCTGTTC CCCGGTTCCT CGGCCGCGGA CGGGCTGCTG GTGCTGATCG CCGTGTACAC GGCGGCGGTC GTGGCCACGA CCGTGCTGGC CGGTCTGGTC TCGGACCGCA TCGGCCGCCG CAGGGGGATG GTGTGCCTGT CCGGGGTGGT CTCGGCGGTG CCCGCCTTCC TGATCGCCGC GTTTCCGACG TGGCCGATGA GTCTGGTGTG CGCGGTGGTG CTGGGGGTCG GCTTCGGCGT CTACCTGTCC GTGGACAACG CCCTGGTCAC CGAGGTCCTG CCGAGCGCCG GCGGGCGGGC CAAGGACCTG GGGATCGTCA ACATCGCCAG CGCCGGACCG CAGGTCATCG CCCCGGCCCT GGCCGGTCCG ATCGTGGTCC ACCTGGGCGG CTACCCGGTC CTGTACACCG TGTGCGGGCT GCTCAGCCTG CTGGGCGGGG TGCTGGTGTG GCGGATCCGG GGTGTGGCAT GA
|
Protein sequence | MLGRSHVRPP PAQPSPSPRT PAALAEPTRP VRRLWVAGIS LANLGMWMAF FGPLQVLLPE QVGLLAPDAK ETALAWVTGA GAACSTLGTP LAGALSDRTT GRFGRRRPWI LAGALLGGLG LVVLGRQDGV LGVLVGWSFV QAALSCLNAT LLAAVPDHVP VRQRGVVSGW IGVPQSAGVV VAVLLVTVVV TGIAPGYALL GALTVLCVLP FALLAPDPPL PREARPSWSG FARGLWVSPR RHPDFGWAWL TRFLMQTGNA MFTLYLLYFL TDAVGYEELF PGSSAADGLL VLIAVYTAAV VATTVLAGLV SDRIGRRRGM VCLSGVVSAV PAFLIAAFPT WPMSLVCAVV LGVGFGVYLS VDNALVTEVL PSAGGRAKDL GIVNIASAGP QVIAPALAGP IVVHLGGYPV LYTVCGLLSL LGGVLVWRIR GVA
|
| |