Gene Information |
Locus tag | Ndas_1259 |
Symbol | |
ID | 9245109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1563277 |
End bp | 1564896 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679204 |
Protein GI | 297560230 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
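The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive (this reproduces the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1563277, 1564896)
print(length, protein_length_aa(length))
```

With the values from this record, the calculation gives 1620 bp and 539 aa, matching the Gene Length and Protein Length fields.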
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGCG CGAAGCCGAC CCCCGCACGC GCGGGAGGGC GCGAGTGGCT CGGGCTCGGG GTGCTGGCCC TGCCCACCCT GCTGCTCTCA CTGGACATGA GCGTGCTCTA CCTGGCGCTG CCCCACCTGG CCGCCGACCT GCGGCCCTCC GGCAGCCAGC TGCTGTGGAT CATGGACGTC TACGGCTTCA TGATCGCCGG GTTCCTCATC ACCATGGGCA CCCTCGGCGA CCGCATCGGC CGCAGGCGCC TGCTCATGAT CGGCGCCGCC GTCTTCGGCC TGGCCTCCGT GGCCGCGGCC TTCGCGCCCA GCTCCGCCGC GCTCATCGCC ACCCGCGCTC TCATGGGCGT GGCCGGAGCC ACCCTCATGC CCTCCACCCT GGCCCTGATC AGCAACATGT TCACCGACCC GCGCCAGCGC GCGGTGGCCA TCTCGGTGTG GACGAGCTGT TTCATGGGCG GCACCGCCAT CGGACCGGTC GTGGGCGGAC TCCTCCTGGA GTGGTTCTGG TGGGGGTCGG TGTTCCTGCT CGGCGTCCCC GTCATGCTGC TGCTCCTGGT GTGCGCGCCC CTGCTGCTCC CCGAGCACCG CGCGCCCGAA CCCGGTCGGC TGGACCCGGT CAGCGTCGCC CTGTCCCTGG CCGCCATCCT GCCGGTCGTC TACGGCCTCA AGGCCGTTGC CGAGGGCGGG CCGCTCCTCG GACCGCTCGC CTCCCTCGCC TTCGGGCTGG TCATGGGCGC GGTGTTCGCC CGCCGCCAAC TGCGCCTGCC CGACCCGCTG CTGGACCTCG CCCTGTTCCG CCAGCCCTCC TTCGGGGTCG CGCTGGGCGT GATGATGGCG GGCGCGGTCA CCATGGGCGG CACGTTCCTG CTGATCAGCC AGTACCTCCA GATGGTCGCC GGGCACTCCT CGCTGGTCGC GGGGATGTGG CAGGTGCCCC CGGCGCTGGC GATGATCGCC GCCACCATGG CCGGAGGGCC GCTGGCCGCG CGCGTGGGCC GGGCCAACGT CATCGGCGGC GGCATGCTGG TGACCGCGTC GGGGTTCGCC CTGCTGTTCC TGGTCCCCGT CGAGGGCGGA ACAGCGCTCG TGGTCGCCGG GCTGCTGCTG GCCTCGGTGG GGCTGGGGCC CGGCGCCGCC CTGGTCACCG ACATCGTGGT GGGCTCCGCG CCCAGGGAGA GGGCCGGGGC CGCCGCGTCG ATGTCCGAGA CCAGCGGGGA GTTCGGCGTC GCCATGGGCG TGGCCCTGCT GGGCAGCCTG GCCTCGGCGG TCTACCGCGC CGAGGCCAGC GTTCCCGAGG GCCTGCCCGA GGAGGCGGGC CAGACCCTGC CCGCCGCCGT GGCCGTGGCC GCGGAGCTGC CCGCGGGCCT GGCCGAGAGC CTGCTCGGTC CGGCGCGCGA GGCCTTCACC TCCGGCATCA ACCTGGTCGG GGTGATCGGC TGCCTGGCCA TGGCCGTGTT CGGAGTGGCC GCCGTGGTCC TGCTGCGCCC CGCGCCGTCG GGGACACCGC CGGAGCGCGA ACCGGCGGAG GCACCGGTGG ACGCCGAGGG GGAGAGCGCC GCGGACACGC CCGAGGCCGG GGCCGCGAGC GCCCCGGCGC CCGGACCGGC GGGCGGCTGA
|
Protein sequence | MNGAKPTPAR AGGREWLGLG VLALPTLLLS LDMSVLYLAL PHLAADLRPS GSQLLWIMDV YGFMIAGFLI TMGTLGDRIG RRRLLMIGAA VFGLASVAAA FAPSSAALIA TRALMGVAGA TLMPSTLALI SNMFTDPRQR AVAISVWTSC FMGGTAIGPV VGGLLLEWFW WGSVFLLGVP VMLLLLVCAP LLLPEHRAPE PGRLDPVSVA LSLAAILPVV YGLKAVAEGG PLLGPLASLA FGLVMGAVFA RRQLRLPDPL LDLALFRQPS FGVALGVMMA GAVTMGGTFL LISQYLQMVA GHSSLVAGMW QVPPALAMIA ATMAGGPLAA RVGRANVIGG GMLVTASGFA LLFLVPVEGG TALVVAGLLL ASVGLGPGAA LVTDIVVGSA PRERAGAAAS MSETSGEFGV AMGVALLGSL ASAVYRAEAS VPEGLPEEAG QTLPAAVAVA AELPAGLAES LLGPAREAFT SGINLVGVIG CLAMAVFGVA AVVLLRPAPS GTPPEREPAE APVDAEGESA ADTPEAGAAS APAPGPAGG
|
| |
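The sequences above are printed in space-separated blocks of ten characters, so the blocks need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation of the kind summarized in the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(field: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence field, used here only as sample input.
first_blocks = "ATGAACGGCG CGAAGCCGAC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))
```

Applying the same two functions to the complete Gene sequence field should give a value that can be compared with the GC content reported in the Gene Information table.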