Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1781 |
Symbol | |
ID | 9245631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2180395 |
End bp | 2181606 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679715 |
Protein GI | 297560741 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.961264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.558818 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGGA CCCAGGCGAG GCCGCCCGGG CGGGACACCG GCGCGTCGCG CTCCCGGCTC GACGTCGTGC GCTGGCAGGT CGGGTACGGG ATGTTCGGCG TGCCCCAGGC CGCCGCCCCC ATCGCCTTCG CCCTGCTCGC CCTCCCCATC ACCGGCACGG CCGAGTCCGG CGCGGCTCTG GTCTTCGCCA TGACGGCCGC GCAGGTGCTC GGCGCCGTCC CCGTGTCCCG TCTGGGCCGC CGGTTCAACG GCGTCCACTA CCTGCGCGCA CTCATCGCCG TCCGAACGCT CGCGCTCGCC GCCGTCACCG TGCTGGCGGC GGTGCAGGCC CCCTTCGGGC TGCTCCTGGT CGCGGTCACC GCGGCGGGAG CCGTCAACGG CGCCGCGTAC GGCTACCAGC GGCTCCTGCT CAACCACCTC GTGGAACCGT CCGGGCTCCC CCGCGCGCTG GGCGTGGCCG CGACGCTGAA CGAGGTCGGC TTCGCTCTGT CCCCCGTGCT CGCCTCGGTT CTCGGCGCCG TCTCGCCCGT CTGGGCCATG GCGGCGGTCA CCGCGCTGGG CGTGGGCCCG CTGCTCCTGA TGCCGCGCGT ACCCGGGGCC CGCGGGCCGC AGGGCGGAGA GGCTCCCCGC GTGCGGACGC CGGTACCCCC CGCGGTGTTC CTGTGGCTGT TCTGCGCGGC CGCGAGCGCG GGGGCTGTCG CGGCCGTCGA GGTCGGAGCG GTCTCCTTCG CGCTGTCCTT CGGACTCGAA CCGGGCTGGG CCTTCCTGTT CGCCCTCGTG CTGTGCGCGG GCTCGGTCGC GGGCGGGGTC TGGGTGAGCG TGCGCAACCG CACGCCCGCC CCCTGGCAGG TCGTCGCCTT CCTGGCGGCG ACCACCGCGG GCTCCGGGCT GGTCCTGGTC GGCGGGCACC TCTCCCTGAC GCTCGCCGGC GCGGCCGTCA TCGGGCTCTT CCTGCCGATG CTGGGCACGT TCTACTCGCT CGCCCTGGAC GGGCTCGCGC CGCCGGACCG CCGCGCGGAG ATGTTCGCGC TCCTGCGCAC CGCGAGTTCG CTCGGCATCA TCGCCGTGAG CGGCCTGCTC GCCCTCCTCG GCCTGCGGGC CGCCCTCGTC GGCAGCTTCG CGCTCCTGCT GGTGGCGTCC TCCCTCGCGG CGGCGCACCA CGCGCGCTCC CGCGTCGCCG CGGCCCCGCC GACCGCGCCC GACGGAGTGT GA
|
Protein sequence | MARTQARPPG RDTGASRSRL DVVRWQVGYG MFGVPQAAAP IAFALLALPI TGTAESGAAL VFAMTAAQVL GAVPVSRLGR RFNGVHYLRA LIAVRTLALA AVTVLAAVQA PFGLLLVAVT AAGAVNGAAY GYQRLLLNHL VEPSGLPRAL GVAATLNEVG FALSPVLASV LGAVSPVWAM AAVTALGVGP LLLMPRVPGA RGPQGGEAPR VRTPVPPAVF LWLFCAAASA GAVAAVEVGA VSFALSFGLE PGWAFLFALV LCAGSVAGGV WVSVRNRTPA PWQVVAFLAA TTAGSGLVLV GGHLSLTLAG AAVIGLFLPM LGTFYSLALD GLAPPDRRAE MFALLRTASS LGIIAVSGLL ALLGLRAALV GSFALLLVAS SLAAAHHARS RVAAAPPTAP DGV
|
| |