Gene Information |
Locus tag | Ndas_0015 |
Symbol | |
ID | 9243842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 19199 |
End bp | 20569 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003677974 |
Protein GI | 297559000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
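The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the annotated CDS includes its stop codon; the helper names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 19199, End bp 20569.
length = gene_length(19199, 20569)
print(length)                  # 1371
print(protein_length(length))  # 456
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.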
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.170297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0989418 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACTT CCGACTCGAC GGCCCCAGGG CCGGGCGGCG CGATCGAGAC CAGGGAGCGG CGGCGCGTCC TCGCCGGGAC CATGGTCGGC ACCACCATCG AGTGGTACGA CTTCTTCATC TACGCGCAGG CCGCCGGCCT CGTGCTCGCC CCCCTGTTCC TGTCGCCGCT GACCGAGGAC AGCCCGGGGC TGGCCCAGGT CCTGTCCTTC GCGACCATCG GCATCTCCTT CCTCTTCCGG CCGCTCGGCG CGATCGTCGC GGGCGCCCTC GGAGACAGGT TCGGCCGCAA GCGCGTGCTC GTGGCGACCC TGGTCATGAT GGGGCTCGCC ACCTGCCTGA TCGGCCTGCT GCCCACCTAC GCCCAGATCG GCGTGGCCGC GCCCGTCCTG CTGATCATCC TGCGCATCCT CCAGGGCTTC TCGGCGGGCG GCGAGTGGGG CGGCGCGGCA CTGATGTCGG TGGAGCACGC ACCGGTCGAC AAGCGCGGCT TCTTCGGCGC CTACCCGCAG ATCGGAGTCC CCTGCGGCAT GATCCTGGCG ACCTTCGTCG TCTGGGTGAT CACCGCGGCC ATCGGCCCGG AGGCGTTCCT GGAGTGGGGC TGGCGCATCC CCTTCCTCCT GTCCTTCCTG CTGATCATCA TCGGCCACCT CATCCGCAAG TCCGTGGAGG AGTCCCCGGT CTTCAAGCTC ATGCAGGCGC GCAAGGCCGA GACCTCCGCC CCGCTGGGCC GACTGTTCCG CGAGCACACC CGTGAGGTCG TCCTCTCCGC GCTGATCTTC ATCGCCAACA ACGCCGCCGG GTACCTCGTC ATCGCCTACC TGGCGACCTA CGCCTCCCGG CCGGTCGAGG AGTTCGGCCT CGGCATGGAC CGCGGCCCCG TGCTCCTGGC GACCACCCTC GCCTCGTTCG GCTGGCTCAT CTCCACGCTC TACGGCGGCA TCCTGAGCGA CAAGCTCGGC CGGGTGCGGA CCTTCCAGCT CGGCTACGTG CTGCTGGCCG CCTGGTCCGT GCCGATGTGG TTCATGGTCG ACACCGGCAA CATCTACCTG TACTTCGCGG GCGTCTTCAT CTTCACGCTC ACCCTGGGCC TGAGCTACGG CCCCCAGTCG GCGCTGTACG CGGAGATGTT CCCGGCCGAG GTCCGCTACT CCGGCGTGTC CATCGGCTAC GCCCTCGGCG CGATCCTCGG CGGCGCCTTC GCGCCCATGA TCGCCGAGCT GCTGCTCACC GAGACCGGCG CCTCGTGGTC GATCGGCGTC TACATCGTCG TGGCCTGCGC GGTCTCCTTC CTCGGGGTCA CCCTGGTGAA GGAGCCCAAG GGCGTGGACC TGTACGCGGA CGGCACCAGG CCGAACGCGG TCGGCAAGTA G
|
Protein sequence | MATSDSTAPG PGGAIETRER RRVLAGTMVG TTIEWYDFFI YAQAAGLVLA PLFLSPLTED SPGLAQVLSF ATIGISFLFR PLGAIVAGAL GDRFGRKRVL VATLVMMGLA TCLIGLLPTY AQIGVAAPVL LIILRILQGF SAGGEWGGAA LMSVEHAPVD KRGFFGAYPQ IGVPCGMILA TFVVWVITAA IGPEAFLEWG WRIPFLLSFL LIIIGHLIRK SVEESPVFKL MQARKAETSA PLGRLFREHT REVVLSALIF IANNAAGYLV IAYLATYASR PVEEFGLGMD RGPVLLATTL ASFGWLISTL YGGILSDKLG RVRTFQLGYV LLAAWSVPMW FMVDTGNIYL YFAGVFIFTL TLGLSYGPQS ALYAEMFPAE VRYSGVSIGY ALGAILGGAF APMIAELLLT ETGASWSIGV YIVVACAVSF LGVTLVKEPK GVDLYADGTR PNAVGK
|
| |
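The protein sequence can be derived from the gene sequence by codon translation, and the GC content by counting G and C bases. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ in the set of permitted start codons, which does not matter here because the gene begins with ATG). The names `translate`, `gc_fraction` and `clean` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "ATGGCAACTT CCGACTCGAC GGCCCCAGGG"
print(translate(prefix))  # MATSDSTAPG
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the "Protein sequence" and "GC content" fields to be checked against the record.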