Gene Ndas_0015 details


Gene Information

Locus tag: Ndas_0015
Symbol:
ID: 9243842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 19199
End bp: 20569
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003677974
Protein GI: 297559000
COG category:
COG ID:
TIGRFAM ID:
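
The length fields above agree with the coordinates if positions are read as 1-based and inclusive. A minimal sketch of that arithmetic follows; the inclusive convention and the single untranslated stop codon are assumptions, and the variable names are illustrative rather than part of the record.

```python
# Values copied from the record above.
start_bp = 19199
end_bp = 20569

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1371 456
```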


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.170297
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0989418
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACTT CCGACTCGAC GGCCCCAGGG CCGGGCGGCG CGATCGAGAC CAGGGAGCGG 
CGGCGCGTCC TCGCCGGGAC CATGGTCGGC ACCACCATCG AGTGGTACGA CTTCTTCATC
TACGCGCAGG CCGCCGGCCT CGTGCTCGCC CCCCTGTTCC TGTCGCCGCT GACCGAGGAC
AGCCCGGGGC TGGCCCAGGT CCTGTCCTTC GCGACCATCG GCATCTCCTT CCTCTTCCGG
CCGCTCGGCG CGATCGTCGC GGGCGCCCTC GGAGACAGGT TCGGCCGCAA GCGCGTGCTC
GTGGCGACCC TGGTCATGAT GGGGCTCGCC ACCTGCCTGA TCGGCCTGCT GCCCACCTAC
GCCCAGATCG GCGTGGCCGC GCCCGTCCTG CTGATCATCC TGCGCATCCT CCAGGGCTTC
TCGGCGGGCG GCGAGTGGGG CGGCGCGGCA CTGATGTCGG TGGAGCACGC ACCGGTCGAC
AAGCGCGGCT TCTTCGGCGC CTACCCGCAG ATCGGAGTCC CCTGCGGCAT GATCCTGGCG
ACCTTCGTCG TCTGGGTGAT CACCGCGGCC ATCGGCCCGG AGGCGTTCCT GGAGTGGGGC
TGGCGCATCC CCTTCCTCCT GTCCTTCCTG CTGATCATCA TCGGCCACCT CATCCGCAAG
TCCGTGGAGG AGTCCCCGGT CTTCAAGCTC ATGCAGGCGC GCAAGGCCGA GACCTCCGCC
CCGCTGGGCC GACTGTTCCG CGAGCACACC CGTGAGGTCG TCCTCTCCGC GCTGATCTTC
ATCGCCAACA ACGCCGCCGG GTACCTCGTC ATCGCCTACC TGGCGACCTA CGCCTCCCGG
CCGGTCGAGG AGTTCGGCCT CGGCATGGAC CGCGGCCCCG TGCTCCTGGC GACCACCCTC
GCCTCGTTCG GCTGGCTCAT CTCCACGCTC TACGGCGGCA TCCTGAGCGA CAAGCTCGGC
CGGGTGCGGA CCTTCCAGCT CGGCTACGTG CTGCTGGCCG CCTGGTCCGT GCCGATGTGG
TTCATGGTCG ACACCGGCAA CATCTACCTG TACTTCGCGG GCGTCTTCAT CTTCACGCTC
ACCCTGGGCC TGAGCTACGG CCCCCAGTCG GCGCTGTACG CGGAGATGTT CCCGGCCGAG
GTCCGCTACT CCGGCGTGTC CATCGGCTAC GCCCTCGGCG CGATCCTCGG CGGCGCCTTC
GCGCCCATGA TCGCCGAGCT GCTGCTCACC GAGACCGGCG CCTCGTGGTC GATCGGCGTC
TACATCGTCG TGGCCTGCGC GGTCTCCTTC CTCGGGGTCA CCCTGGTGAA GGAGCCCAAG
GGCGTGGACC TGTACGCGGA CGGCACCAGG CCGAACGCGG TCGGCAAGTA G
 
Protein sequence
MATSDSTAPG PGGAIETRER RRVLAGTMVG TTIEWYDFFI YAQAAGLVLA PLFLSPLTED 
SPGLAQVLSF ATIGISFLFR PLGAIVAGAL GDRFGRKRVL VATLVMMGLA TCLIGLLPTY
AQIGVAAPVL LIILRILQGF SAGGEWGGAA LMSVEHAPVD KRGFFGAYPQ IGVPCGMILA
TFVVWVITAA IGPEAFLEWG WRIPFLLSFL LIIIGHLIRK SVEESPVFKL MQARKAETSA
PLGRLFREHT REVVLSALIF IANNAAGYLV IAYLATYASR PVEEFGLGMD RGPVLLATTL
ASFGWLISTL YGGILSDKLG RVRTFQLGYV LLAAWSVPMW FMVDTGNIYL YFAGVFIFTL
TLGLSYGPQS ALYAEMFPAE VRYSGVSIGY ALGAILGGAF APMIAELLLT ETGASWSIGV
YIVVACAVSF LGVTLVKEPK GVDLYADGTR PNAVGK
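
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code for elongation and differ from it only in the permitted start codons. The `translate` helper and `CODON_TABLE` are hypothetical names introduced for illustration, and only the first line of the gene sequence is used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
# Assumption: table 11 shares these assignments with the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGCAACTT CCGACTCGAC GGCCCCAGGG CCGGGCGGCG CGATCGAGAC CAGGGAGCGG"
print(translate(first_line))  # prints: MATSDSTAPGPGGAIETRER
```

The printed residues match the first 20 residues of the protein sequence listed above. This gene starts with ATG, so the alternative start codons of table 11 do not come into play here.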