Gene Ndas_1713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1713 
Symbol 
ID9245563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2084658 
End bp2085956 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content76% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679648 
Protein GI297560674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.300423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.77847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCAT GTGCGTACTT GCGCACGTAT ATGGTGCGCC TGGTCGTGCT CTTCCTTCTT 
CTCCGGGTGG TGGTCGTGGT GTGGCGGGTG CTGACGGACT TCGCTTCCTA CCGCAGGTTG
TTCTACGCCC AGGTCGTGGC GCTGGCGGGC ACCGGGCTGG CCACGGTCGC CCTGGCGCTG
CTGGCCTACG AACTGGTTCC GGGCCGGGCG GGGCAGGTGG TGGCCACCGC CCTGACCGTC
AAGATGCTCG CCTACGTGGG CGCGGGGCCG CTGCTGACCG CGGCCCTGGT CCGGGCCCCG
CGCAAGGCGG TGCTGGTGGG CTCGGACGCG GTGCGCGCCC TGGCGGTGGC CTGTCTGCCG
TTCGTGGACC AGGTCTGGCA GGTGTACGCG CTCATCGCCG TGCTCCAGTG CGCCTCGGCG
ACCTTCACCC CCACCTTCCA GGCGGTCATC CCCGAACTGG TGGACGACCG CGGCTACACC
TCGGCGCTGG CCCTGTCGCG GCTGGCCTAT GACCTGGAGG CGCTGGCCTC GCCGCTGCTG
GCCGCTCTCG TGCTGCTCGC GATCCCCTTC GACGGCCTGT TCGCGCTGAC CGCGCTCGGC
TTCGCCGCCT CCGCCGCCCT GGTGGCCGCC ACGGCGCTGC CCGCCCCGAC GCCCGGGTCG
CGGAGTCCCC GCCGGGCCGC GGCGAGCTGG CGGGCCTTCG GCACCGACCG GCGCCTGCTG
GCGCTGACGG CGTTGAACAC GGCGGTGGCC GTGGTCACCG CCCTGGTCCT GGTCGACACC
GTCGTCCTCG TCCGCTCCCA CCTGGGCGGC GGCGACACCG CCGTGGCGTT GGTGCTGGCC
TGCTTCGGCG GCGGTTCCAT GGCGGTGGCG CTGGCCCTGG GGGCGCTGGT GGAGCGGTTC
GGCACGCGCG CGCTCATGCT CACCGGTCCC GGCGTCCTCA CGGCGGGAGC GGCGGCCCTG
GCGCCGGGCT GGGCCCTGGC GCCGGGACCG CTGGTGCTCG GCGCGGGATG GGCGGTGCTG
GGCGCGGGGT GCGCGCTGGT GTCCGCGCCC ACGGGGCGGC TGCTGCGCGA GGCCGCTCCC
GAGGGCGCGT TGGCGGGGGT CTTCGCCGCC CAGTTCTCGC TCTCCCACGC CTGCTTCCTG
CTCACCTACC CGCTGGCCGG GTGGGCCGGG GGCCTGGAAC CGGTCCTGGT GCTGGGCGGC
GCCGGGGTGC TCACGGGCGC GTGCGCCCTG GCGGCGGCGG GCCTGTGGCG CCCCGGCGCG
GTCACCGCGG ACCCCTCCCC CGCGGCCGGG GACCGCTGA
 
Protein sequence
MTSCAYLRTY MVRLVVLFLL LRVVVVVWRV LTDFASYRRL FYAQVVALAG TGLATVALAL 
LAYELVPGRA GQVVATALTV KMLAYVGAGP LLTAALVRAP RKAVLVGSDA VRALAVACLP
FVDQVWQVYA LIAVLQCASA TFTPTFQAVI PELVDDRGYT SALALSRLAY DLEALASPLL
AALVLLAIPF DGLFALTALG FAASAALVAA TALPAPTPGS RSPRRAAASW RAFGTDRRLL
ALTALNTAVA VVTALVLVDT VVLVRSHLGG GDTAVALVLA CFGGGSMAVA LALGALVERF
GTRALMLTGP GVLTAGAAAL APGWALAPGP LVLGAGWAVL GAGCALVSAP TGRLLREAAP
EGALAGVFAA QFSLSHACFL LTYPLAGWAG GLEPVLVLGG AGVLTGACAL AAAGLWRPGA
VTADPSPAAG DR