Gene Ndas_1259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1259 
Symbol 
ID9245109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1563277 
End bp1564896 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content75% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679204 
Protein GI297560230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGCG CGAAGCCGAC CCCCGCACGC GCGGGAGGGC GCGAGTGGCT CGGGCTCGGG 
GTGCTGGCCC TGCCCACCCT GCTGCTCTCA CTGGACATGA GCGTGCTCTA CCTGGCGCTG
CCCCACCTGG CCGCCGACCT GCGGCCCTCC GGCAGCCAGC TGCTGTGGAT CATGGACGTC
TACGGCTTCA TGATCGCCGG GTTCCTCATC ACCATGGGCA CCCTCGGCGA CCGCATCGGC
CGCAGGCGCC TGCTCATGAT CGGCGCCGCC GTCTTCGGCC TGGCCTCCGT GGCCGCGGCC
TTCGCGCCCA GCTCCGCCGC GCTCATCGCC ACCCGCGCTC TCATGGGCGT GGCCGGAGCC
ACCCTCATGC CCTCCACCCT GGCCCTGATC AGCAACATGT TCACCGACCC GCGCCAGCGC
GCGGTGGCCA TCTCGGTGTG GACGAGCTGT TTCATGGGCG GCACCGCCAT CGGACCGGTC
GTGGGCGGAC TCCTCCTGGA GTGGTTCTGG TGGGGGTCGG TGTTCCTGCT CGGCGTCCCC
GTCATGCTGC TGCTCCTGGT GTGCGCGCCC CTGCTGCTCC CCGAGCACCG CGCGCCCGAA
CCCGGTCGGC TGGACCCGGT CAGCGTCGCC CTGTCCCTGG CCGCCATCCT GCCGGTCGTC
TACGGCCTCA AGGCCGTTGC CGAGGGCGGG CCGCTCCTCG GACCGCTCGC CTCCCTCGCC
TTCGGGCTGG TCATGGGCGC GGTGTTCGCC CGCCGCCAAC TGCGCCTGCC CGACCCGCTG
CTGGACCTCG CCCTGTTCCG CCAGCCCTCC TTCGGGGTCG CGCTGGGCGT GATGATGGCG
GGCGCGGTCA CCATGGGCGG CACGTTCCTG CTGATCAGCC AGTACCTCCA GATGGTCGCC
GGGCACTCCT CGCTGGTCGC GGGGATGTGG CAGGTGCCCC CGGCGCTGGC GATGATCGCC
GCCACCATGG CCGGAGGGCC GCTGGCCGCG CGCGTGGGCC GGGCCAACGT CATCGGCGGC
GGCATGCTGG TGACCGCGTC GGGGTTCGCC CTGCTGTTCC TGGTCCCCGT CGAGGGCGGA
ACAGCGCTCG TGGTCGCCGG GCTGCTGCTG GCCTCGGTGG GGCTGGGGCC CGGCGCCGCC
CTGGTCACCG ACATCGTGGT GGGCTCCGCG CCCAGGGAGA GGGCCGGGGC CGCCGCGTCG
ATGTCCGAGA CCAGCGGGGA GTTCGGCGTC GCCATGGGCG TGGCCCTGCT GGGCAGCCTG
GCCTCGGCGG TCTACCGCGC CGAGGCCAGC GTTCCCGAGG GCCTGCCCGA GGAGGCGGGC
CAGACCCTGC CCGCCGCCGT GGCCGTGGCC GCGGAGCTGC CCGCGGGCCT GGCCGAGAGC
CTGCTCGGTC CGGCGCGCGA GGCCTTCACC TCCGGCATCA ACCTGGTCGG GGTGATCGGC
TGCCTGGCCA TGGCCGTGTT CGGAGTGGCC GCCGTGGTCC TGCTGCGCCC CGCGCCGTCG
GGGACACCGC CGGAGCGCGA ACCGGCGGAG GCACCGGTGG ACGCCGAGGG GGAGAGCGCC
GCGGACACGC CCGAGGCCGG GGCCGCGAGC GCCCCGGCGC CCGGACCGGC GGGCGGCTGA
 
Protein sequence
MNGAKPTPAR AGGREWLGLG VLALPTLLLS LDMSVLYLAL PHLAADLRPS GSQLLWIMDV 
YGFMIAGFLI TMGTLGDRIG RRRLLMIGAA VFGLASVAAA FAPSSAALIA TRALMGVAGA
TLMPSTLALI SNMFTDPRQR AVAISVWTSC FMGGTAIGPV VGGLLLEWFW WGSVFLLGVP
VMLLLLVCAP LLLPEHRAPE PGRLDPVSVA LSLAAILPVV YGLKAVAEGG PLLGPLASLA
FGLVMGAVFA RRQLRLPDPL LDLALFRQPS FGVALGVMMA GAVTMGGTFL LISQYLQMVA
GHSSLVAGMW QVPPALAMIA ATMAGGPLAA RVGRANVIGG GMLVTASGFA LLFLVPVEGG
TALVVAGLLL ASVGLGPGAA LVTDIVVGSA PRERAGAAAS MSETSGEFGV AMGVALLGSL
ASAVYRAEAS VPEGLPEEAG QTLPAAVAVA AELPAGLAES LLGPAREAFT SGINLVGVIG
CLAMAVFGVA AVVLLRPAPS GTPPEREPAE APVDAEGESA ADTPEAGAAS APAPGPAGG