Gene Ndas_1781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1781 
Symbol 
ID9245631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2180395 
End bp2181606 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content76% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679715 
Protein GI297560741 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.961264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.558818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGGA CCCAGGCGAG GCCGCCCGGG CGGGACACCG GCGCGTCGCG CTCCCGGCTC 
GACGTCGTGC GCTGGCAGGT CGGGTACGGG ATGTTCGGCG TGCCCCAGGC CGCCGCCCCC
ATCGCCTTCG CCCTGCTCGC CCTCCCCATC ACCGGCACGG CCGAGTCCGG CGCGGCTCTG
GTCTTCGCCA TGACGGCCGC GCAGGTGCTC GGCGCCGTCC CCGTGTCCCG TCTGGGCCGC
CGGTTCAACG GCGTCCACTA CCTGCGCGCA CTCATCGCCG TCCGAACGCT CGCGCTCGCC
GCCGTCACCG TGCTGGCGGC GGTGCAGGCC CCCTTCGGGC TGCTCCTGGT CGCGGTCACC
GCGGCGGGAG CCGTCAACGG CGCCGCGTAC GGCTACCAGC GGCTCCTGCT CAACCACCTC
GTGGAACCGT CCGGGCTCCC CCGCGCGCTG GGCGTGGCCG CGACGCTGAA CGAGGTCGGC
TTCGCTCTGT CCCCCGTGCT CGCCTCGGTT CTCGGCGCCG TCTCGCCCGT CTGGGCCATG
GCGGCGGTCA CCGCGCTGGG CGTGGGCCCG CTGCTCCTGA TGCCGCGCGT ACCCGGGGCC
CGCGGGCCGC AGGGCGGAGA GGCTCCCCGC GTGCGGACGC CGGTACCCCC CGCGGTGTTC
CTGTGGCTGT TCTGCGCGGC CGCGAGCGCG GGGGCTGTCG CGGCCGTCGA GGTCGGAGCG
GTCTCCTTCG CGCTGTCCTT CGGACTCGAA CCGGGCTGGG CCTTCCTGTT CGCCCTCGTG
CTGTGCGCGG GCTCGGTCGC GGGCGGGGTC TGGGTGAGCG TGCGCAACCG CACGCCCGCC
CCCTGGCAGG TCGTCGCCTT CCTGGCGGCG ACCACCGCGG GCTCCGGGCT GGTCCTGGTC
GGCGGGCACC TCTCCCTGAC GCTCGCCGGC GCGGCCGTCA TCGGGCTCTT CCTGCCGATG
CTGGGCACGT TCTACTCGCT CGCCCTGGAC GGGCTCGCGC CGCCGGACCG CCGCGCGGAG
ATGTTCGCGC TCCTGCGCAC CGCGAGTTCG CTCGGCATCA TCGCCGTGAG CGGCCTGCTC
GCCCTCCTCG GCCTGCGGGC CGCCCTCGTC GGCAGCTTCG CGCTCCTGCT GGTGGCGTCC
TCCCTCGCGG CGGCGCACCA CGCGCGCTCC CGCGTCGCCG CGGCCCCGCC GACCGCGCCC
GACGGAGTGT GA
 
Protein sequence
MARTQARPPG RDTGASRSRL DVVRWQVGYG MFGVPQAAAP IAFALLALPI TGTAESGAAL 
VFAMTAAQVL GAVPVSRLGR RFNGVHYLRA LIAVRTLALA AVTVLAAVQA PFGLLLVAVT
AAGAVNGAAY GYQRLLLNHL VEPSGLPRAL GVAATLNEVG FALSPVLASV LGAVSPVWAM
AAVTALGVGP LLLMPRVPGA RGPQGGEAPR VRTPVPPAVF LWLFCAAASA GAVAAVEVGA
VSFALSFGLE PGWAFLFALV LCAGSVAGGV WVSVRNRTPA PWQVVAFLAA TTAGSGLVLV
GGHLSLTLAG AAVIGLFLPM LGTFYSLALD GLAPPDRRAE MFALLRTASS LGIIAVSGLL
ALLGLRAALV GSFALLLVAS SLAAAHHARS RVAAAPPTAP DGV