Gene Ndas_0293 details


Gene Information

Locus tag: Ndas_0293
Symbol:
ID: 9244127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 365604
End bp: 366935
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003678248
Protein GI: 297559274
COG category:
COG ID:
TIGRFAM ID:
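
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(365604, 366935)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Under these assumptions the result agrees with the listed gene length of 1332 bp and protein length of 443 aa.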


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.410279
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGAT CTCGTTCTTC CCCATCCGGG AACGGAACAC CCTCGGAGGC GGCCCCGACC 
GGGGCGGCCG ACGACAGCGG CACCACCGAC ACCACCCCGC CCGCCCTCCC CGGTCTGCGC
GGACGGGGCT TCGCCCTGCT CATCGCGGCG ACCGCGGTCG GCCTCTGCGG CTTCGCCGCC
GTCCTGCCCC TGGTCCCCCT GTGGGCCAGC CGGGGCGGCG CGGGCGAGTT CGGCGCCGGG
AGCACCACCG CCGTCTTCAT GCTCACCACC GTGCTCACCC AGCTCGCCAT GCCCTGGCTG
CTGGAGAGGG GCGGTTACCG CTGGGCCTTC CCGGTCGGCG CCCTGGTGAT GGGGCTGCCC
ACCCCGCTGT TCGCCCTGAC CGCCGACCTG GGACCGCTGC TGGCCGTCTC GGCGGTGCGC
GGAGTCGGCT TCGGCATGGT CAGCGTCGCC GGAACCGTGC TCGCCGCCCG CCTGGTCCCG
CACCACCAGG TCGGCCGGGC CACCGGTTAC TACGGGCTGG CGGTCGGTCT GCCGCAGGTG
CTCATCCTGC CCGGCGGCGT GGCGCTGGCC CTCAACATCG GCTTCGACAC CGCGTTCTGG
CTCACCGGTC TGTGCTCGGT CCTGGGCGGG GTCCTGGCCT GGGGCATCTG GTACGCCGAC
GGCGGCCGCA ACCGTGCGGC GCTGCGGAGC ACGGTCCGGG CCGCCCCCGA ACCCGGCCCC
GCCCCGGACG CGTCCGCGGG CCGTCCGCTG CTGCGGGCGC TGGCGGCGCC ACTGGTGCTC
ATGCTGCTGA CCGCCTCGTC GGCCGGCGCG ATCATCACCT TCCTGGCCAT CCCCCTGGAG
CAGGCGGCGT GGCTGGTGGG CGGCGCCCTG GCCGCCTACG CGCTGGCGGT GGTGGCGGGC
CGCTGGACCG CCGGGATGCT GCACGACCGC CACCGGCGCG CCCTCCTGCT GCTGCCGGGC
ATGGTCGGCG CGGTCGCGGG GATGGCCCTG GTCACCGCCG CGCTGTGGTC GGTCGGGGAC
GCGCCGGGTA CGGGGACGGC GCTGCTGGTG CTCCTGGGGT CGGCCGTGTT CGGCCTCGGC
TTCGGCGCGG TCCAGAACGA GACCGTCACC CTCATGCTGA ACCGCGCCGG TCCGGCCGGG
TACGGCCGGG CCAGCGCGGT GTGGAACATC GGCTACGACG CGGGTTCGGG CGCCGGGGCG
ATGGTCCTGG GGCTGCTGAT CCAGCTGACG GGGTACGGGC CCGCGTTCGC GGTCACGGCC
GTGGCGCTGC TGGCATCGGT CCCGCTCGCC CTCGGCGGGC GCGCGGGGCG CGGGGCCCGC
GCGCGAGGCT GA
 
Protein sequence
MIRSRSSPSG NGTPSEAAPT GAADDSGTTD TTPPALPGLR GRGFALLIAA TAVGLCGFAA 
VLPLVPLWAS RGGAGEFGAG STTAVFMLTT VLTQLAMPWL LERGGYRWAF PVGALVMGLP
TPLFALTADL GPLLAVSAVR GVGFGMVSVA GTVLAARLVP HHQVGRATGY YGLAVGLPQV
LILPGGVALA LNIGFDTAFW LTGLCSVLGG VLAWGIWYAD GGRNRAALRS TVRAAPEPGP
APDASAGRPL LRALAAPLVL MLLTASSAGA IITFLAIPLE QAAWLVGGAL AAYALAVVAG
RWTAGMLHDR HRRALLLLPG MVGAVAGMAL VTAALWSVGD APGTGTALLV LLGSAVFGLG
FGAVQNETVT LMLNRAGPAG YGRASAVWNI GYDAGSGAGA MVLGLLIQLT GYGPAFAVTA
VALLASVPLA LGGRAGRGAR ARG
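
The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before any computation. The sketch below joins the blocks, computes the GC fraction, and translates a CDS up to its first stop codon. It is illustrative only: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. Alternative start codons that table 11 permits are not handled, which is acceptable here only because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as displayed on this page.
first_line = "ATGATCCGAT CTCGTTCTTC CCCATCCGGG AACGGAACAC CCTCGGAGGC GGCCCCGACC"
print(translate(first_line))
print(round(gc_content(first_line), 2))
```

Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared with the GC content field and the protein sequence listed on this page.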