Gene Ndas_1665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1665 
Symbol 
ID9245515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2034419 
End bp2035735 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content70% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679600 
Protein GI297560626 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.395823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCG AGGTCGCGGA GCAGGTCGAC GAGAAGGACC TGCGGCGGGG TGTGTTCGCC 
GGGGCGGTGG GCGTCTTCGT CCACTGGTTC GACTGGGCCG TCTACGCCTA CCTGGCCACC
ACCATGGCGC AGGTGTTCTT CCCCGAGCAG GACGGCACCA CCGCCCTGCT GTCGGTCTTC
GCGGTCTTCG CCGTGGCGTT CTTCGTGCGT CCGCTCGGCT CGGTGCTCTT CGGCCACCTC
GGCGACCGCT TCGGGCGCAA GACCACGCTG TCGATCGTCA TCATCTCGAT GGCGGCGGGC
ACGCTCATGC TCGGGCTGCT GCCCAGCTAC GAGTCCGTCG GCATCCTCGC GCCGATCCTG
CTGGTGGTCG CCCGCATCAT CCAGGGGCTC GCCGCGGGCG GGGAGTTCGG CTCGGCCGCC
GCCTTCCTCG CGGAGTTCTC GCCGCCCAAG CGCCGCGGGT TCGGGTGCTC CTGGATCGAG
TTCGGCTCGG TCGGCGGGTT CCTGTGCGCG TCGTTCGCCG TGTGGGCCCT GCACGCGTCC
TTCCCGGCCG AGGTCGTCCT CGACTGGGCG TGGCGGATCC CCTTCCTGCT GACGGTGCCC
ATGGCCGCGG TGGGCCTCTA CATCCGCCTG CGCATCGAGG ACACGCCCGA GTACCGCGCG
CTGGAGGACA TGAACAACGT CCCGAGCCAG CCCGTCGTCG AGGTGTTCCG CTCCAACGGC
AGGCAGTTCC TCCAGACGGT CGGCATCGAG ACCTTCATGA ACTCCACCTT CTACATCGTC
CTGGTGTACC TGATCACCTA CCAGGAGGAG ATCGTGGGGG TGCCCGCCGA CCGGGCGGCC
CTGCTCTCCG CGGTGGCCTC GGTCGTCGCC ATGGGGATCA TCCCGCTCTC GGGCAGGATC
TCGGACCGCG TGGGCCGCAA GCCGGTGCTC TACACCGCCG CCGCGCTGCT GATCGCGGCC
TCCGTGCCGC TGTTCTGGCT GATGCAGGTG CAGACCTCGT GGGCGGCGTT CGCCGCGACC
TTCGGCCTCG CCGCGATCCT GGCGGTCATC CTGGGCACCC ACGCGTCCGC CGTGGCGGAG
CTGTTCCCGA CCCGGACCCG GCAGAGCGGG CTGTCGATGG CCTACAGCGT CGCCGGGGCG
TTCTTCGCGG GAACCCTGCC GTACCTGATG ACCTGGCTGA TCTCCCTCAC CGGCAGCAGC
ATGGTCCCCG CCTTCACCAT GGTCGTGATC GGCGTCATCG GCGCGGTCAC ACTGCGCACC
ATGCCCGAGA CCAGCGGCTC CGACCTGCTG CACGAGAGCG ACCGGGCCTC TCGCTGA
 
Protein sequence
MTSEVAEQVD EKDLRRGVFA GAVGVFVHWF DWAVYAYLAT TMAQVFFPEQ DGTTALLSVF 
AVFAVAFFVR PLGSVLFGHL GDRFGRKTTL SIVIISMAAG TLMLGLLPSY ESVGILAPIL
LVVARIIQGL AAGGEFGSAA AFLAEFSPPK RRGFGCSWIE FGSVGGFLCA SFAVWALHAS
FPAEVVLDWA WRIPFLLTVP MAAVGLYIRL RIEDTPEYRA LEDMNNVPSQ PVVEVFRSNG
RQFLQTVGIE TFMNSTFYIV LVYLITYQEE IVGVPADRAA LLSAVASVVA MGIIPLSGRI
SDRVGRKPVL YTAAALLIAA SVPLFWLMQV QTSWAAFAAT FGLAAILAVI LGTHASAVAE
LFPTRTRQSG LSMAYSVAGA FFAGTLPYLM TWLISLTGSS MVPAFTMVVI GVIGAVTLRT
MPETSGSDLL HESDRASR