Gene Ndas_2473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2473 
Symbol 
ID9246323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2935318 
End bp2936520 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content74% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003680399 
Protein GI297561425 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.852555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.95033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCTGG CCCTGCTCGC CCTCGCCGTC GGCGCGTTCG CCATCGGCAC CACCGAGTTC 
GTGATCATGG GACTGCTGCC CGAGGTCGCC GCCGACCTCG GCGTACCGGT TCCGACCGCC
GGATACCTCA TCTCCGCCTA CGCCCTGGGC GTGGTGGTCG GCGCACCGCT GCTCACCGCG
CTGAGCACGC GGCTGCCCCG CAAGACGGCG CTGCTGGTCT TCATGGGGCT CTTCCTGGTG
GGCAACCTCG CCACCGTCCT GGCCCCGTCC TTCGGCGCCG TGTTCGCCTC GCGGGTCCTG
GCCGGACTGC CCCACGGCGC CTACCTGGGA GTGGGCTCGC TGGTCGCGGC GCACCTGGCC
GGTCCGGGGC GGCGGGCCAC GGGCGTCTCG CGGATGTTCC TCGGCCTGAC CGTGGCCAAC
ATCGTCGGCG TGCCCGCGGG CACCTTCCTG GGCCAGGCGA TGGGCTGGCA CGCGGCGTTC
CTGGTCGTGG CCGGGATCGC CCTGCTCGCC CTGCTCGGCG TCGCGGTGTT CGTGCCGCAC
CAGCCCGCGG CCTCCGGCAC GGGCCTGCGC CACGAGATGC GCGAACTGGG CCGCTCCCAG
GTCGTCCTGG GCCTGGTCAC GGCGGTCTTC GGGTTCGCCG GGGTGTTCGC CGTCTACAGC
TACGTCTCGC CGATGATGAC GGAGCTGGCG GGCATGTCCG CGTCCGGGGT GCCGATCGTC
CTGGCCCTGT TCGGCACCGG CATGACCCTG GGTTCGCTGA TCGTGGGACC CCTGGCCGAC
CGGGCGCTGC GCCCGACGAT CTACGGGTCG CTGGCCGCGC TTGCGGTGGT GCTGGTGGTG
TTCACGGTGA CCGTGCACAA CCCCTGGACG GCCGCGGTGA CGGTGGTGGT CCTGGGCGCG
GTGGGGTTCG GCGTGGCCAC CCCGCTCCAG GTCCTGGTGA TGAACAAGGC CGCCGAGGCG
CCGACGATGG CGGCGGCCTC CAACCACTCG GCGTTCAACC TGGCCAACGC CGCCGGGGCG
TGGCTGGGCG GCATCGGGAT CAGCGCGGGC CTGGGGTACA CCTCCCCGGC CGCGATCGGG
GCGGTGCTGG CCGTGGTGGG CCTGGGCATC GCGGTGTTCG CGGGCGTCCT GGACCGCCCG
GGGCGGCGGC GCGACCGCGA AGGCCGGACG GTGGCGGCCC CGGAGCCCAG CGCTCTGCGC
TGA
 
Protein sequence
MPLALLALAV GAFAIGTTEF VIMGLLPEVA ADLGVPVPTA GYLISAYALG VVVGAPLLTA 
LSTRLPRKTA LLVFMGLFLV GNLATVLAPS FGAVFASRVL AGLPHGAYLG VGSLVAAHLA
GPGRRATGVS RMFLGLTVAN IVGVPAGTFL GQAMGWHAAF LVVAGIALLA LLGVAVFVPH
QPAASGTGLR HEMRELGRSQ VVLGLVTAVF GFAGVFAVYS YVSPMMTELA GMSASGVPIV
LALFGTGMTL GSLIVGPLAD RALRPTIYGS LAALAVVLVV FTVTVHNPWT AAVTVVVLGA
VGFGVATPLQ VLVMNKAAEA PTMAAASNHS AFNLANAAGA WLGGIGISAG LGYTSPAAIG
AVLAVVGLGI AVFAGVLDRP GRRRDREGRT VAAPEPSALR