Gene Ndas_0272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0272 
Symbol 
ID9244106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp338942 
End bp340366 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content75% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003678227 
Protein GI297559253 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCCG CGGCCGACTC CGCGGCGTCC GGGGGCACCG GGCCCGGGGG CGCCGCGGTG 
GGAGGGGCCG CTCCGGAGGC CGCCGCCCGG AGCACGGTCG GCGGGCCGGG CGCTCCGCTG
CCCCGTTCCG TGCACGCCTG GTACGGCAGC GGCGCGGTGG CCACGGGGAT CTTCAACACG
GTGCCCGGGC TGCTGCTCCT CATCTACCTG ACCGACACGC TCGCGGTGAG CCCCGCCCTG
GCCGGGGCGG TGGTGCTCCT GCCCAAGGTG GTCGACCTTC TGGTCAGTCC GTATATCGGG
ATCTGGTCGG ATAGGACGCG TTCGCCGTGG GGGCCGCGGC GCCCGTGGAT GCTGGCGGGG
GCGCTGACCC TGCCGGTGCT GTTCGCCGCG ATGTTCGCCG GGCCGCCGCT GAAGGGCGGC
TCCGCCGCGG TGTACGTGGC GGCCGTGTTC GTGGCCGCCG CGCTGGCCTC GTCGGTGTTC
CAGGTGCCGC ACACGGCCAT GCCCGGGGAG ATCACCTCCG ACTACCACGA GCGGTCCACG
TTCAACACGT GGCGGACCGC GTTCGTCGGG CTCGCCCTGA TGCTGGGCGG CGCCTTGGCG
CCGGTCGTCC AGTCGGCCCC CGCGGACCCC GTGGCCGGGT ACCGGCTGAT GGGCCTGCTC
ATGGGCTGCG TGGTGCTGGT GTCGATGCTG GGCTCGGTGG TGGGCACGCG CCGGGCGCCG
CGCCCGGTGT TCGCGCACCG CACGGAGGGT CTGGCGGCCC AGCTGCGGGT GGCCTTCGCC
CACCGGCACT TCCGGGTGCT CTTCCCCGCC AACCTCCTGA TGGCCACGGC GGGCGGCACC
ATGGTCGCGG GCGTGCCGTA CGTGACGGCC AACGTCATGG GCGAGCCGGG CTACACGAGC
GTGCTCATGG TGTGCGTGCT GGTGCCGCTG ATCGCTTCGG CCCCGCTGTG GCGGTGGCTG
TCCCTGCGGG TGGACAAGCG CCGCGCCGCC GGGTACGCGG CGGCGGTGTT CGCCCTGGGC
GGTCTGGGTC TGCTGCTCAT CCCGATCTGG GGCCTCCCCG GCGCCGTGCT GTCCTCCGTG
CTGGTGGGCG TGGGTCTGTC GGGGACGACG CTGCTGCCGT GGTCGATGCT GGCCGACTGC
CTGGCCACCG CCGACGCCTC CGGGCGGCGG CAGGGCGGCG TGCTCTCGGG CGTGTGGACC
GCCGGGGAGG CCATGGCGCA GTCGGTGGGA ACCGGGCTGC TGTCGCTGGC CTTGGCGGTG
AGCGGCTACG TGGAGTCCGG GGCCGGGGAG GCGGTCCGGC AGAGCGACGA GGCGCTGCGC
GGCATGCTGG TCGGCAGCAC GCTGGTGCCC GCCGCGGTGA TGCTGTGCTG CCTGGTGCCG
CTGGCGTTCT ACCGGCTGAC CGCCGAGGAG GCGGGCCCGC GCTGA
 
Protein sequence
MTAAADSAAS GGTGPGGAAV GGAAPEAAAR STVGGPGAPL PRSVHAWYGS GAVATGIFNT 
VPGLLLLIYL TDTLAVSPAL AGAVVLLPKV VDLLVSPYIG IWSDRTRSPW GPRRPWMLAG
ALTLPVLFAA MFAGPPLKGG SAAVYVAAVF VAAALASSVF QVPHTAMPGE ITSDYHERST
FNTWRTAFVG LALMLGGALA PVVQSAPADP VAGYRLMGLL MGCVVLVSML GSVVGTRRAP
RPVFAHRTEG LAAQLRVAFA HRHFRVLFPA NLLMATAGGT MVAGVPYVTA NVMGEPGYTS
VLMVCVLVPL IASAPLWRWL SLRVDKRRAA GYAAAVFALG GLGLLLIPIW GLPGAVLSSV
LVGVGLSGTT LLPWSMLADC LATADASGRR QGGVLSGVWT AGEAMAQSVG TGLLSLALAV
SGYVESGAGE AVRQSDEALR GMLVGSTLVP AAVMLCCLVP LAFYRLTAEE AGPR