Gene Ndas_5018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5018 
Symbol 
ID9248907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp160147 
End bp161415 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content77% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003682905 
Protein GI297563932 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGAGCG TCGGCGGGCT GGTCGCCGCC CAGGCCCTGT GCTACACCGC CACGCGGCTG 
GCGATGATCG CCATCCCCTG GTTCGTCCTG GAGGCCACCG GCAGCCCGGC GTCCATGGGG
GCGGTGGCCT TCTTCGAGAT CGGCTCCTAC ACCCTGGCCC GACTGCTGGG CGGCCCGCTG
CTGGACCGGA TGGGTCAGCG TGCCGTGAGC GTGCGCGCCG ACCTCGTCGC CGCGGCGGCG
GTGGCCTGCG TCCCGCTGCT GCACACGGCC GGGCTGCTGT CCTTCCCCGT GCTCCTGGCG
CTGGTGACCG TGATCGGCCT GGCCACCGGC CCCGCCGAGG CCGCCAAGGT CTCCATGGCC
CCGGCCGTCG CCGAACGCAC GGGGACGCGC CTCGAACGGG TCACCGGACT CACCGGAACC
GTGGACCGGC TGTCCACGAC CGTCGGACCG GTGGCCGCGG GCGGACTGGT GTCCCTGCTC
GGCGCGCTCC CCGCCCTGTA CACGAACGCC GCGCTGCTCG CCGCGGCGGC GGTGGTACTG
GCCGCCACCC AGCCCGGGGA GCGCCCCCGC CCGGGCGGGG ACCCCGAGGC GGACGCGGGC
TACCCGACCA GGCTGCGCAC CGGCTGGCGG GCGGTCTGGG GCGACGCCAC CCTGCGCGTT
TTGGTGGTCG TGCTCGCGGT CACCAACATG ATCGACATGT CCGTGGCCTC GGTACTGCTG
CCGGTGTGGG TGGACGACAA CGGCATGGGT CCCGCGGTGG TCGGCCTCCT GGGCGGCGTG
ATGGGCGCGG CCTCGGTGGT GGGCTCGCTC GCGGCCACGG CGGTCGGGCA CCTGCTGCCC
CGGAGGGCGG TGTTCTTCGC GGGGCTGGTG CTGGCCGGGC CGCCCCGGCT GGTCGTGCTG
GCGCTGGACG TGCCGCTGTG GGCGGTCCTG GCGGTGTGGG GCCTGTGCGG GCTCGCGGGC
GGGGTACTCA ACCCGATCCT GGGCGCGGTG CTGTTCGAGC GCCTGCCCCG CCGAGCCGTG
GGGCGGGGCA CGGCGACGAT CGGCGCGCTG ACCCGGATGG CGGCGCCGTT GGGCGCGCCC
GTCGCGGGCG CGGCGGCCGG ACTGCTCGGA GCGGCGCCGG TGCTACTGGC GTGCGCGGCC
CTCTACCTGG CCGCGGTCCT GCCGCCGCTC GTGGGCCGCG CGGCCGAGGG CATCGACAGG
CACGGGACGG AGGCGCGCCG GTCAGCGGGC GGGGCGGGCG GGGCGGACGC GGGCACGGGC
ACAGGGTAG
 
Protein sequence
MRSVGGLVAA QALCYTATRL AMIAIPWFVL EATGSPASMG AVAFFEIGSY TLARLLGGPL 
LDRMGQRAVS VRADLVAAAA VACVPLLHTA GLLSFPVLLA LVTVIGLATG PAEAAKVSMA
PAVAERTGTR LERVTGLTGT VDRLSTTVGP VAAGGLVSLL GALPALYTNA ALLAAAAVVL
AATQPGERPR PGGDPEADAG YPTRLRTGWR AVWGDATLRV LVVVLAVTNM IDMSVASVLL
PVWVDDNGMG PAVVGLLGGV MGAASVVGSL AATAVGHLLP RRAVFFAGLV LAGPPRLVVL
ALDVPLWAVL AVWGLCGLAG GVLNPILGAV LFERLPRRAV GRGTATIGAL TRMAAPLGAP
VAGAAAGLLG AAPVLLACAA LYLAAVLPPL VGRAAEGIDR HGTEARRSAG GAGGADAGTG
TG