Gene Ndas_0995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0995 
Symbol 
ID9244841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1216474 
End bp1217658 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content76% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003678945 
Protein GI297559971 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGAAGAG GCCGCGTCAG GGGGGCGTCG TGGCTCTGGC CGCTGTTACT GAGCGTCATG 
TTCACCCACA CCGCGCTCAA CCTCGCCCGT CCCCTGGTCT CGTACCGGAC GATCGCCCTG
GGCGGTGACG CCGTGGCGGT GGGCCTGGTC ACCGCCGCCT ACGCGCTGCT GCCCCTGCTG
GTGGCGGTCC CGCTGGGGCG GGCGACCGAC CGGTCCCGGC GCATGGCGTG GATCGTCGGC
CTGGGCGCGG CGGTGCTCGG CGCGGGCTCG CTGCTGCTGG CGTACGGCAC GGACCTGTTC
GCGATCGCCG CGGCCAGCAC GGTGCTGGGC ATGGGCCACC TGCTGTGCAT GGTCGCGGGT
CAGGGGCTCA TCGCGCGCCT GGCCGCACCG GGGAACCTGG ACCGCGACTT CGGGTGGTTC
ACCGCGGCGG CCTCCCTGGG CCAGCTCGTG GGCCCCCTGC TGTCGGGTGC GATGCTGGCC
GACGCCTCGG GGGACGCGCT GCTGTCGGCG ACCTCCTCCG CCCTGCTCGT CGCCGCGGTG
ACCGGCGGAC TGGGGCTGGT GCCGATCCTG GCCTTCGTCC GGGTTCGCAT GCCCGCGCCC
TCGCGCAAGA AGGGGTCGGA GAGGACCCCC GCGCGGGAAC TGCTGCGCAG GCCGGGCCTG
CCCTCGGGGC TGTTCGCCAG CCTCGCGCTG CTCTCGGCCG TGGACATCCT CACCGCCTAC
CTGCCGCTGG TCGCCGAGAA CCGCGGCATC CCGCCGATGA CGGTCGGCGT CCTGCTGAGC
CTGCGCGCGG GGTTCTCGCT GCTGTCCCGC CTGGTGCTGT CCCGGCTGGT GCGGCGCTGG
TCGCGCGAGA CGCTGATCGC GGTCAGCGCC GGGGCCGCGG GCCTGTCGAT GGCGGCGGTG
GCCCTGCCGG TCAGCAACGT GTTCGTGCTG GGGGCGGTGC TGGCCGTCGG GGGGTTCCTG
CTGGGTCTGG GCCAGCCGCT GACGATGTCG GCGGTGGCCA CGGCAGCCCC GGAGGGCTCG
CGCGGGGCCG CCCTGGCGCT GCGGATCTGG GGCAACCGGC TCGGGCAGGT CGGGATCCCT
GCCGTGGGCG CGGGGGTCGC GGGCGCGGTG GGCGCGCCCG GGGCGCTGTG GTTCGCGGCG
GTGGTGCTCG TGGCGTCGGC GGTCACGGCG GCCAAGCAGA TCTGA
 
Protein sequence
MGRGRVRGAS WLWPLLLSVM FTHTALNLAR PLVSYRTIAL GGDAVAVGLV TAAYALLPLL 
VAVPLGRATD RSRRMAWIVG LGAAVLGAGS LLLAYGTDLF AIAAASTVLG MGHLLCMVAG
QGLIARLAAP GNLDRDFGWF TAAASLGQLV GPLLSGAMLA DASGDALLSA TSSALLVAAV
TGGLGLVPIL AFVRVRMPAP SRKKGSERTP ARELLRRPGL PSGLFASLAL LSAVDILTAY
LPLVAENRGI PPMTVGVLLS LRAGFSLLSR LVLSRLVRRW SRETLIAVSA GAAGLSMAAV
ALPVSNVFVL GAVLAVGGFL LGLGQPLTMS AVATAAPEGS RGAALALRIW GNRLGQVGIP
AVGAGVAGAV GAPGALWFAA VVLVASAVTA AKQI