Gene Ndas_1335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1335 
Symbol 
ID9245185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1641000 
End bp1642313 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679273 
Protein GI297560299 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGTCG GCAGCGGCAC CGCCAACGCC GTCGGCCCCC AGATGTACCT GGCCACCATC 
GGCCTGTTCG TACTGCCGAT CGTCGAGGAC ACCGGCTTCA GCCGCACGAC CGTCACCGGG
GCCTTCTCGG TCGCGGCGGT CGGGATGGCG ATCGGCCTGG TCATCGTCGC CCAGCTCGTG
GACCGCTTCG CCGCGCGGTA CATCCTCGTG CCGGGGTTCG TGCTGTTCGC GGCCTCGATG
GCGCTGATCG GGCTGGTGCC GCCGGTCGAG TGGGTCTACC TCGTCCCGTG CTTCTTCGTG
GGCTTCTTCG GGGCGGGGAC GGCCGTGCCC GCCACCAGGG CGGTGGTGAG CTGGTTCGAC
AACAACCGCG CCCTCGCCGT CGGAGTGGTG ACGGGCATCA TCGGCCTGGG AACGGCCCTT
GCCCCCATCC TGGCCGGAGC GCTCATCGAA GGCGTCGGAT GGCGGGGGGC CTACGGCCTC
ATGGCGCTGA TCTCGGTCCT GGTGTCGGTC ACGATGGTCA CCCTGTTCGT GCGCGCCCGC
GCCGAGCGGC ACGTCCGCGG ACGACTCGTC CAGGAGACCC GGGTGGAGGG CCGTGAGGTC
AGCCTCGAAC TCCCCGGCCT GACGGTCGGC GAGGCGGTCC GCACCCGGCA GTTCTGGGCC
ATCGCGCTCG GACTGGGACT GGTAGGGGTC GTCGTCTACG GCCTCCAGGT CCACCTCGTG
CCGATGATGA CCGACCGGGG GCTGAGCGCC GACCAGGCCG CCACCCTGCT GGTCGTCTTC
GGTCTCGCCT CGCTGGTGGG CCGGGTGGCG GGCGGCCTCA TCCTCGACCG GGTGCACGCG
TGCGTCATCG GTCCGATCGT GATGATCGCC CCCATCGCCG GGATGTTCTT CCTGGAGCCG
CCGTTCGGCG GCGCCGTCGT CGCGGTCGCC TTCATCGGCG TCGCCTTCGG CATCGAGGGC
GACCTGCTCG CCCTGCTCAT CACCCGCTAC CTGGGCACGC GCTACTTCGG TCGGATCCTG
GGCCTGGTCC AGGCCGCGTT CCTCCTGGGC AGCGCGCTGG GGCCGCTGCT CCTCGGACTG
GGGTACGACC TGCTGGGCTC CTACGACCCC GTCATGCCCG TCCTGATGGG CGTCCTCGTC
GTCGGCGCGG TCCTCATCGC GACCCTGGGC CGCTACGTCT ACCCCGCCGT CAACGGCTTC
GACCGTCTCG CCGCCCGCGA CGAACTCGCC GCCGCCGAGG TGCTGAGCGA CATCGCCGGG
ACCGGCGACG CCCACGGCTC CCCGGACAGG CCGCGGGCCG AGGCCCACGG CTGA
 
Protein sequence
MLVGSGTANA VGPQMYLATI GLFVLPIVED TGFSRTTVTG AFSVAAVGMA IGLVIVAQLV 
DRFAARYILV PGFVLFAASM ALIGLVPPVE WVYLVPCFFV GFFGAGTAVP ATRAVVSWFD
NNRALAVGVV TGIIGLGTAL APILAGALIE GVGWRGAYGL MALISVLVSV TMVTLFVRAR
AERHVRGRLV QETRVEGREV SLELPGLTVG EAVRTRQFWA IALGLGLVGV VVYGLQVHLV
PMMTDRGLSA DQAATLLVVF GLASLVGRVA GGLILDRVHA CVIGPIVMIA PIAGMFFLEP
PFGGAVVAVA FIGVAFGIEG DLLALLITRY LGTRYFGRIL GLVQAAFLLG SALGPLLLGL
GYDLLGSYDP VMPVLMGVLV VGAVLIATLG RYVYPAVNGF DRLAARDELA AAEVLSDIAG
TGDAHGSPDR PRAEAHG