Gene Ndas_1521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1521 
Symbol 
ID9245371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1865081 
End bp1866382 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content76% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679456 
Protein GI297560482 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.542357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGACAC ACACCGCTCA GCCAGGGGAC GCCCTCACCG AGTCCGAACG GGCCCGGCTC 
CAGCGGCGCA CCGTCCTGGT CCTCATGCTC GCCCAGGTGG TCGGGGGCGT GGGCATGGGC
GCGATGATCG CGGTGGGCGC GCTCATCGCC CTCGAACTCA CCGGCTCCGA CACCTGGTCC
GGCATGGCCA CCACCATGAT CACCCTGGGC GCCGCGGTGT TCGCCCTGCC CCTGGCCTCG
CTGGCAGCCC GGCGCGGCCG CCGACCCGGC CTCGCGCTGG GCTGGATGCT CGGCGCGCTC
GGCGGCACCG TCGTCATCGC CGCCACGGTG CTGGAGGCGT TCGCGCTCTT CCTCGTCGGC
ATGGTTCTGG TCGGCGCCGG GACCGCCACC AACCTCCAGG CGCGCCACGC CGCCGCCGAC
CTGGCCTCCG AGCGCAGCCG CGGCCGGGAC CTGTCGATCG TGGTCTGGGC GACCACCGTC
GGTTCGGTGG CGGGCCCCAA CCTCACCGGC CCCGGCGCGC GCGTGGCCGA CCTGTTCGGC
CTGCCCGCCC TGCTGGGCCC GGTGCTGTTC ACCACCACCG GGTTCGTCCT GGGCGGACTG
CTGATCCTCG CCCTGCTGCG CCCGGACCCG CTCCGCGCGG CCACCGCGGA CCGCCCCTCC
GGCGGCGCGG CCCCCTCCGG AGCCCGGCTG TCCGTGGCCG GGGCGCTGCG GGTGGTCGCG
GCCGGCCCGG GGGCGCTGCT CGCGGTGGTC GGCATCGTGG CCAGCCACAC CGTGATGGTC
GCGGTCATGA CCATGACGCC GGTGCACATG TCGCACCACG GAGCGGCGCT GACCGTGATC
GGCCTGACCA TCTCCCTGCA CATCGCCGGC ATGTACGCCC TGTCGCCGGT GGTGGGGTGG
CTCACCGACC GGTTCGGGCG GGTGCCCGTC CTGCTGGCCG GTCAGGGGAT CCTGATCGCC
GCCGCGGTGG TGGCGGGCAC GGCCGGGCAC GATGAGGCCA GGGTGACGGT CGGTCTGGTG
CTGCTGGGGC TGGGGTGGTC GTTCGGGCTG GTGTCCGGGA CGGCGCTGCT GGCGGAGTCC
CTGGCGGCGG ACGTGCGCCC CCGCGTGCAG GGTGTGAGCG ACCTGGTGAT GAACCTGGGC
GGCGCGGCGG CGGGCGCCCT GTCGGGTGTG GTGCTGGCTC AGGCGGGCTT CGGCGGCCTC
AACCTGTTCG CGGCCGTGTT CACCGTGCCG GTGTTCGTGC TCGCCGTGCG CGCCCGGCTG
GCCCGGGCGG ACCGGCGGGA GCGGACCGGG GAAGCGCGCT GA
 
Protein sequence
MSTHTAQPGD ALTESERARL QRRTVLVLML AQVVGGVGMG AMIAVGALIA LELTGSDTWS 
GMATTMITLG AAVFALPLAS LAARRGRRPG LALGWMLGAL GGTVVIAATV LEAFALFLVG
MVLVGAGTAT NLQARHAAAD LASERSRGRD LSIVVWATTV GSVAGPNLTG PGARVADLFG
LPALLGPVLF TTTGFVLGGL LILALLRPDP LRAATADRPS GGAAPSGARL SVAGALRVVA
AGPGALLAVV GIVASHTVMV AVMTMTPVHM SHHGAALTVI GLTISLHIAG MYALSPVVGW
LTDRFGRVPV LLAGQGILIA AAVVAGTAGH DEARVTVGLV LLGLGWSFGL VSGTALLAES
LAADVRPRVQ GVSDLVMNLG GAAAGALSGV VLAQAGFGGL NLFAAVFTVP VFVLAVRARL
ARADRRERTG EAR