Gene Ndas_2088 details


Gene Information

Locus tag: Ndas_2088
Symbol:
ID: 9245938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2508674
End bp: 2510152
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003680020
Protein GI: 297561046
COG category:
COG ID:
TIGRFAM ID:
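
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of this database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_2088.
length = gene_length_bp(2508674, 2510152)
print(length)                     # 1479
print(protein_length_aa(length))  # 492
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.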


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGAACTCCT TCCGAACTCC CGACGCGCGC TCAGGCCGCC GGGAGCGCCC CGCCGCTGCC 
CGCCGCTGGT GGGCGCTCGC CGCGGTGTCG ATGCTCCAGT TCCTCATCGC CGTGGACGTC
ACGGTCGTCA ACATCGCCCT GCCCTCCATC GGCGCCGACC TGGGCGCCGG ACCGCAGGGC
CTGACCTGGG TCGTCGCCGG TTACACGCTG GTCGGCGGCG GCCTGCTGCT GCTCGGCGGG
CGCGTCTGCG ACCTGCTGGG GCGGCGGCGC ATGCTGCTCC TGGGCGCCGC CCTGTTCGGC
GCTGCCTCGC TCGTCGCGGG CGCCGCGCCG GACCTGCCCT GGCTCCTGGC CGCCCGCTTC
GGGCAGGGGG CCGGTGAGGC GCTCGCCTCC CCGGCGGCGA TGTCGCTGAT CGCCGTGCTC
TTCCCCGACC CCCAGGAACG CGCCAGGGCC CTCGGCGTCT GGGGCGCCAT CTCCAGCTCC
GGCCTGGTGG GCGGGGTGCT GCTGTCCGGC GTGATCACCG AGCTGCTGCA CTGGCGGTGG
ATCTTCCTCG TCAACCTCCC CGTCGTCGCC GTGGTCCTGG TCGCGGTCCC GCTGCTGGTC
GCGCCCGACC GCGTCCGCGG CGCGGTGCCG GGTGCCCAGG CCCCGGTCGC TCCGGCCGCC
GAGACCGGCG CCGCACGGCG CCGCCCCGAC GTCCTGGGCG GACTGCTGCT GACGGCCGGT
CCCCTGGCCC TGGTGCACGG GGTGCTCCAG GCGGCGGAAC ACTCCTGGAC GGAGCCGCGT
GTGTACGTTC CCGTGCTCGC GGGTGCCGCG GCGATGGGGC TGTTCGTCCT GGTCGAGGCC
AGGTCGCGCA ACCCGCTGGT ACCGCTGTCC TTCTTCGCCG ACCGCACCAG GCTGGTCGCC
AACGGGGCGA CGGTGCTGCT CAGCGCGGCG CTGTCGACCT CCTTCCTGCT GGTGACGCTC
TACCTCCAGC ACGTGCTGGG GATGAGCCCC ATGCAGGCGG GGCTGGCGTA CGTGCCGTTC
TGCGCGGCGA TCCTGGTCTC GGCGACCGTG GTCGGCCGGA TCATCGGAGC CCTGGGCCTG
TCCGGGGCGG CGATCGCGGG CCTGCTCTCG GCCGCGGCCG GGACCGCCTG GTTGTCGCGG
CTGCCGGTGG ACGGCGGTCT GTGGGCCGAC GCCATGCCCG GCATGCTCCT GGTCGGTGTC
GGCATGGGCG TCGGCCTGAT CGCCCTGCAG AACGCCGCCC TGCACGGGGT GACCGAGGCC
GACGCGGGGG TCGCCTCCGG CGTCCAGCGC TCGGTCGACC AGCTCGGCGG GGCCACCGGC
ATGGCCGTCC TCGTCGGCGC GGCCGTCGCC GCGGGCGCCT CCGGTGACGT CGCCGCCTCC
GTCGAGGGCT ACCGCACCGC CTTCTCCTGG GCGACCGTCG GCGTCCTGCT CGCCGCCGCC
GGTGTGGCCC TGTCCGCCTG GCGGTCCGGC CGCGCCTGA
 
Protein sequence
MNSFRTPDAR SGRRERPAAA RRWWALAAVS MLQFLIAVDV TVVNIALPSI GADLGAGPQG 
LTWVVAGYTL VGGGLLLLGG RVCDLLGRRR MLLLGAALFG AASLVAGAAP DLPWLLAARF
GQGAGEALAS PAAMSLIAVL FPDPQERARA LGVWGAISSS GLVGGVLLSG VITELLHWRW
IFLVNLPVVA VVLVAVPLLV APDRVRGAVP GAQAPVAPAA ETGAARRRPD VLGGLLLTAG
PLALVHGVLQ AAEHSWTEPR VYVPVLAGAA AMGLFVLVEA RSRNPLVPLS FFADRTRLVA
NGATVLLSAA LSTSFLLVTL YLQHVLGMSP MQAGLAYVPF CAAILVSATV VGRIIGALGL
SGAAIAGLLS AAAGTAWLSR LPVDGGLWAD AMPGMLLVGV GMGVGLIALQ NAALHGVTEA
DAGVASGVQR SVDQLGGATG MAVLVGAAVA AGASGDVAAS VEGYRTAFSW ATVGVLLAAA
GVALSAWRSG RA
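
Both sequences are printed in blocks of 10 characters, 6 blocks per line, so the whitespace has to be removed before the sequences are used. The sketch below shows one way to do that and to compute a GC fraction from the joined gene sequence; the helper names are hypothetical, and how the database itself derives the GC content field is not stated in this record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGAACTCCT TCCGAACTCC CGACGCGCGC TCAGGCCGCC GGGAGCGCCC CGCCGCTGCC"
seq = join_blocks(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Applied to the complete gene sequence, join_blocks should return 1479 characters, and to the protein sequence 492, matching the length fields in the Gene Information section.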