Gene Ndas_1702 details


Gene Information

Locus tag: Ndas_1702
Symbol:
ID: 9245552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2073764
End bp: 2075185
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003679637
Protein GI: 297560663
COG category:
COG ID:
TIGRFAM ID:
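
The start, end, gene length, and protein length values above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_1702.
length = gene_length_bp(2073764, 2075185)
print(length, protein_length_aa(length))  # prints "1422 473"
```

Both results match the Gene Length (1422 bp) and Protein Length (473 aa) reported in the record.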


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.254486
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.472051
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCCGA CAAGCGTCTC CGCGCGTTTT CTCCCCCTGG CGGTCGCCGC GCTGGCGGTC 
ATCGGCTTCG TCTCGGTACT GGTCAGTTCG GTCACGTCGA TGGCGCTCAC CACCCTGGCG
GAGGCTCTTG GGACCTCCAT GAGTTCGATC GTGTGGGTTA CGACGGTCTT CTTGCTGACG
GCAGGACTGG CGCTCCCCTT CGCCGGGTGG GCCGTGGACC GATTCGGCGG CCGCCCGGTG
CTGCTGGTGG GTCTCGCGGT TTTCGCGGCG GGTGCGCTGG GAAGCGGATC GGCGGTGACG
TTCGAACAGC TCATCGCGGC ACGTGCCGTG CAGGGGCTCG GCGGCGGAGT CCTGGAGTCC
GCCTGCCTGG CCTTGATCTC GCAGATCACC GATCGCCGAC GCATCGGCGC GGTCATGGGG
CTGATGTCCA TGGTGATCAA CCTCGCCCCG GCGGTCGGAC CCGTCATCGG CGCCGCTCTG
TTGTCCGCCG CGGGTTGGCG CAGCGTCTTC CTCTTCGCCG TACCGCCCAT CCTCCTGGCT
GGCGTGCTCC TGGCTCTTTC CCTGGGTCAG TGGAGAACGT CCCCCGGCAC CGACTCATCA
GCCCCAGGGG CAGCTCAGGC CCACCGGTTC GACCTTGTCG GCCTGGCCCT GCTCGGTCTC
GGCTTCACCG CAAGCCTGTT CGCCATCGGT CGGCTCTCCG CTGGAACGCC ATGGTCAACG
CTCACCGCGG GTGTACTGGG CGCACTGCTG TTGGTGGTCT ACGTCCGGCG TTCTCTGCGC
GTACCCGCGC CGATCATCGA CCCGCGGCTG TTCACCGACC GTCGCTTCTC CGGAGCCGCC
GCGATCATGG GTATGGGTGG CGTGCTGCTG TTCTCCACAC TCACTCTCGT GCCCCTGCTG
GCCGCACGGT CATGGGATCT CAGCGGGCTC ACCGAGGCCG TGCCACTGGC GGCCTTCGGA
ACGGGGATGC TGGTGTCGAT GAGCGTCGCT GGAGCGCTCT CGGACCGCGT CGGATCCCGA
AGGATCGTCA CCACCGCGGC GGCCTGTTCA GCGGGTTCCC TGGCTCTCCT GGCGGCCTGC
GCGCAACTCC TGCCCTCCCC CGCGTTGGTG TGCACGCTCC TTCTGTGCCT CACCGGCCTC
TCGTTCGGAG CGGTCTCCGC GCCGACTTTC GCCAGCATCT ACCGCATACT CCCGGCAGCG
ATGGCCGGGA GGGGCACCAC GGCCGTCCTG CTCGTCGTTC AGCTCGGCGC GGCACTCGGC
GTCACCGGGA TCGGATCCCT GGTCGGTGCT GTGGGCAGTC GTTCCCACAC AACCGTGCTC
GTGCTGCTCG CGGGGCTCAT GCTCACAGCA GCCGGTGTCG CCGCACTGGC CCTCTCCCGG
GAGGCGAGTG CGCGACACGC CGATGATCGG ACCACGCGGT AG
 
Protein sequence
MMPTSVSARF LPLAVAALAV IGFVSVLVSS VTSMALTTLA EALGTSMSSI VWVTTVFLLT 
AGLALPFAGW AVDRFGGRPV LLVGLAVFAA GALGSGSAVT FEQLIAARAV QGLGGGVLES
ACLALISQIT DRRRIGAVMG LMSMVINLAP AVGPVIGAAL LSAAGWRSVF LFAVPPILLA
GVLLALSLGQ WRTSPGTDSS APGAAQAHRF DLVGLALLGL GFTASLFAIG RLSAGTPWST
LTAGVLGALL LVVYVRRSLR VPAPIIDPRL FTDRRFSGAA AIMGMGGVLL FSTLTLVPLL
AARSWDLSGL TEAVPLAAFG TGMLVSMSVA GALSDRVGSR RIVTTAAACS AGSLALLAAC
AQLLPSPALV CTLLLCLTGL SFGAVSAPTF ASIYRILPAA MAGRGTTAVL LVVQLGAALG
VTGIGSLVGA VGSRSHTTVL VLLAGLMLTA AGVAALALSR EASARHADDR TTR
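
The protein sequence above follows from the gene sequence by codon-by-codon translation, and the GC content is the share of G and C bases in the gene sequence. The sketch below illustrates both, assuming the sequences are pasted in the space-separated 10-base blocks shown in this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; it does not handle alternative start codons, which does not matter here because the gene begins with ATG. The names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from this record.
print(translate("ATGATGCCGA CAAGCGTCTC CGCGCGTTTT"))  # prints "MMPTSVSARF"
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate("ACCACGCGGT AG"))  # prints "TTR"
```

The two printed fragments match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.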