Gene Ndas_4215 details

Gene Information

Locus tag: Ndas_4215
Symbol:
ID: 9248089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5033047
End bp: 5034636
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: drug resistance transporter, EmrB/QacA subfamily
Protein accession: YP_003682113
Protein GI: 297563139
COG category:
COG ID:
TIGRFAM ID:
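
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5033047, 5034636)
print(length, protein_length(length))  # 1590 529
```

Under these assumptions the result agrees with the listed gene length (1590 bp) and protein length (529 aa).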


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.269985
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAC AGACCGTCCC GAAGGCTGCT CCGGAGACGG AGACGGAGCG GAAGCAGCGC 
TGGACGGAGA GCCCCTGGGC GACCCTGGTC GCCATCGCCT TCGGCGTGAT GATGGTGGCG
CTGGACGGCA CCATCGTGGC CGTCGCCAAC CCCGCGATCG GCACGAGCCT GGGCGCCTCG
CTGGCCGAAC TCCAGTGGGT GACCCACGGC TACCTGCTCG GCCTGGCGGT CTTCCTGATC
ACGGCGGGCA AGATCGGCGA CCGCTTCGGC TACCGCAACA CCTACCTCGT GGGCGCGGTC
GGCTTCGTAC TCAGCTCGGT GGCGATCGCG CTGTCGGCCG GTGTCATCAT GCTCGTCGCC
TTCCGGGTGC TGCAAGGGGT TTTCGGCGCG CTGCTCCTGC CCTCGGCGAT GGGTCTGCTG
CGCGCCAGCT TCCCGCCCAG CAAGCTCGGC CGGGCCTTCG GCGTCTTCGG CAGCCTCATC
GGCGCCGCCA CGGCGGGCGG CCCCATCCTG GGCGGCGTAC TGGTCGGCTC CTTCGGCTGG
GAGTCCGTCT TCTACATCAA CGTCCCGGTG GGCGCGGTCG CCCTCGGACT CGGTCTGTGG
CTGCTGGCCG CCAACAAGCC GACCGACGCC GGAAGCCGGA TGGACGTGCC CGGCATCGTG
CTGCTGAGCA TCGCCATCTT CGCCCTGGTG TGGGCCCTGG TGGAGGCGCC CAGCGTGGGC
TGGGGCCACC CCGTCACACT GGGCTCGCTC GCCGTCACGG CGGTCTTCGC CGTGGCCTTC
CTGGTGTGGG AGCGGCGCCC GGAACAGCCC CTGCTCCCGC TCGCCCTGTT CGCCAACCCC
TCGGTCTCCA TCGGCGCCGT CCTGACGGTC GCGATGGCGC TGAGCCTGAT GGGGTCGCTG
TTCTTCATCA CCTTCTACCT CCAGGGCGTG CGGGGCATGA GCCCCGCCCA GACCGGACTC
CAGCTCATCT CCATGACCGC GCTGATGGCG GTCACCTCGC CGATCGCGGG CCGGGTCCTG
GACCGGGTCG GCGCCCGGCC GCCCACGACG GTGGGCCTGC TGCTGGCGTC GGCGGGCATG
TTCATGCTGT CGCTGCTGCG GACCGACACG GGAGTGCTCT ACATCTCGGC CGCGTTCGTC
CTGCTCGGCA TGGGCCTGAG CCTGATCATG ACCGGTGCCA CGGCGGCCAT CATCGGCAAC
GCGCCCGTGC GCTACGCCGG TGTGGCCTCG GCGGTGCAGC AGGCCGCCAT GCAGCTGGGC
GGCTCGCTCG GCACGGCGGT CCTGGGCGCG GTCATGTCCG CGACCATCGT CGCCACCCTG
CCCGGCCACT TCGCCGCCGC GGGGCTGGCC GAACCCGCCG CCGAGGAGTT GGCGGACATG
CAGAGCGCCG TGGCCCAGGG CGGCGCGGTG CTGCCCGAGG GCGCGACCCC GGAGGTCGTG
GCGGCCGTGA CCGGGGCCAG CAACCTGGCG TTCATGGAAG GGCTGCAACT CGCCTTCACC
ATCGCGGCGG CGGTGATGCT GGTCGCCGCG GTGCTGTCGC TGTTCATGCG CTCGGGCAGG
ATGACCGACG GCCCGGCCGT CCACATCTAG
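
A GC content figure like the one in this record can be recomputed from a sequence printed in this layout. This is a minimal sketch, assuming GC content is the fraction of G and C bases over all bases, with the spaces and line breaks between the 10-base blocks ignored; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence: 6 of 10 bases are G or C.
print(f"{gc_content('ATGACCGCAC'):.0%}")  # 60%
```

Passing the full gene sequence as one multi-line string works the same way, since all whitespace is stripped before counting.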
 
Protein sequence
MTAQTVPKAA PETETERKQR WTESPWATLV AIAFGVMMVA LDGTIVAVAN PAIGTSLGAS 
LAELQWVTHG YLLGLAVFLI TAGKIGDRFG YRNTYLVGAV GFVLSSVAIA LSAGVIMLVA
FRVLQGVFGA LLLPSAMGLL RASFPPSKLG RAFGVFGSLI GAATAGGPIL GGVLVGSFGW
ESVFYINVPV GAVALGLGLW LLAANKPTDA GSRMDVPGIV LLSIAIFALV WALVEAPSVG
WGHPVTLGSL AVTAVFAVAF LVWERRPEQP LLPLALFANP SVSIGAVLTV AMALSLMGSL
FFITFYLQGV RGMSPAQTGL QLISMTALMA VTSPIAGRVL DRVGARPPTT VGLLLASAGM
FMLSLLRTDT GVLYISAAFV LLGMGLSLIM TGATAAIIGN APVRYAGVAS AVQQAAMQLG
GSLGTAVLGA VMSATIVATL PGHFAAAGLA EPAAEELADM QSAVAQGGAV LPEGATPEVV
AAVTGASNLA FMEGLQLAFT IAAAVMLVAA VLSLFMRSGR MTDGPAVHI
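
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first and last lines of the gene sequence. It assumes the sequence is given 5' to 3' on the coding strand and starts in frame, and it relies on table 11 assigning the same residues to codons as the standard code (the tables differ only in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (first base varies slowest).
# Translation table 11 uses these same codon-to-residue assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGACCGCAC AGACCGTCCC GAAGGCTGCT"))  # MTAQTVPKAA
# Last 30 bases, which are in frame because every full line holds 60 bases.
print(translate("ATGACCGACG GCCCGGCCGT CCACATCTAG"))  # MTDGPAVHI
```

Both outputs match the corresponding ends of the listed protein sequence, and the final TAG codon is the stop.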