Gene Ndas_3675 details


Gene Information

Locus tag: Ndas_3675
Symbol:
ID: 9247544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4411099
End bp: 4412697
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: antibiotic ABC transporter efflux pump
Protein accession: YP_003681579
Protein GI: 297562605
COG category:
COG ID:
TIGRFAM ID:
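
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not count toward the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4411099
end_bp = 4412697

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1599 bp) and Protein Length (532 aa).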


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCGT TCACCGGCAC CGGCGCCCTC ATCCGGTTCG TCCTGCGCCG CGACCGCCTC 
CGGCTGACCG TCTGGACCCT GGCCCTGGTC GGCACCGTCG CCGCGACCGT TCCGACGCTC
GACGACATGT TCTCCACCGA CGCCCAGCGC CAGGCCCGGG CGGCTCTCAT GGAGACCCCG
ACCGGCGTCC TCTTCGGCGG CCCCGGATAC GGGCTGGACG ACTACCAGCT CGGCCCGATG
GTGGTCAACG AGCTGACCAT GAGCGTGCTC ATCGCCCTGG CGGTCATGAG CGTCCTGCAC
GTGGTCCGGC ACACCCGCGC CGAGGAGGAG AGCGGCCGCG CCGAACTGCT GCGCGCCAGC
GTCCTGGGCA CGAGCGCCCA GATGACCGCC GCGCTGGTGA CCATCTCGGT CGTCAACCTG
CTCATCGGCG GCCTGGCCGC GCTGACCATG GCGGGCAACG GCCTGGCGGT CGCCGACTCC
GCGGCCTACG GGCTGGGGCT GGCCCTGGCC GGGATCTCCT TCGGCGCCGT CGCCGCCGTG
TGCGCGCAGG TCACCGAGCA CGGACGCGCC GCCGCGGGCC TGGCCTTCCT GGTCACCGGC
GTGCTGTTCC TGTCCCGGGT CGTCGGCGAC ATGGCCGAGG AGGGCGGCAA CGCGCTGTCC
TGGCTGTCGC CGTTCGCCTG GGTCCAGCAG ACACGCGTGT TCGACGACCT GCGCTGGTGG
CCGCTGGCGC TGTACGCGGT CCTCGTGGCG GCGCTCTTCG CGCTGGCCTA CGCCCTGGCG
GACCGGCGCG ACCTCGGCGC CGGCCTGGTC CCCTCCCGGC CCGGTCCGGC CGGTGCGGGC
GGGCTGCTCA ACGGCGTGTT CGCGCTGCAC CTGCACCAGC AGCGCGGGGC GATCCTCGCC
TGGGCGGCCG CGGTCTTCCT CTTCGCCCTC GCGTTCGGCT CGCTGGCCAC GGAGGTGGAG
GGGATGCTGG AGGAGAACCC CGACCTGCTG GCCGTCCTCG GCGACTCCGC CGACGACGTC
ACCGGGGGCT TCCTCGGCAC CATGAGCGGT TACGTGCTGA TGGCCGCGTC GGCCTACGCC
GTCATGTCCG TGCTGCGGGC GCGGGGCGAG GAGACCTCCG GGCGCGCCGA GCTGACCCTG
TCCGCGGCCG TGGGCAGGGT GCGCTGGTTC GGCGGCGCCC TGCTGGTCAG CGTGCTGTCC
TCGGCGGTGA TCGTGGTGGC GGGCGGCGTC GGCATGGGCC TGAGCGCGTC GGCGGCTCTG
GAGGACCCCT CCTGGACGTG GACGATGACC GAGGCCGCCC TGGCCCAGCT GCCGGTGGCC
CTGCTGTTCG CCGCTCTGAC CGCGCTGCTG GTGGGAACCG CCCCGCGCCT GACCCCGCTG
GTGTGGGCGT GGCTGGGCTA CAGCCTCCTG GTCTCGCTGC TCGGACCGAT GCTGGGCCTG
GACGACCGGC TGCTGGACCT GAGCGCGTTC GGGCTGCTGC CCCAACTGCC CGCCGACGAT
TTCGACGCCG CCCCGGTGGC CGTCGCGCTG GGGGCGGCGC TCGCGGCCAA CGCGGTCGCC
CTGGCGGGCT TTCGCCGCCG GGACCTGGCC AGCGTCTGA
 
Protein sequence
MNAFTGTGAL IRFVLRRDRL RLTVWTLALV GTVAATVPTL DDMFSTDAQR QARAALMETP 
TGVLFGGPGY GLDDYQLGPM VVNELTMSVL IALAVMSVLH VVRHTRAEEE SGRAELLRAS
VLGTSAQMTA ALVTISVVNL LIGGLAALTM AGNGLAVADS AAYGLGLALA GISFGAVAAV
CAQVTEHGRA AAGLAFLVTG VLFLSRVVGD MAEEGGNALS WLSPFAWVQQ TRVFDDLRWW
PLALYAVLVA ALFALAYALA DRRDLGAGLV PSRPGPAGAG GLLNGVFALH LHQQRGAILA
WAAAVFLFAL AFGSLATEVE GMLEENPDLL AVLGDSADDV TGGFLGTMSG YVLMAASAYA
VMSVLRARGE ETSGRAELTL SAAVGRVRWF GGALLVSVLS SAVIVVAGGV GMGLSASAAL
EDPSWTWTMT EAALAQLPVA LLFAALTALL VGTAPRLTPL VWAWLGYSLL VSLLGPMLGL
DDRLLDLSAF GLLPQLPADD FDAAPVAVAL GAALAANAVA LAGFRRRDLA SV
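
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a hedged sketch rather than the method the database itself uses: it builds the codon table by hand, strips the spacing used in the 10-base blocks, and translates until the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits. The sketch ignores alternative start codons, which is harmless here because this gene begins with ATG. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
# Standard codon-to-amino-acid assignments, in TCAG order.
# Table 11 uses the same assignments; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAACGCGT TCACCGGCAC CGGCGCCCTC"
print(translate(first_blocks))
print(f"GC fraction of this fragment: {gc_fraction(first_blocks):.2f}")
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow a comparison with the listed protein sequence and GC content.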