Gene Saro_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0033 
Symbol 
ID3916036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp33831 
End bp35222 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content69% 
IMG OID640442758 
ProductMATE efflux family protein 
Protein accessionYP_495316 
Protein GI87198059 
COG category[V] Defense mechanisms 
COG ID[COG0534] Na+-driven multidrug efflux pump 
TIGRFAM ID[TIGR00797] putative efflux protein, MATE family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0259659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCCGC TCCCCATGTT TGAAAAGCCG CCCGTACACT GGACCGAAGA ACTGCGCGCG 
ATGCTCCGTC TCGCCGCGCC GATGGTCGGG GCGAACCTGC TGCAGATGGC GGTCTTTGCG
GTGGACGTGG TGTTCGTGGC GCGGCTCGGG CCGGTGGCGC TTGCCGCCTC CTCGCTGTCG
GTGTCGCTCT TCGGCCTTCT CGTCTGGAGC CTCTCGGGCC TTGTCGGCGC GGCATCGCCA
CTGATCGCGG CCGAACTCGG TCGGCGCCGG CACGCCGTTC GCGAAGTGCG CAGAACCGTG
CGCATGGCGG CCTGGGCGGG AACCCTGGCT GGCCTGTTCG CCATGGGCGT CTGCCTGCTC
GGCGGTCCGC TGATGCGCGT CACCGGCCAG CAGCCCGAAG TCATCGCGCT TGCCGTCCCG
TTTCTCAACG TCCTGATGTG GGCGGCGATT CCTTCCACGA TCTCGGCGCT GCTGCGCACC
TTCGTGGCGA CGCTCGGCCG TCCGACGATC GGCACGGTCA TCACCGGCAT GGCAGTGGCC
ATCAACGCCT TCGGCAACTG GGTCTTCGTC TTCGGCAACC TCGGCGCGCC GGAAATGGGG
CTCACCGGCT CTGCCTTGTC GAGCATAGTC ACGACCTGCG CGATGGTCCT GGCCTATATC
GTGGTGATCC GCTCCGACCG CCGCCTGCGT CGCTACAGGC TGGCCGGCCG GTGGTGGAAG
CCGGAGTGGA AGCGTTTCGC CGACGTGCTG CGCGTGGGCC TGCCGATCAC CGGCACGATC
CTTGCCGAGG CGGGGATGTT CAACGGCGCC GCCTTCCTGA TGGGACGCAT CGGCGAGGTC
GAGCTTGCCG CCCATACGAT CGCCCTCCAG TTCGCCGCAA TCGCCTTCCA GGTGCCGTTC
GGTGTCGCCC AGGCTGCGAC CATCCGCGTG GGCCTGGCCT TCGGTGCTGG CGAGCGGGCC
GCCATCGCCC GGGCAGGCCG GGTCGCCGTC GTGCTCGGCA TGGGCTTCAT GGTGGTGACC
GCCAGCATCA TGCTCTTCGC CCCGCGCGCG ATCCTGCATC TCTACGTCGA TCCCGACGCG
CCCGAAAACC GCGCAATGGC CGTCCTCGCG GTGCAGTACA TGGCGGTTGC CGCAGCGTTC
CAGTTGTTCG ACGGCGCGCA GGCCGTGGGC GCTGCCCTGC TGCGGGGCCT GCAGGACACG
CGTATTCCCA TGGCCTTCGC CCTGTTCGGC TACTGGCTGC CCGGGCTGGG GACGGCCGTC
GGCCTCGGGC TCTTCTCTCC GCTCGGCGGG CTGGGCGTAT GGATCGGCCT CATGGTCGGC
CTCGTCGTCG TGGCCAGCCT CATGCTATGG CGTTGGCGGA ACCGTGCGCG ACTGGGCCTC
CTCCCGGCCT GA
 
Protein sequence
MAPLPMFEKP PVHWTEELRA MLRLAAPMVG ANLLQMAVFA VDVVFVARLG PVALAASSLS 
VSLFGLLVWS LSGLVGAASP LIAAELGRRR HAVREVRRTV RMAAWAGTLA GLFAMGVCLL
GGPLMRVTGQ QPEVIALAVP FLNVLMWAAI PSTISALLRT FVATLGRPTI GTVITGMAVA
INAFGNWVFV FGNLGAPEMG LTGSALSSIV TTCAMVLAYI VVIRSDRRLR RYRLAGRWWK
PEWKRFADVL RVGLPITGTI LAEAGMFNGA AFLMGRIGEV ELAAHTIALQ FAAIAFQVPF
GVAQAATIRV GLAFGAGERA AIARAGRVAV VLGMGFMVVT ASIMLFAPRA ILHLYVDPDA
PENRAMAVLA VQYMAVAAAF QLFDGAQAVG AALLRGLQDT RIPMAFALFG YWLPGLGTAV
GLGLFSPLGG LGVWIGLMVG LVVVASLMLW RWRNRARLGL LPA