Gene Saro_1219 details

Gene Information

Locus tag: Saro_1219
Symbol:
ID: 3916517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1268657
End bp: 1271791
Gene Length: 3135 bp
Protein Length: 1044 aa
Translation table: 11
GC content: 64%
IMG OID: 640443956
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_496498
Protein GI: 87199241
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
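
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1268657
end_bp = 1271791

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 3135
print(protein_length)  # 1044
```

Both results agree with the reported Gene Length (3135 bp) and Protein Length (1044 aa).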


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.164773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTCGC GCTTCTTCGC CCACCGCCCG ATTTTCGCCT GGGTTCTTGC CATCGTCATC 
ATGGCGGCGG GCCTGTTCGC CGTCAGCACC ATGGGCGTGG AACAGTACCC CGACATCGCC
CCGCCCACCG TCTCGATCAA TGCGACTTAT GGCGGCGCCG ATGCCTCTAC CGTCGAGAAC
AGCGTCACCC AGGTTCTCGA GCAGCAACTC AAGGGCCTCG ACGGCCTGCT CTATTTCAAT
TCGAACTCGT CCAACGGGTC CTCCCAGATC ACCGTCACCT TCGACAAGGG CACCGATCCC
GATACCGCCC AGGTCCAGGT CCAGAACGCC ATCTCGCGTG CGGTCAGCCG GCTTCCGGCG
GTGGTCCAGC AGCAGGGCGT GAACGTCAAC AAGTCGCAAT CCGACATGCT GATGGTGGTG
TCGATCTATG ACAAGACCGG GCGCACGACC AACGCGGATA TCTCGGATTA CCTGTCGACC
CACTTCCAGG ACCCCATCTC GCGCGTCGAA GGCGTGGGCA ACGCGCAGAT ATTTGGCGCG
TCTTATGCGA TGCGCATCTG GCTCGATCCG CTGCGGCTTG CCGCGGTCCA GCTCATGCCT
TCGGACGTTG TAACCGCACT CCAGGCCCAG AACACGCAAG TCGCTGCGGG CGAGATCGGC
GCGAACCCCG CGCCCGACGG GCAGCGCCTC AACGCCACGG TCACCGCGCG CTCGCGCCTC
CAGACGCCCG AGGAATTCGA GAACATCGTC GTCAAGACGC AGGTCGATGG CTCGGTCGTG
CGCCTGCGCG ACGTTGCCCG GGTGGAGATG GGCGAGGAAA GCTACGGTTC GATCAGCCGT
TTCAATGGCA TGCCGGCAAC CGGCCTGTCG GTCAGCCTCG CTTCGGGCGC GAACGCGATG
AAGACCGCGC AGCTGGTCAA GGATACCGTG GCCGAGCTGT CGAAGGATCT GCCCGCCGGA
TACGAAGTGG CCTATCCACG CGACAGCAGC ACTTTCGTCA AGCTATCCAT CGAAGAGGTC
GTCTGGTCGC TTGGCGAGGC GATCGTGCTG GTCGTGATCG TGATGTTCGT GTTCCTGCAA
AGCTGGCGCG CGACGCTGAT CCCGGCAATC GCGGTTCCGG TCGTCCTGCT TGGTACTTTC
GGCGTGCTGT CGCTTGCCGG CTATACGATC AACACGCTGA CGATGTTCGG CATGGTCCTT
GCCATCGGTC TGCTGGTCGA CGACGCCATC GTCGTCGTCG AAAACGTCGA GCGCGTGATG
CACGAAGAAG GGCTGTCCCC GCTCGACGCC ACGGTCAAGT CGATGGACGA GATCACATCG
GCGCTCATCG GCATCACGCT GGTGCTGACG GCGGTATTCG TGCCGATGGC GTTTTTCGGC
GGCTCGACCG GCGCGATCTA TCGCCAGTTC TCGGTCACGA TCGTTTCGGC CATGACACTG
TCGGTCCTCG TCGCGCTGGT CTTCACGCCC GCGCTTTGCG CCACGCTGCT CAAGCCGGTT
GCGGAACATG AAAAGCCTGG CGGTCGCTTC TTCCGCAGGT TCAACGATGG CTATGCCAAT
CTCGAGGGGC GTTACCGCAC CCGCCTTGCC CGGGTCGTCG GGCGGCCGCT GCCGTGGATG
ATGGTCTTTG CGGGGATCAC GCTGGGCATG GTCCTGCTCT ACGTGCGGCT GCCGACCAGC
TTCCTGCCCG CTGAAGACCA GGGCAACCTC TCGATCAGCT ACCAGCTTCC GGTCGGCTCC
ACCTCGGAAC AGACCCGCGC CGTAGCCAAC CAGCTTTCGG ACTATATCCG CACTTCGGAA
AAGGAGGACG TGAAAGCCGT GTTCGTGGTG ATGGGCCGGG GCCAGGCGGG TTCGGCGCAG
AATGCCGGGC AAGGCTTCAT CCTGCTCAAG GACTGGTCGG AACGTTCCGG CAAGGAACAC
AGCGCGGCCG CCATCGCCGA GCGGCTTAAT GCCTATTTCC GCAAGTCCCG GGATGCGCAG
ATCTTCGTGC TCGATCCACC GCCGATCCGT GGCCTTGGCA GTTCCGGCGG TTTCGAGATG
TGGCTTCAGG ACGCGCTGGG GGCGGGCCGT GATGCGCTGA CCGCCGCGCG TCGCCAGCTT
ACCAAGCTGG CTGGCGAGGA CGAGCGCCTG GCGCAAGTCC GCCTGTCCGG GCTCGAGGAC
ACGCCGCAGC TCAAGGTCGA CATTGATGAC GATGCGCTGA CGGCCTTCCA GATCAGTCCG
GCAAGCGCGA ACTCGACGTT ATCCATCGCC TGGGGCGGGC TTTACGTGAA CGACTTCGTC
GACCGGGGGC GCGTGAAGCG CGTCTACGTC CAGGGCGATG CTCCCTATCG CAACGAACCG
TCGGATCTGG GCCAGTGGTT TGTGCGCAGC AATGGCGGGC AGATGGCGCC GTTCTCGGCC
TTCGCCTCTT CACACTGGAC ACAAGGGCCG GTCCGGCTCG ACCGCTTCAA TGGCGTTCCG
GCGCAGCAAT TGCAGGGCAG CGGCGCTCCC GGCGTCAGTT CGGGCGATGC GATGAAGATC
ATGGAGGACA ACGCGGCAAA GCTCGACGGC AACTTCAGCG TGGCGTGGAG CGGCCTGTCC
TATCAGGAGC GCGCCGCTTC CAGCCAGTCG CTCCTGCTCT ACACCGCATC GATCTTCTTC
ATCTTCCTGT GCCTTGCCGC GCTCTACGAA AGCTGGTCGG TGCCGGTCGC GGTGCTGCTG
GTCATCCCGC TCGGCATCAT CGGCGCGGTC CTTGCCGTGA CCTTGCGCGG CTACAACAAC
GACATCTATT TCCAGGTGGC GCTGCTGACC ACCATCGGCC TCTCGGCCAA GAACGCGATC
CTCATCGTGG AGTTCGCCGA AGCTGCGCTG GCGCGCGGGG TGGAACCGGT GGCGGCGGCG
CTCGAAGGCG CGCGACTTCG CCTGCGTCCG ATCATCATGA CGAGCGTCGC GTTCATCGCC
GGCGTTATCC CGCTCGCTAT TGCCGACGGG GCCGGGGCCA ACGGTCGTCG CGCCATCGGC
ACCGGCGTCA TGGGCGGTAT GCTTTCGGCG ACAATTCTGG CGATCTTCCT CGTGCCACTG
TTCTTCGTGC TGGTGAAGGG CTGGTTCCGC CGCAAGCCCG CCACGCCCGT CGAAACCGTG
GGAGAGCCCG CATGA
 
Protein sequence
MISRFFAHRP IFAWVLAIVI MAAGLFAVST MGVEQYPDIA PPTVSINATY GGADASTVEN 
SVTQVLEQQL KGLDGLLYFN SNSSNGSSQI TVTFDKGTDP DTAQVQVQNA ISRAVSRLPA
VVQQQGVNVN KSQSDMLMVV SIYDKTGRTT NADISDYLST HFQDPISRVE GVGNAQIFGA
SYAMRIWLDP LRLAAVQLMP SDVVTALQAQ NTQVAAGEIG ANPAPDGQRL NATVTARSRL
QTPEEFENIV VKTQVDGSVV RLRDVARVEM GEESYGSISR FNGMPATGLS VSLASGANAM
KTAQLVKDTV AELSKDLPAG YEVAYPRDSS TFVKLSIEEV VWSLGEAIVL VVIVMFVFLQ
SWRATLIPAI AVPVVLLGTF GVLSLAGYTI NTLTMFGMVL AIGLLVDDAI VVVENVERVM
HEEGLSPLDA TVKSMDEITS ALIGITLVLT AVFVPMAFFG GSTGAIYRQF SVTIVSAMTL
SVLVALVFTP ALCATLLKPV AEHEKPGGRF FRRFNDGYAN LEGRYRTRLA RVVGRPLPWM
MVFAGITLGM VLLYVRLPTS FLPAEDQGNL SISYQLPVGS TSEQTRAVAN QLSDYIRTSE
KEDVKAVFVV MGRGQAGSAQ NAGQGFILLK DWSERSGKEH SAAAIAERLN AYFRKSRDAQ
IFVLDPPPIR GLGSSGGFEM WLQDALGAGR DALTAARRQL TKLAGEDERL AQVRLSGLED
TPQLKVDIDD DALTAFQISP ASANSTLSIA WGGLYVNDFV DRGRVKRVYV QGDAPYRNEP
SDLGQWFVRS NGGQMAPFSA FASSHWTQGP VRLDRFNGVP AQQLQGSGAP GVSSGDAMKI
MEDNAAKLDG NFSVAWSGLS YQERAASSQS LLLYTASIFF IFLCLAALYE SWSVPVAVLL
VIPLGIIGAV LAVTLRGYNN DIYFQVALLT TIGLSAKNAI LIVEFAEAAL ARGVEPVAAA
LEGARLRLRP IIMTSVAFIA GVIPLAIADG AGANGRRAIG TGVMGGMLSA TILAIFLVPL
FFVLVKGWFR RKPATPVETV GEPA
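
The gene and protein sequences above are related by translation table 11. The following is a minimal sketch of how that relationship and the GC content could be checked in plain Python. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the standard amino acid ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as start codons. Since this gene starts with ATG, that difference does not matter here, but a gene with an alternative start codon would need its first residue set to M.

```python
# Standard codon-to-amino-acid assignments, in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence, copied from above.
first_block = "ATGATTTCGC GCTTCTTCGC CCACCGCCCG"
last_block = "GGAGAGCCCG CATGA"

print(translate(first_block))  # MISRFFAHRP
print(translate(last_block))   # GEPA
```

The two outputs match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and the complete protein sequence to be checked the same way.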