Gene Saro_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0039 
Symbol 
ID3916042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp41050 
End bp44232 
Gene Length3183 bp 
Protein Length1060 aa 
Translation table11 
GC content65% 
IMG OID640442764 
Productacriflavin resistance protein 
Protein accessionYP_495322 
Protein GI87198065 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACTTCC AGAACATCTC GGCCTGGTGC ATTCGCAACC CGGTCGTCCC GATCGTCCTG 
TTCCTGGGGC TGACCCTGGC GGGCCTCGTG TCCTTCATGC AGATGAAGGT GCAGGACAAC
CCGGACATCG AGTTCCCGAT GGTGATCGTC TCGATCGCGC AGCCCGGTGC CGCGCCGACC
GAGATCGAGA ACCAGATCAC GCAGCGCGTG GAATCGGCGG TCCGTTCGAT CGCCGGGGTC
GACACGCTCA CCTCGACGGC ATCGGAAGGC AACAGCCAGA CGATGGTCCA GTTCAAGATC
GGGCAGGACA TCAACGCCGC GGTCAACGAG GTCAAGAACC AGGTCGACCA GATCCGCAGC
GACCTGCCCG AAGGCATTCT CGAGCCGCAG GTCTTCAAGG TCGAGACGTC CTCGAACCCC
ATCGCCTACT TCGCCGTTGC CGCCGACGAC ATGACCATCG AGCAACTCTC GTGGTTCATC
GACGACACCA TCGCCAAGCG GCTGCTTACC GTGCCAGGCA TGGCGGAAGT GAGCCGGTCG
GGCGGCGTCA ACCGCGAGAT CGCGGTCACC ATCGACCCGA TGCGCATGAA CGCGCTGGGC
GTCACCGCCA GCCAGGTCAA CGCGGCGCTG CGCCAGGTCA ACATCAACGC GGCGGGCGGC
AAGACCGAAG TCGCCGGTTC GCGCCAGTCG GTGCGCGTGC TGGGCAATGC GCGTGATGCC
TATGCCCTGT CGCAGACCGA GATCGCGCTT TCGGGCAACC GCACCGTACG GCTCAAGGAC
GTCGCGGATG TCCGCGATTC CTACAGCGAG CTGACTTCGA TCGCGAAGTT CAACGGCAAG
GCGGTCGTCA CGTTCTCGAT GTCGCGTGCG CGCGGGGAAT CCGACGTCTC GGTCTATGAC
GGCGCGCTCG AGGAGATGCG CAAGATCGAG AAGGAGCAGG GCGGCAAGGT CCACTTCGAG
TTGCTGTTCA CCTCGGTCAG CTACACCAAG GACCAGTACC GCTCGTCCAT GAACGCGATG
GTCGAGGGCG CGGTCCTTGC GGTCATCGTC GTGTTCTTCT TCCTGCGCGA CTGGCGGGCG
ACGATCGTCT CGGCCATCGC CATTCCGCTC TCGTCGATCC CGACGTTCTG GGTGCTGGAC
CTGATGGGTT TCACGCTCAA CCAGATGTCG CTGCTGGCGC TGGGCCTCGT GGCGGGCGTG
CTGGTCGACG ACGCCATCGT CGAGATCGAG AACATCGTGC GCCACATGCG CATGGGCAAG
AGCGCATACC AGGCGGCGAT CGACGCGGCC GACGAGATCG GCCTTGCCGT GGTGGCGACG
ACCTTCTCCA TCGTCGCGGT GTTCTTCCCG GTGGCACTGA TGCCTGGCGT TTCGGGGCAG
TTCTTCAAGA ACTTCGGCCT GACCGTCGTC ATCTCGGTGC TGTTCTCGCT GCTTGTCGCG
CGCATGATCA CGCCGATGAT CGCGGCCTAT TTCCTCAAGG CCCACGGCGA GGCGGAGCAT
GGCGGCGGGC CGTGGATGGA CCGCTACATG AAGGTGCTGG GCTGGTCGCT CGATACCGCG
AAGGCGGCGG CGTATCGTGC CGCGCATCCG GGACGACGTT TCCGCGCCCG GCTGCGCGAC
CATCGTCTGT GGATGATGGG CATCGGCTTT GGCGCGCTGC TGCTGACGCT TGTCATGTTC
GCGGTGATCC CGACGCAGTT CTTCCCTGAT ACCGACAGCG ATTCGAGCAC GGTCAGCATC
GAGATGGTCC CCGGCACCAC GCTGCAGCAA ACCGAGAAGA AGGTGCAGGA GATCGTCACG
CTCCTCAGCA AGGAACAGGA GGTCAAGTCG ACCCTTGCTT CCGTGCGCGA GGCGAAGGCG
AACATCTACG TCAACCTGAA GGCGGACCGC GAACGCTCGA GCCTCGAGTT CGAACGCCAG
ATGACGCCGC GACTGCAGCA GATTGCGGAC GTGCGTGCCA ACTTCCAGGC CCAGGGTCCG
AGCGGACCGG GCGGCGGTTC GGGGCGTCCG ATCAGCATCA TGCTGGCCGG GTCCGATCCC
GAGCTGCTGC AGCGTACCGC CCAGACGCTG GTCGAGCAGA TGAGCGCGCT GCCGCTGCTG
GTGGCGCCGC GCATCAGCGC CGACCTGCGC CGTCCCGAAG TCATCATCAA GCCGCGCCTC
GACCTGGCCG CGAACCTGGG CGTGACCACC CAGGCGCTGA GCCAGGTGAT CCGTATCGCG
ACGCAGGGCG AGATCGACCA GAACAGCGCC AAGTTCTCGC TTTCCGACCG CCAGGTGCCG
ATCCGGGTCA AGTTGCCCGA GGATTCGCGC CGCGACCTGT CGATCATCGA GAACCTGCCG
GTGCCGACCG CATCGGGCGG TTCGGTCCCG TTGTCTCGCG TGGCCGAGAT CGGTTTCGGC
TCCGGCCCCA CGCAGATCCA GCGCTATAAC CAGAGCCGTC GCGTCTTCGT TGGCGCCGAC
CTGCCCCCGG GCGTGGTCAA GGGCGAGGCG ATGGCAGCGA TCATGAAGCT GCCGATCATG
AAGAACCTGC CGCAGGGCGT TTCCAACACG GCCGCCGGCG AGGACAAGTT CCAGGCCGAG
ATGATGAAGA ATTTCGGCAT CGCGGTCGCA TCCGGCGTGC TGCTGGTGTT CTCGGTGCTG
GTGCTTCTCT ATCACCGCTT CATTTCGCCC CTGGTCAACA TGGGCTCGCT GTTCCTGGCG
CCGCTGGGTG GCCTGATCGC GATCTGGCTG CTTGGCCAGT CGCTTTCGCT GCCGGTGTTC
ATCGGCATCC TGATGCTGTT CGGCATCGTG GCGAAGAACT CGATCCTGCT GATCGACTTC
GCGCTGGAAG AGATGGCGGC GGGCAAGGGC AAGCTCGAGG CGGTGATGGA AGCGGGGCAC
AAGCGCGCGC AGCCCATCGT CATGACCACG GTTGCGATGG TCGCGGGCAT GGTGCCGACC
GCTGTTGCCA TCGGCGGCGA TTCCGGATGG CGCGCGCCGA TGGGCATAGT GGTGATCGGG
GGGCTCACGC TTTCCACGCT GCTGACCCTG CTGATCGTGC CGGCCGGGTT CAGCCTTGCC
GATGGTGTCG AGAAGCGGAT CGGTCCGTGG CTGGCCCGCA AGGTGCTGGG GTATGAGCCC
GAGCATCGCC ATGCCGCGAC CGCGCGGAAC GATACGGCGC ACGGGGCATT CCCGGCGGAG
TGA
 
Protein sequence
MNFQNISAWC IRNPVVPIVL FLGLTLAGLV SFMQMKVQDN PDIEFPMVIV SIAQPGAAPT 
EIENQITQRV ESAVRSIAGV DTLTSTASEG NSQTMVQFKI GQDINAAVNE VKNQVDQIRS
DLPEGILEPQ VFKVETSSNP IAYFAVAADD MTIEQLSWFI DDTIAKRLLT VPGMAEVSRS
GGVNREIAVT IDPMRMNALG VTASQVNAAL RQVNINAAGG KTEVAGSRQS VRVLGNARDA
YALSQTEIAL SGNRTVRLKD VADVRDSYSE LTSIAKFNGK AVVTFSMSRA RGESDVSVYD
GALEEMRKIE KEQGGKVHFE LLFTSVSYTK DQYRSSMNAM VEGAVLAVIV VFFFLRDWRA
TIVSAIAIPL SSIPTFWVLD LMGFTLNQMS LLALGLVAGV LVDDAIVEIE NIVRHMRMGK
SAYQAAIDAA DEIGLAVVAT TFSIVAVFFP VALMPGVSGQ FFKNFGLTVV ISVLFSLLVA
RMITPMIAAY FLKAHGEAEH GGGPWMDRYM KVLGWSLDTA KAAAYRAAHP GRRFRARLRD
HRLWMMGIGF GALLLTLVMF AVIPTQFFPD TDSDSSTVSI EMVPGTTLQQ TEKKVQEIVT
LLSKEQEVKS TLASVREAKA NIYVNLKADR ERSSLEFERQ MTPRLQQIAD VRANFQAQGP
SGPGGGSGRP ISIMLAGSDP ELLQRTAQTL VEQMSALPLL VAPRISADLR RPEVIIKPRL
DLAANLGVTT QALSQVIRIA TQGEIDQNSA KFSLSDRQVP IRVKLPEDSR RDLSIIENLP
VPTASGGSVP LSRVAEIGFG SGPTQIQRYN QSRRVFVGAD LPPGVVKGEA MAAIMKLPIM
KNLPQGVSNT AAGEDKFQAE MMKNFGIAVA SGVLLVFSVL VLLYHRFISP LVNMGSLFLA
PLGGLIAIWL LGQSLSLPVF IGILMLFGIV AKNSILLIDF ALEEMAAGKG KLEAVMEAGH
KRAQPIVMTT VAMVAGMVPT AVAIGGDSGW RAPMGIVVIG GLTLSTLLTL LIVPAGFSLA
DGVEKRIGPW LARKVLGYEP EHRHAATARN DTAHGAFPAE