Gene Saro_3903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3903 
Symbol 
ID5077387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp73068 
End bp74597 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content59% 
IMG OID640481010 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001165672 
Protein GI146275511 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.753199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGA CCGCTTCATC AGGCGCAGGA ATAGCCCCGT CCCGGCAGCT TGTTGCCGGG 
CTCGTGCTTG CCCTTGCCAA TTTCATGGTC ATTCTCGACC TGAGCATCGC CAATGTCTCG
ATCCCGCATA TCGCGGGCAA TCTGGGCATC ACGCTTGAAC AAGGCGCATG GGTCATCACC
TCCTACGCGG TCGCCGAGGC GATCTGCGTT CCCTTGACCG GATGGGTCGC CGGACGCTTC
GGATCGGTAC GCACCTTCCT GTTCAGCATG GTCGGGTTCG GGATCTGCTC ATTCCTGTGC
GGGATCTCAG TCACGATGGG CATGCTGGTC GCCAGTCGTA TCGGACAGGG AATTTTCGGC
GCCTTCCTGA TGCCCATGTC GCAAACCTTG CTGCTCAGCG TGTTCCCGCC CGAGAAGCGC
AATATGGCGA TGGGACTATG GGCTGTTACA CTGCTGATGG GACCTGCTCT TGGTCCGATG
ATCGGTGGCT ATCTCACAGA AAATTATTCC TGGCATTGGA TATTCCTGAT CAATGTGCCC
GTCGCCATCC TGTGCATCGT GGTTGGTTTC GCGCTGCTCA GGCCGATCGA GACCGAGCGG
CAGATCCTGC CGATCGACTA TGTCGGCCTT GCCTTGCTGG TGCTTTGGGT CGGCTGCCTC
CAGATCGTGC TTGACCTTGG CCGTAACCAC GACTGGTTCG CCGACCCGAT GATCGTGGCG
CTAACGATCA CCTCTGCTGT GGGTTTTATC GTTTTCATTA TCTGGGAGTT GGGCGAGGAT
CACCCGATCG TCGATTTGCG AGTTCTGCGG CACCGTGGCT TCAGCGTCAG CCTGACTGTT
CTGTCGCTCG CATTTGCCGG ATACTTCGCG GCGTTTGTCG TTGTCCCCCA GTGGCAGCAA
GCCTGGCTCG GATTTCCTGC GACAGCAGCC GGCTTGTCCT CTTCGTTTTC GGCCATGGGC
GGACTGATCA CCGTTCCGCT GGTGGCTTTC CTGATGAGCC GCCTCGACCT GCGTTTCCTT
GTTTCGTGCG GCGTTACCTG GATTGCGGCG ATGACGCTCG TGCGCACAAC CTGGACAACA
GACTCCGATT TCTGGACCCT CAGCATTCCG CAGTTCGTTC AGGGGCTGGG CACTCCCTTC
ATGATGCTTC CTCTCATGAC CCTGACGCTC AATACGGTTA AGGAAAACGA GGTGGCATCG
GCCGCCGGCC TGCAAAGCTT CATGCGAACC ATCGCCACCG CCGTAGCGAC TTCCATCACT
CTGTCTTATT GGGGTGATAC TCAGCGCATC GCGCGCAGCG ATGCGGTGGC GGTCCTGCAG
CCGGAAGCGG CGCAGGCAAG CCTTTCCAGC CTCGGCTTCC CTTCTGAACA GATAAGGCAA
GTGCTCAGCA ACATGGTTGA GCTGGAGGCG ACAACGCTGG CGCTTATTCA TACCTTCTGG
GCGACCACAG CCGTTCTGCT TTTTGCCGCC GCCCTGATCT GGCTTGCCCC ACGACCGTCG
AAATCAGGCG GCATTACGAT GGGCCACTAA
 
Protein sequence
MSETASSGAG IAPSRQLVAG LVLALANFMV ILDLSIANVS IPHIAGNLGI TLEQGAWVIT 
SYAVAEAICV PLTGWVAGRF GSVRTFLFSM VGFGICSFLC GISVTMGMLV ASRIGQGIFG
AFLMPMSQTL LLSVFPPEKR NMAMGLWAVT LLMGPALGPM IGGYLTENYS WHWIFLINVP
VAILCIVVGF ALLRPIETER QILPIDYVGL ALLVLWVGCL QIVLDLGRNH DWFADPMIVA
LTITSAVGFI VFIIWELGED HPIVDLRVLR HRGFSVSLTV LSLAFAGYFA AFVVVPQWQQ
AWLGFPATAA GLSSSFSAMG GLITVPLVAF LMSRLDLRFL VSCGVTWIAA MTLVRTTWTT
DSDFWTLSIP QFVQGLGTPF MMLPLMTLTL NTVKENEVAS AAGLQSFMRT IATAVATSIT
LSYWGDTQRI ARSDAVAVLQ PEAAQASLSS LGFPSEQIRQ VLSNMVELEA TTLALIHTFW
ATTAVLLFAA ALIWLAPRPS KSGGITMGH