Gene Saro_3402 details

Gene Information

Locus tag: Saro_3402
Symbol:
ID: 5077551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 1392
End bp: 2579
Gene length: 1188 bp
Protein length: 395 aa
Translation table: 11
GC content: 72%
IMG OID: 640481126
Product: Bcr/CflA subfamily drug resistance transporter
Protein accession: YP_001165788
Protein GI: 146275628
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00710] drug resistance transporter, Bcr/CflA subfamily
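
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1392, 2579)
print(length, protein_length(length))  # prints: 1188 395
```

Under these assumptions the computed values agree with the listed gene length (1188 bp) and protein length (395 aa).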


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.762536
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGGCAG CCGCCGCGCC CGGGCCCGCA ACGATCATCA CTCTCGCCGC GATCGCGGCA 
ATGGGGTCGA TGGCGATCCA CATGCTCGTC CCGGCGCTGC CTCTCCTCGC GCACGAAATG
GCGGTGGGCG AGGCCCGCGC GCAACAGGCG GTCAGCGTCT ATCTGGCGGG GCTCGCCGGA
GGCCAGCTGA TCGCCGGCCC GCTGGCCGAC CGGCTCGGCC GCCGCCCGGT CATGCTCTGG
GGCCTTGCCT GCTACATCGC GGGCGCACTG GGCGCCGCGC TCTCACCTGC CATGCCCATC
CTGCTTGCGG CGCGGCTGCT CCAGGCGCTT GGCGGCGCGG CGGGGGTGGT CAGCGCACGG
GTCATCGTCG GCGAACTCTA CGGCCGCGAG GAAGCGGCCG CGCGGCAGGC GACGCTCATG
TCCATCGTTC TGATATCCCC CGCTCTCGCG CCGGTCGTTG GCGGCGTGAT CGCGGATTTC
GCAGGCTGGC GCACGGTTTT CCTGATGCTT GCCGCCACGG GTCTCGCAGG CCTCGTGTCC
GCCAGAATGA TCCTGCCCGC TCACACCCCG GCCGTCGCCG CCACCGCAGA GGGCACGCAC
CTGCGCCCGC CCCTCATCCA CGGCTATGCC CGCCTGTTCC GCAACCGTCG CTTCGTCCTC
ACAACCGTCG CGCTCGCGGC GTCGAGCGGC AGCCTCTACA TGTTCCTGGG CGCGGCCCCG
TTCCTGCTGA TCGGCAAGGG CGGGCTCAGC CCGTCCGAGG CCGGCATCGG CCTTCTGATC
GTGGCCGGCG CTGGCATTGT CGGCACTCGT CTCATGCGCC TTGTCCAGCG ACGCGGCGAT
GCGGTGGTCT TCGGCACGGC AAGCGCCGCC ACGGGCGCCA TCTCGGCACT GCTCCTGGCT
GCGCTGGGCT TTCACGACCC TTTCGCCCTG CTCGCGCCCG TTACCCTTCT CGGCCTTGGC
GCCGGCCTTA CCGGGCCCGC CGCGATCAGC GAGGTCGCCT ATGCCGAGGC CGGGCTCGCG
GCCACCGCCA CCAGCCTCGC CGGGGCTCTG CAGATGCTGG CCAGCAGCCT TGCCATGACC
GCGCTCGGCC TCTTCGCCCC GCTCGATTCG CTGCGGGTCT GCCTTGCCCT TGCGCTGTCG
TCCGCAGTGG GCCTGACAAG CGCCCTGTTG CGTCGGGGCA ACGCCTGA
 
Protein sequence
MKAAAAPGPA TIITLAAIAA MGSMAIHMLV PALPLLAHEM AVGEARAQQA VSVYLAGLAG 
GQLIAGPLAD RLGRRPVMLW GLACYIAGAL GAALSPAMPI LLAARLLQAL GGAAGVVSAR
VIVGELYGRE EAAARQATLM SIVLISPALA PVVGGVIADF AGWRTVFLML AATGLAGLVS
ARMILPAHTP AVAATAEGTH LRPPLIHGYA RLFRNRRFVL TTVALAASSG SLYMFLGAAP
FLLIGKGGLS PSEAGIGLLI VAGAGIVGTR LMRLVQRRGD AVVFGTASAA TGAISALLLA
ALGFHDPFAL LAPVTLLGLG AGLTGPAAIS EVAYAEAGLA ATATSLAGAL QMLASSLAMT
ALGLFAPLDS LRVCLALALS SAVGLTSALL RRGNA
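
The relationship between the two listings can be sketched in code. The example below is a minimal illustration, not the database's own pipeline: it builds the standard codon table (translation table 11 assigns the same amino acids to codons as the standard code), strips the spacing used in the listing, and translates up to the first stop codon. It assumes the first codon is the initiation codon and reports it as M, which is why the GTG at the start of this gene appears as methionine in the protein. The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is the initiation codon, so it is
    reported as M even when it is an alternative start such as GTG.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAGGCAG CCGCCGCGCC CGGGCCCGCA"))  # prints: MKAAAAPGPA
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow a comparison against the protein listing and the reported GC content.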