Gene Saro_3307 details


Gene Information

Locus tag: Saro_3307
Symbol:
ID: 3915954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3525997
End bp: 3527214
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 68%
IMG OID: 640446092
Product: major facilitator transporter
Protein accession: YP_498576
Protein GI: 87201319
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein
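
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3525997
END_BP = 3527214
GENE_LENGTH_BP = 1218
PROTEIN_LENGTH_AA = 405


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3527214 - 3525997 + 1 gives 1218 bp, and 1218 / 3 - 1 gives 405 aa, matching the reported lengths.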


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0906362
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCACC GCGCCTCGTT CGGTATCGTC TTCGCGATCG TGATGATCGA CATGCTGGGC 
TTCGGGATCG TCACCCCGGT GCTGCCCGGC CTGATCATCG AGCTGACCCG CGTCGACATC
GGCACAGCGG CGGAATACGC GGGCTGGCTG GGCGCCGGAT ATGCGACGAT GCAGTTCGTG
TTCGCGCCGG TCATCGGCAA CCTGTCCGAC CGGTTCGGGC GCAGGCCGGT GCTTCTTGCC
GCGATCCTGA TGCTCGGGCT GGACTACCTG CTTCAGGCCA TGGCCCCGCA TTTCTGGTGG
CTGATCATCG GCCGGCTGCT GGCGGGCGTC ACCGGCGCGA GCTTTTCGGC CGCCTACGCC
TATATCGCCG ACGTGACCCC GCCCGAAAAG CGCGCGGCAA ACTTCGGAAT GATGGGCCTT
GCCTTCGGGT TCGGCTTCGT GGTCGGTCCC GCGATGGGGG GACTTCTTGG CGCGATAAGC
CCGCGCCTGC CGTTCTATGC GGCATCCGCC CTCGCATTGA CGAACTTCGT GTTCGGCATG
TTCTTCCTGC GCGAATCGCT CGCACCGGAA AACCGCCGCC CATTCGACTG GCGGCGGGCA
AACGCACTGT CGTCGCTGCG CGCGCTGCGG GGACGGAGCC GGACCGTGCT GTGGTTCGTT
GCTGCGCTGG GCGCGTGGCA GCTCGCCCAC GTGGTCTACC CGGCGGTCTG GCCCTATTTC
GCCATCGCCG CCTACGGTTT TTCGACCCGC GACGTCGGAC TGGCACTGGC GATGGTAGGG
TTTTCGAGCG CGCTGGTTCA GGGCTTCGGC CTGCGCTTTG CCCTGCCGCG GCTGGGCGAG
CGCCGCGCCG TGGTGCTGGG GGTGGCCGGA CTGTGCGCAT CGGCCGTGCT CTACAACCTC
GCCCAGCACA CCTGGCAGGT CTATCTGGCG ATTGCCGTGG GCGCCTTGCA GGGCTTCGTC
CAGCCGCCGA TCGCCGCGTT CAACAGCCGC GCGGTCGATG CGCGCAGCCA GGGCGAATTG
CAGGGCGCGG TGCAATCGAT CGGCTCGATC GCGGCCATCG TCGGTCCGCC GCTCTATACC
CAGACGCTAG CGCGGTTCAG CGGGCCGCAC GCCATCGTCA ACCTGCCGGG GATGCCGATG
CTGCTTTCGG CGGCGATCTC GCTGATGACG CTGGCCCTGT TCTGGAAGGG CGCCTCGCTG
CTGCGCGAGA GCGAATGA
 
Protein sequence
MQHRASFGIV FAIVMIDMLG FGIVTPVLPG LIIELTRVDI GTAAEYAGWL GAGYATMQFV 
FAPVIGNLSD RFGRRPVLLA AILMLGLDYL LQAMAPHFWW LIIGRLLAGV TGASFSAAYA
YIADVTPPEK RAANFGMMGL AFGFGFVVGP AMGGLLGAIS PRLPFYAASA LALTNFVFGM
FFLRESLAPE NRRPFDWRRA NALSSLRALR GRSRTVLWFV AALGAWQLAH VVYPAVWPYF
AIAAYGFSTR DVGLALAMVG FSSALVQGFG LRFALPRLGE RRAVVLGVAG LCASAVLYNL
AQHTWQVYLA IAVGALQGFV QPPIAAFNSR AVDARSQGEL QGAVQSIGSI AAIVGPPLYT
QTLARFSGPH AIVNLPGMPM LLSAAISLMT LALFWKGASL LRESE
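
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only, not the method the source database used. It assumes the sequences are pasted in with the 10-character spacing shown above, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (it differs only in permitted start codons, which does not matter here because this gene begins with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon-to-residue mapping in NCBI order (bases T, C, A, G; third base varies fastest).
# The amino acid assignments are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGCAGCACC GCGCCTCGTT CGGTATCGTC"))  # MQHRASFGIV
```

Applied to the final 18 bases of the gene sequence (CTGCGCGAGAGCGAATGA), `translate` yields LRESE, which matches the end of the protein sequence, with TGA read as the stop codon.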