Gene Saro_0804 details


Gene Information

Locus tag: Saro_0804
Symbol:
ID: 3915858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 854402
End bp: 855700
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 67%
IMG OID: 640443535
Product: major facilitator transporter
Protein accession: YP_496083
Protein GI: 87198826
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
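
The start, end, gene length and protein length fields above are consistent with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 854402
END_BP = 855700


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 1299 bp
print(length_aa, "aa")  # 432 aa
```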


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACAAA CCCCGCGCGA GGTGATCGAG CAGACGCCGA TGGGCGTCCG CCAGTGGATC 
GCCGTGGTCC TGATGATCGC GCTGAACGCG CTCGACGGGT TCGACGTCCT GTCCAGCGCC
TTTGCCGCCC CCGGCATCGC CAAGGAATGG GGCATCCAGC GCGACGCGCT GGGTGTCGTG
CTGTCGATGG AACTGGTCGG CATGGGCTTT GGCTCGATCC TGCTGGGCGG CGCGGCGGAC
CGCTTCGGGC GGCGTCCGAC TATCCTCGGC TGCCTGTTGG TCATGGCCAC CGGCATGTGG
CTGGCGACGA CTTCGGCGAG CCCGTCGGGC CTTGCCCTGT GGCGCTTCAT CACCGGCCTC
GGCATCGGCG GCATGCTGGC GGCGATCAAC GCGGTGACGG CCGAATTCTC CAGCCTCAAG
GGCCGTTCGC TGGCCATGGC GCTGATGGTC ATCGGATACC CGATCGGCGC GACGGTGGGC
GGGACCATCG CCGGAATGCT GCTGAAGGGT GGCGACTGGC GGCTGGTCTT CGAATTCGGC
GCGATCGCCA CGGCGGTGTT CATCCCGCTC GTGTTCCTGT TCGTGCCCGA GACGCCGGAT
TACTACGTCA CCCGTCGCGA GCCGGACGCG CTCGACAAGG TCAACGCCAG CCTGCGCAAG
CTGGCCCTGC CGCTCGCCAC GATCCTGCCG CCTGCGCCGG CAGTGGTCGA CAAGCCGAGC
GTGTTCGACA TCTTCAAGCC CGGCCTGATC CGGACGACCC TGCTGTTCAC GCTAGGCTAT
TCGTTCCACG CGGTGACGTT CTACTACATC CTCAAGTGGA GCCCCAAGAT CGTCGCGGAC
TTCGGTTACA CCCAGCCCGA GGCTGCGAGC GTGCTGGTCT GGGCGAACAT CGGCGGGGCG
ACCGGCGGGG CGCTGTTCGG ATTTGCCATG CACAAGTTCG GGTTGAAGTG GCCGACCATC
GCGATGCTGG TTGGCGGCGC GATTGCGGTC GTGGCTTTCG GCTTCGGACG AGAGAGCCTC
GACGGGTGGA AGATGGCGGT GTTCTTCACC GGCTTCACCA CCAACGCCGC GATCGTCGGC
TTCTACGCCC TCTTCGCCAA GGGCTTCCCG ACCCACGTGC GGGCGACCGG CACCGGCTTT
GCCATCGGCG CCGGACGCAT CGGCGCAGCG GGTTCGCCGA TCCTGGCGGG CGTGCTGTTC
ACGCAGGCAG GCCTCGGTCT GCTGGGCGTC TCGGTCGTGA TGGCGATGGG ATCGGTCGTG
GCAGCGCTGC TGCTGCTGAT GCTGCGCAAG GAAGTCTAG
 
Protein sequence
MSQTPREVIE QTPMGVRQWI AVVLMIALNA LDGFDVLSSA FAAPGIAKEW GIQRDALGVV 
LSMELVGMGF GSILLGGAAD RFGRRPTILG CLLVMATGMW LATTSASPSG LALWRFITGL
GIGGMLAAIN AVTAEFSSLK GRSLAMALMV IGYPIGATVG GTIAGMLLKG GDWRLVFEFG
AIATAVFIPL VFLFVPETPD YYVTRREPDA LDKVNASLRK LALPLATILP PAPAVVDKPS
VFDIFKPGLI RTTLLFTLGY SFHAVTFYYI LKWSPKIVAD FGYTQPEAAS VLVWANIGGA
TGGALFGFAM HKFGLKWPTI AMLVGGAIAV VAFGFGRESL DGWKMAVFFT GFTTNAAIVG
FYALFAKGFP THVRATGTGF AIGAGRIGAA GSPILAGVLF TQAGLGLLGV SVVMAMGSVV
AALLLLMLRK EV
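
The protein sequence relates to the gene sequence by codon-by-codon translation. The following is a minimal Python sketch, assuming the codon assignments of translation table 11 match the standard genetic code (the table differs mainly in its permitted start codons). It is applied only to the first and last lines of the gene sequence above, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence, copied from above.
first_blocks = "ATGTCACAAA CCCCGCGCGA GGTGATCGAG"
last_line = "GCAGCGCTGC TGCTGCTGAT GCTGCGCAAG GAAGTCTAG"

print(translate(first_blocks))  # MSQTPREVIE
print(translate(last_line))     # AALLLLMLRKEV
```

Both outputs match the corresponding ends of the protein sequence listed above. Each full line of the gene sequence holds 60 bases, so every line starts in frame.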