Gene Saro_1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1298 
Symbol 
ID3917930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1342714 
End bp1344042 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content66% 
IMG OID640444035 
Productmajor facilitator transporter 
Protein accessionYP_496576 
Protein GI87199319 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCAG AGACGGCAGA CGAGCGTAGC CTGTGGGATT CGGTGCGGCC CTATCTTGAA 
AAGGAATCGC TGGCCGCCTT CTTCCTCGGC GTATCCTCGG GCTTTCCCTA TGCGATGATC
GGCGCGACGC TGACGACGCG GCTGGCGCAG GACGGGATCG ACAAGAAGAC CGTTACCGCC
TTCACGCTGG CTTTCCTCGT CTACAACCTC AAGGTCTTCT GGGCCTGGCT GGTCGATGGC
GTGCGCCTGC CGTTGCTGGG CAGGCTGGGG CAGCGCGTTT CGTGGATGCT GCTGGCAGGG
TCGCTGGTCA TGGCGGCGGT CGCCAACCTT GCGCTGGTCG ATCCGGCGGC GGACCTTGGC
GCGACGGTGC TTGCCGCCGT GCTGGTAGGC GTTGCGGGCG CGACGTTCGA CATCGTGATC
GACGCCTATC GCATCGAGAC ATTGAAGCCA TATCAGCTCG GCACCGGTTC GGGCATGAGT
CAGTACGGCT GGCGCATCGG TTCGGCCGGG GCGGGCGCGC TGGCGCTGAT CGTGGCCGGG
CGTTCGGGGT GGAGCGCGGC CTATCTTGCC TGCGCGCTCT TCGCGCTGCC CGCCATGCTT
ACCGCGCTGT TCCTGGGAGA ACCCGCACGG CACCGCGAGC CGACCAGGCG GAAAGGCGTG
GGCGAGGTCG TGGCATCGAT CATCGGCCCG TTCGGCGAGT TCTTCCGCCG GCACGGCGCG
TGGCTCGTCC TGCTGTTCAT CCTCGTCCAC AAGGTCGGCG ACACGCTGGC GAACCTGACC
TTCCGCCTGT TGTTCGACGA CCTCGGCTTC ACCAACGACG AAATCGCCAT CTGGGACGTG
GGCGTGGGCT TCTGGGCCTA CCTGATCGGC GTGTTCATCG GCGGCGTGGC CTATGCCCGG
ATGGGACTCA AGCGCTCTGT CCTTCTGGCG CTGGTGCTGA TGGCGGTGTC GAACCTGTCG
TTCGCGGCGC TCGCGGCGGC TGGTCATTCC AACATCGGCA TGGCGGGCGC CATCGGCTTC
GAAAACATGG CCTCGGGTTA TGGCGGCGTC GTCGTGGTCG CCTATTTCTC GGCGCTGTGC
GACCTGCGCT ACACCGCCGC GCAATACGCG CTGATTTCGG CCGGGGCGAG CGTGGTCGGA
CGTTTCGCCA CCGGGACCAC AGCGGGCGCG TTGATCGAGG GCATGGGCTA CGTGAACTTC
TACCTGCTTA CGACCGTGCT GGCGCTGCCG GGCATCGTGC TGTTCTGGTG GATGAGCCGC
AGCGGCCTGG TCGATGCGGC GATGGGCACG GCCGGCGAAG AGAAGTCGGA CGCCGATCCG
CTTACCTGA
 
Protein sequence
MSAETADERS LWDSVRPYLE KESLAAFFLG VSSGFPYAMI GATLTTRLAQ DGIDKKTVTA 
FTLAFLVYNL KVFWAWLVDG VRLPLLGRLG QRVSWMLLAG SLVMAAVANL ALVDPAADLG
ATVLAAVLVG VAGATFDIVI DAYRIETLKP YQLGTGSGMS QYGWRIGSAG AGALALIVAG
RSGWSAAYLA CALFALPAML TALFLGEPAR HREPTRRKGV GEVVASIIGP FGEFFRRHGA
WLVLLFILVH KVGDTLANLT FRLLFDDLGF TNDEIAIWDV GVGFWAYLIG VFIGGVAYAR
MGLKRSVLLA LVLMAVSNLS FAALAAAGHS NIGMAGAIGF ENMASGYGGV VVVAYFSALC
DLRYTAAQYA LISAGASVVG RFATGTTAGA LIEGMGYVNF YLLTTVLALP GIVLFWWMSR
SGLVDAAMGT AGEEKSDADP LT