Gene Saro_2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2097 
Symbol 
ID3917745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2233152 
End bp2234426 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content68% 
IMG OID640444850 
Productgeneral substrate transporter 
Protein accessionYP_497370 
Protein GI87200113 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00320357 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCGCGC AGACACTCGA TGAGCGAAGA CTTGCCCGCC GCGCCGTGAT TGCCGCGACC 
ACAGGCAACG CGCTGGAATT CTACGATTTC ATCACGTTCA GTTTCTTTGC CATCCAGATC
GGCAGGGTCT TCTTCCCGTC GGAGGACCCT TTCGTCAGCC TGATGGCCTC GCTCGCCACG
TTCGGAGTCG GCTTCATCGG CCGCCCGCTC GGCGCATGGG CCATCGGCGC GTGGGCGGAC
CGGCACGGGC GCAAGCCGGC GATGCTGCTC AGCATGACGC TGATGGGCAT CTCCGTCGCG
GTCCTCGCGC TCACGCCGTC CCATGCCGCC ATCGGCGCCG CCGCCCCGGT CATCGTCGTG
CTGGCCCGGC TGGTCCAGGG CTTCGCGCTC GGCGGAGAGG TCGGCTCGGC AACCACCTAC
ATGATGGAAT GCGCCAGCCA TGATCGCCGG GCCTGGGCGA TAAGCTGGCA AGGCGCCAGC
CAGGCCATCG CATCTTCCGC CGGATCGCTG GTCGGCCTCG GCCTCAGCCT TGTCCTGACG
CCGGACCAGC TGACCGACTG GGGCTGGCGC GTGGCCCTGC TCGCGGGGAC GGTGATCGTG
CCGTTCTCTC TCGTCATCCG CCGGTCGCTT CCCGAAACGA TCGATGCGCC CGATCACGTG
CCCGCCGGGC ACATCCCGCC CGGCGTGTGG CGGACGGTCG TGCTGGGCAT GATGATGGTC
TCCGGGGCGA CCATCGCAAC CTACCTGTTC AATTACATGG CGACCTATGG CCAGAACACG
CTCGGCTTCA CGGCCAGCGT CTCGCTGGGC ACGACGCTGG CCGTCAACGT CGCGCGCTTC
GCCGCGATCC TGCTGGGCGG CTGGCTCAGC GACCGCTTCG GGCGGCGTCC CCTGATGATC
CTGCCCTGGG CGGTCTTCGC CGCGGCCATC GTGCCGGCCT ATGTCTGGCT GACATCGGCG
CACGACGCCT TCGTCTTCAT CGCGGTCAAC ACCGCGCTCG CCTTCTGCTC GACGGTGCCT
TCCGGCGCTG TCTACGCCGC GATTGCCGAA AGCCTGCCCA AGGCCAGCCG CGCGAAGACC
TTCGCGCTGG TCTACGCGCT GCCGGTCACT TTCCTGGGCG GATCGACGCA GTTCGTCATC
ACCTGGCTGC TCAAGGTCAC CGGAGAACCC ATGGCGGTCG CCTGGTACAT GCTCGGCGCC
GCGCTCCTCG CGCTTGCCGG GATGGTTCTC GTTCGCGAGA GCGCGCCCTC CCGCCTGCGC
GCCTCTCCGG CCTGA
 
Protein sequence
MAAQTLDERR LARRAVIAAT TGNALEFYDF ITFSFFAIQI GRVFFPSEDP FVSLMASLAT 
FGVGFIGRPL GAWAIGAWAD RHGRKPAMLL SMTLMGISVA VLALTPSHAA IGAAAPVIVV
LARLVQGFAL GGEVGSATTY MMECASHDRR AWAISWQGAS QAIASSAGSL VGLGLSLVLT
PDQLTDWGWR VALLAGTVIV PFSLVIRRSL PETIDAPDHV PAGHIPPGVW RTVVLGMMMV
SGATIATYLF NYMATYGQNT LGFTASVSLG TTLAVNVARF AAILLGGWLS DRFGRRPLMI
LPWAVFAAAI VPAYVWLTSA HDAFVFIAVN TALAFCSTVP SGAVYAAIAE SLPKASRAKT
FALVYALPVT FLGGSTQFVI TWLLKVTGEP MAVAWYMLGA ALLALAGMVL VRESAPSRLR
ASPA