Gene Saro_3163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3163 
Symbol 
ID3918205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3376371 
End bp3377777 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content67% 
IMG OID640445947 
Productsugar transporter 
Protein accessionYP_498432 
Protein GI87201175 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.956524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCAAC AACCGCAGCC GGGCGGCAAT CTCGGCTTCA TCGGGCTTAT CGTGGCGGTT 
GCCACGATTG GCGGCTTCAT GTTCGGCTAC GATTCGGGTG CCATCAACGG CACGCAGGAC
GGGCTCAAGC ATGCCTTCGG ACTGGGCGAA GCCGAGCTTG GCCTGACCGT CAGCGCACTC
CTGCCCGGTT GCGCGCTCGG CGCGTTCCTG GCGGGACGGT TTGCCGACAT CTGGGGCCGC
CGCGCGGTCA TGATGATGGC GGCAGCGCTG TTCATTCTCT CGGCGCTCGG CTCGGGGGCA
GCGCCCTCGG CCATCGTTCT TGCAGTTGCG CGCTTCTTCG CGGGCGCGGC GGTGGGTGCC
GCAAGCGTGC TTTCGCCGGC CTATATCTCC GAAGTCACTC CGGCCCACGT GCGCGGACGT
CTGTCGAGCG CGCAGCAGGT CATGATCATT TCGGGCCTGA CCGGAGCTTT CCTCGCAAAC
TACTGGCTGG CTGGGGCCGC CGGGTCCTCG CTCGGTGCGT TCTGGTGCGG TTATCCGGCA
TGGCGCTGGA TGTTCTGGGT CCAGGCAGCG CCGGCCATCC TGTTCCTCGT CACGCTTCTG
CTCATCCCCG AAAGCCCGCG CTTCCTCGTG GCGAAGGGCC GCACCGAAGA AGCCCGCTCG
GTTCTCGCCC GTCTGTTCGG CGATGCCACC GCGGACGCCA AGCTCGGCGA AATCCGCGCC
TCGCTTGCCG CCGATCACCA GCCGAGCCTT GCCGACATCC GCAAGCCGGG CGGCGGCTGG
CGCCCCATCG TCTGGGTCGG CATCGGCCTT GCCGTGTTCC AGCAGCTCGT CGGCATCAAC
GTGGTGTTCT ACTATGGCGC GGTGCTGTGG CAGGCGGTCG GCTTTTCCGA GGCGGACGCG
CTCAAGATCA ACATCCTTTC GGGCGTCGTC TCGATCGCGG CCTGCCTCGT CTCGATCGGC
CTCGTGGACA AGCTTGGCCG CAAGCCGCTG CTGCTGATCG GTTCGGCGGG GATGACCGCG
ACGCTGGGCG CGCTTGCCTG GTGCTTTGCG CAGGCATCGA CCGGCCCCGA CGGCGCGCTT
GTCCTGCCCG AGGGCGTCGG CACCATCGCG CTCTACGCGG CCAACATCTA CGTCGTTTTC
TTCAACATGA GCTGGGGCCC GGTGATGTGG GTCATGCTGG GTGAAATGTT CCCCAACCAG
ATGCGCGGAT CGGCCCTGGC GGTGGCGGGC GCGGCGCAGT GGCTGGCGAA TTTCGCGGTC
AGCTCGAGCT TCCCGTGGCT GGCAGGCAAC ATCGGCCTGC CCGTCACTTA CGCCGCCTAC
ACCCTGTTCG CCGCCATCTC GCTGGTCTTC GTCTGGACCT CGGTCAAGGA GACCAAGGGC
AAGGAACTCG AGGCCATGGA AGGCTGA
 
Protein sequence
MSQQPQPGGN LGFIGLIVAV ATIGGFMFGY DSGAINGTQD GLKHAFGLGE AELGLTVSAL 
LPGCALGAFL AGRFADIWGR RAVMMMAAAL FILSALGSGA APSAIVLAVA RFFAGAAVGA
ASVLSPAYIS EVTPAHVRGR LSSAQQVMII SGLTGAFLAN YWLAGAAGSS LGAFWCGYPA
WRWMFWVQAA PAILFLVTLL LIPESPRFLV AKGRTEEARS VLARLFGDAT ADAKLGEIRA
SLAADHQPSL ADIRKPGGGW RPIVWVGIGL AVFQQLVGIN VVFYYGAVLW QAVGFSEADA
LKINILSGVV SIAACLVSIG LVDKLGRKPL LLIGSAGMTA TLGALAWCFA QASTGPDGAL
VLPEGVGTIA LYAANIYVVF FNMSWGPVMW VMLGEMFPNQ MRGSALAVAG AAQWLANFAV
SSSFPWLAGN IGLPVTYAAY TLFAAISLVF VWTSVKETKG KELEAMEG