Gene Saro_1540 details


Gene Information

Locus tag: Saro_1540
Symbol:
ID: 3917215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1589321
End bp: 1590565
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 64%
IMG OID: 640444281
Product: major facilitator transporter
Protein accession: YP_496815
Protein GI: 87199558
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
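
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1589321
end_bp = 1590565

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1245
print(protein_length_aa)  # 414
```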


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGCGC GCATTTCAGA CAATCCCACG GTGCCCGTCC GCGAAAACTG GCTGCTGCTC 
ATTCCGCTGG CACTCGCCGG CTTCGTCCTC ATCGGCGGCA TGCTCGTCTC GCTGACCGTC
TATACCGCGG TCATGCAAGC AAGGTTCGGG TGGAACGAGA CCGAGCTAGG CGCAGGGCCC
GTTGCGCTTC TGCTGGGCAT GAGTGTCGCA AATCTCGCGG TCGGCCCGGC CATGCAGCGT
CTCGGCGTGC GGGCGATTTT CGCCGGCGGT TGCCTGGTTG CCGCCGCCGG ATGGGCGGCG
GCGGGATCGG TCACCCAGCT TTGGCAATTC ATGGTAGCAA TGGGCCTTGC CGGTTTTGGC
GCAGGTGCGG CGACGATCGT CCCCGGTATC GCCGTCATCA CCAGCGCGTT CCGCAAGAAC
AGGGGCTTGG CCATCGCACT GTTCATCGGA AGCTGTGCTC TTGCCAGTTC GGCGATGCCG
ATCTTCACCG CATACCTGAT CGAAACCGTG GGCTGGCGCA CCGCCTTCCG GATCGTCGGC
ATGGCTGGTG TGCTGCTTTG CCCGCTGCTC GTGCGCTTCC TGCCCGGCAC GCTCTCGATC
GGCCAGCACG ACGAATGCAT CGATCCCGGC GACGCGGCGC GCCGGCCCCG AACCGCCGCA
CTGCGGCTTC CTGCCTTCTG GATATTGACG GCCGCATTGA CCGTCAGCCA ACTGTGCATG
AACGGTGTCC TGTTCAATAT CGTCGCCTTT CTCCGCAAGA ACGGCCTGAC TGACAGCGCG
GCGGTCGACC TCTACAGCCT CACCAATTTC ATGAGCCTGC CCGGCCTTCT GATCGGCGGC
CTTCTTTCCG ATCGCGTTAG CGCCCACAGG CTCCTTCCCG CTATCGTCGC AGTGCAGGCT
CTCGGGACCT TCGCCTTGCT TGGCATCGGT CACCAAGGGG CGCTTGGACT TTTCGCGACG
ATCGGCTTCG TCGTTTTTTG GGGCGGCGTG GCAGGCCTTC CGTCGCAATC CGCGTCGCTG
CTTCTGGGCG AATTGGTAGG CCCGCAAAGC TTTGCCTCCC TGCTCGGCAT CGTCTTCACC
ATAAACGGCT TCGTAGGTGC CCTCGCCCCC GTACTGACTG GCTGGCTGCA TGACGTAAGC
GGAAGCTACG TCCTGCCATT TGGCCTGTTC GCCGCGTGCC TTCTGGCGGC GGCCCTTGCC
TGTCGCCTTG GCTCCATGAG CCAGCCCAAT TCCATGCATG CGTGA
 
Protein sequence
MHARISDNPT VPVRENWLLL IPLALAGFVL IGGMLVSLTV YTAVMQARFG WNETELGAGP 
VALLLGMSVA NLAVGPAMQR LGVRAIFAGG CLVAAAGWAA AGSVTQLWQF MVAMGLAGFG
AGAATIVPGI AVITSAFRKN RGLAIALFIG SCALASSAMP IFTAYLIETV GWRTAFRIVG
MAGVLLCPLL VRFLPGTLSI GQHDECIDPG DAARRPRTAA LRLPAFWILT AALTVSQLCM
NGVLFNIVAF LRKNGLTDSA AVDLYSLTNF MSLPGLLIGG LLSDRVSAHR LLPAIVAVQA
LGTFALLGIG HQGALGLFAT IGFVVFWGGV AGLPSQSASL LLGELVGPQS FASLLGIVFT
INGFVGALAP VLTGWLHDVS GSYVLPFGLF AACLLAAALA CRLGSMSQPN SMHA
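
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own tooling. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which start codons it permits). The function name `translate` and the helper variables are hypothetical. The example translates only the first 60 bases of the gene sequence listed above.

```python
# Build the codon table from the conventional TCAG ordering.
# Assumption: table 11 uses the same codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence for display.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from the record (60 bases, 20 codons).
first_line = "ATGCACGCGC GCATTTCAGA CAATCCCACG GTGCCCGTCC GCGAAAACTG GCTGCTGCTC"
print(translate(first_line))  # MHARISDNPTVPVRENWLLL
```

The output matches the first 20 residues of the protein sequence listed above.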