Gene Saro_2623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2623 
Symbol 
ID3917038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2837522 
End bp2839213 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content63% 
IMG OID640445382 
Productgeneral substrate transporter 
Protein accessionYP_497893 
Protein GI87200636 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAACC TTGAGATAGC GGATGATTTT TCGGCTTTGC CGAAACATCA TCAGGCCACG 
CACAACGAGA AGCTGATCAT AACCGCGTCT TCGCTAGGCA CGGTGTTCGA GTGGTACGAC
TTCTACCTCT ATGGTCTCCT CACCGCGATC ATCGCGGCGA AGTTCCTGAC AGGGCTCAAC
CCGACGACCT CGTTCATCAT GGCGCTGCTC GTCTTCGCGG CGGGCTTCAT CGTCCGTCCT
TTCGGGGCAC TCGTGTTCGG CCGCATCGGG GACATGGTGG GCCGCCGCTA TACCTTCATC
GTGACGCTGC TGGTCATGGG CCTGTCGACC TTCCTTGTCG GTTGCCTGCC CACCTATGAA
ACCGTCGGCG TGGCCGCGCC GATCATGCTC GTCGTCCTGC GCATGTTCCA GGGCCTGGCC
CTGGGCGGGG AATACGGCGG CGCCGCAACC TACGTGGCCG AGCACGCGCC CGAAGGAAAG
CGGGGGCTCT ATACGAGCTG GATCCAGATC ACCGCGACGG CCGGGCTGGC CATGGCCTTG
CTGATCGTGA TCCTCGTGCG CTCGCCGGTC ACCGGCGTGG GTGAGGAGGC GTTCAAGGAC
TGGGGCTGGC GCGTGCCCTA CCTGATCTCG GGCCTGTTCC TCTGCGTGGG GCTGTGGCTG
CGCCTGAAGC TGCATGAATC GCCCGTCTTC CAGAAGATGA AGGACGAAGG CACATCCTCC
AAGCGTCCGC TGGGCGAAGC CTTCGGCGAA TGGAAGAACC TCAAGATCGT GCTGATCGCG
TTCTTCGGCG CCATCGCGGG CCAGGCCGTG GTCTGGTACA CCGGCCAGCT CTACGCGATG
TACTTTCTCG AAAAGATGCT CAAGGTCGAT GGCCTTACCG CCAACACGCT GATCATCGTC
GCGCTTGCCT GTGCCACGCC GTTCTTCCTG TTCTTCGGCT GGCTGTCGGA CAAGATCGGT
CGCAAGAAGA TCATCCTGGC CGGTTGCGCG CTCGCTGCGC TCACCATGTT CCCGGCGTTC
AAGGCGCTGA CCGAAGCGGC GAACCCCGCC CTTGCCCATG CCCAGGCCAA TGCGCCGGTA
AAGGTCTTGG CCAACCCCGC CGAATGCTCG TCGCAGTTCG ACCCGGTAGG CGCGAACAAG
TTCGACACCA CAAGCTGCGA CATCGTCAAG AACGCGCTCG CCAAGGCGGC GGTGAACTAC
GAGAACGTCG CGGCACCCGC CGGCACCATT GCTTCGATCC GGATCGGCAG CACCACCATC
GTCGCTCCCG ATCCGTCGAA GGTTTCGGGT GACGAGAAGA AGGCCGCCAT CGCCGCCTTC
GCCAAGCAGG TCACCTCCGA CCTGGAGACC GTGGGCTACC CGGCCAAGGC CGACCCTGAC
CAGATCGACA AGCCGGTCGT CGTGGCGATC CTGTTCTATC TCGTCCTGCT GGTGACCATG
GTCTACGGTC CCATCGCGGC GATGCTGGTC GAACTGTTCC CCAGCCGCAT CCGCTACACC
TCGATGAGCC TGCCGTACCA CATCGGCAAT GGCTGGCTCG GCGGCCTGCT CCCGGCCATC
GGTTTTGCCA TGGTCGCGGC CAACGGGGAT ATCTACCACG GCTTCTGGTA CCCGGTGATC
GTGGCCGCCG CCACTTGCGT CATAGGCCTG GTGTTCCTGC CGGAGACGTA CAAGCGGAGC
ATCGACGACT AA
 
Protein sequence
MSNLEIADDF SALPKHHQAT HNEKLIITAS SLGTVFEWYD FYLYGLLTAI IAAKFLTGLN 
PTTSFIMALL VFAAGFIVRP FGALVFGRIG DMVGRRYTFI VTLLVMGLST FLVGCLPTYE
TVGVAAPIML VVLRMFQGLA LGGEYGGAAT YVAEHAPEGK RGLYTSWIQI TATAGLAMAL
LIVILVRSPV TGVGEEAFKD WGWRVPYLIS GLFLCVGLWL RLKLHESPVF QKMKDEGTSS
KRPLGEAFGE WKNLKIVLIA FFGAIAGQAV VWYTGQLYAM YFLEKMLKVD GLTANTLIIV
ALACATPFFL FFGWLSDKIG RKKIILAGCA LAALTMFPAF KALTEAANPA LAHAQANAPV
KVLANPAECS SQFDPVGANK FDTTSCDIVK NALAKAAVNY ENVAAPAGTI ASIRIGSTTI
VAPDPSKVSG DEKKAAIAAF AKQVTSDLET VGYPAKADPD QIDKPVVVAI LFYLVLLVTM
VYGPIAAMLV ELFPSRIRYT SMSLPYHIGN GWLGGLLPAI GFAMVAANGD IYHGFWYPVI
VAAATCVIGL VFLPETYKRS IDD