Gene Saro_1886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1886 
Symbol 
ID3917107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1992136 
End bp1993641 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content66% 
IMG OID640444630 
Productmajor facilitator transporter 
Protein accessionYP_497160 
Protein GI87199903 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.156331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGAGCAT CACTGAACGC AATGCCCCGG CAGAAGCCAC GCCAGGATTT CCGGGGCCTG 
TGGAACATCA GCTTCGGCTA CTTCGGCATC CAGCTCGCCT TCGCGCTGCA AAACGCCAAC
GTCAGCCGCA TCTTCCAGTC GCTCGGCAGC GCGGTCGACG ATCTTGCGTT CCTGTGGATC
GCCGGGCCCG TTACCGGCCT GATCGTGCAG CCGCTGATCG GCTACTACTC CGACCGCACC
TGGGGACGCC TCGGGCGCCG CCGCCCCTAT TTCCTGGCCG GTGCCGTGCT TGCGGGGCTT
GCGCTAGTCG GCCTGCCCAA CACCGGCTTG CTCCTCCTCG CTGCGGCGTT CCTGTGGATG
CTCGATGCCT CCCTCAACGT GGCGATGGAA CCCTTCCGCG CCTTCGCTGG CGACATGACG
CCCGACGACC AGCGTGCGCA GGCCTTCGCC TTCCAGACCT GGTTCATCGG CGCGGGCGCG
GTCGTCGGCA GCCTTGCCCC GGCCCTGTTC AACTGGCTCG GCATCGCCAA CACCGCTCCG
GAAGGCATCA TTCCGCCCTC CGTGCGCTAC AGCTTCTACC TTGGCGCTGC CGCAATGGTT
CTCGCGGTGG GATGGACCGT GCTGCGCGTG CGGGAATACA GCCCCGACGA AATGGCCGCT
TTCGAAGCGG ATGCCACCGC CCCCATCGCT CCGGAACACG AACCTCTCGT TTACCCCCGC
TCCGGTCCAG CCTGGGTTGC CGTTGGCGCG GTGCTCGTTG CGATTGTCGC CGCGCTTGGA
CTGGATCGTC AGCTCTACGT CCTCGGCACG GGCCTCGCAG TGTTTGGCGC CATCCAGATC
GGCGTTCGCG CCACGGCGGC CAGAGGCGCG CTTGCACACA TCGTCAGCGA CCTTGCGCAA
ATGCCCGCGC AGATGAAGCA GCTGGCGCTC GCCCAGTTCT TCACCTGGAT CGGCTTCTTC
ATCGTGTGGA TCTACACCAC GCCCGTCGTT ACCGAACAGG CCTTCGGCGC GACCGACGTT
TCCAGCGCCG CCTACAACGA AGGCGCGGAC TGGGTCGGCG TGATGTTCGC GTTCTACAAC
GGGATCGCGG CTGTCTCGGC CTTCCTGCTG CCCGTGCTGG CGCGACGCAT CGGCAATGCC
AGGACGCATG CGGTCGGCCT CCTGTGCGGC GCCAGCGGCT TCCTCGGGCT GCTGCTGATC
CGCGATCCGT GGTGGCTGCT GCTGCCGATG GTCGGAATGG GGATCGCGTG GGCATCTGTC
CTGTCGATGC CCTACGTCAT CCTGACCCGC GTGCTGCCAG CGCGCAAGTT CGGCATCTAC
ATCGGAATCT TCAATTTCTT CATCGTCATC CCCCAGCTTG TCGTGGCGAC GCTGATGGGC
GGCATCATGC GCAGCTTTTT CCCGGGCGAA CCCCGCTGGA CGATGCTCGT GGCCGCGCTG
ATGATGGCCG CTGCCGCAGG GGCGATGATG CTGGTGCGCG AACACAGGAG GGATAGCGCG
GCATGA
 
Protein sequence
MGASLNAMPR QKPRQDFRGL WNISFGYFGI QLAFALQNAN VSRIFQSLGS AVDDLAFLWI 
AGPVTGLIVQ PLIGYYSDRT WGRLGRRRPY FLAGAVLAGL ALVGLPNTGL LLLAAAFLWM
LDASLNVAME PFRAFAGDMT PDDQRAQAFA FQTWFIGAGA VVGSLAPALF NWLGIANTAP
EGIIPPSVRY SFYLGAAAMV LAVGWTVLRV REYSPDEMAA FEADATAPIA PEHEPLVYPR
SGPAWVAVGA VLVAIVAALG LDRQLYVLGT GLAVFGAIQI GVRATAARGA LAHIVSDLAQ
MPAQMKQLAL AQFFTWIGFF IVWIYTTPVV TEQAFGATDV SSAAYNEGAD WVGVMFAFYN
GIAAVSAFLL PVLARRIGNA RTHAVGLLCG ASGFLGLLLI RDPWWLLLPM VGMGIAWASV
LSMPYVILTR VLPARKFGIY IGIFNFFIVI PQLVVATLMG GIMRSFFPGE PRWTMLVAAL
MMAAAAGAMM LVREHRRDSA A