Gene Saro_0374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0374 
Symbol 
ID3918258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp406050 
End bp407546 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content63% 
IMG OID640443103 
Productsulphate transporter 
Protein accessionYP_495656 
Protein GI87198399 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCTCA ATTTCAATAA TTATCGCCGC CAGTGGTTCA CTGACGGGAG CACCGCCCGC 
CGGGACATTC TGGCGGGCAT CGTCGTCGCG CTTGCCCTTA TTCCCGAAGC GATCGGGTTT
TCGATCATCG CCGGCGTGGA TCCCCGGGTC GGGCTTTACG CCTCAGTTGC CATCGCCATC
ACGATCGCCC TGATCGGCGG GCGTCCCGGC ATGATTTCGG CTGCCACGGC GGCTGTTGCG
GTTCTCGTGG TGCCCCTCGT CCGGGACCAC GGCGTCGAAT ACCTGTTTGC CGCAACGATC
CTGATGGGTG TGATCCAGAT CGTCGCGGGG CTGCTGCGGC TCAACCTGGT GATGCAGTTC
GTGTCCCGGT CGGTCATCAC CGGCTTCGTC AATGCGCTCG CCATCCTGAT CTTCATGGCG
CAGCTGCCCC AACTCACCAA TGTCGGGTGG GAGACCTATG CCATGGTCGC CGCGGGCCTG
GCGATCATCT ACCTGCTGCC CCGCATCACC ACTGCGGTGC CTTCGCCGCT CGTCGCGATC
CTGGTGCTGA CGGCAGTCGC CATTGGCATG GGCATCGATG TGAACACGGT GGGCGACATG
GGCAAGCTTC CCGAAGGTCT GCCAAGTCTT GCGCTGCCGC AGGTTCCCCT GACCCTGGAA
ACGCTGCGCA TCATCCTGCC CTATTCGCTC ACCATGGCGG CCGTCGGCTT GCTGGAATCG
CTGCTAACCG CTCAGATCGT CGATGACATG ACCGACACGG ACAGCGACAA GCGCCAGGAA
TGTGCCGGGC AAGGCGGGGC CAATATCGTT GCTGCCCTGT TTGGCGGCAT GGGCGGATGC
GCGATGATCG GCCAATCGGT GATCAACGTG ACTTCGGGCG GGCGCACGCG GCTTTCGACC
TTCGTCGCCG GCGCGTTTCT GCTGTTCCTG CTCGCCGTGC TCGGGCCCTA TGTTGGCCGT
GTGCCGATGC CGGCGCTGGT TGCGGTGATG ATCATGGTCT CGATCGGCAC CTTCAGCTGG
AACTCGATTC CCAATCTGCG TCGCCATCCG CCGACTTCGT CGATCGTCAT GCTGACAACC
GTGATCGTAG TAGTTGCCAC GCACGACCTT TCGCTGGGCG TGCTGGCCGG CGTCTTGCTC
TCGGGCATCT TCTTTGCGGG CAAGGTCCAG CGCATGTTCA CGGTCGAACG CGAAGGTTCG
GCCGATGGCG TGCTGGCGAC CTACCGCGTG ACGGGCGAAA TCTTCTTCGC CTCGGTCGAG
CGCTTCACCC GGGTCTTCCA GGCGGAAGAC CAGGCAGAGC GCGTGGTCAT CGATGTGACG
AGGGCGCATT TCTGGGACAT TTCCGGCGTC GGCGCGCTCG ACAAGGTCGT CGCCCGGCTG
CGCCGCGACG GACGGCAGGT TGAAGTCATC GGCTACAACC AGGCCAGCGC CGACATCATC
GACCGCTTTG CCTTGCACGA CAAGACCGGC GTCGAACTGG GCGTGGTGCC GCATTAA
 
Protein sequence
MSLNFNNYRR QWFTDGSTAR RDILAGIVVA LALIPEAIGF SIIAGVDPRV GLYASVAIAI 
TIALIGGRPG MISAATAAVA VLVVPLVRDH GVEYLFAATI LMGVIQIVAG LLRLNLVMQF
VSRSVITGFV NALAILIFMA QLPQLTNVGW ETYAMVAAGL AIIYLLPRIT TAVPSPLVAI
LVLTAVAIGM GIDVNTVGDM GKLPEGLPSL ALPQVPLTLE TLRIILPYSL TMAAVGLLES
LLTAQIVDDM TDTDSDKRQE CAGQGGANIV AALFGGMGGC AMIGQSVINV TSGGRTRLST
FVAGAFLLFL LAVLGPYVGR VPMPALVAVM IMVSIGTFSW NSIPNLRRHP PTSSIVMLTT
VIVVVATHDL SLGVLAGVLL SGIFFAGKVQ RMFTVEREGS ADGVLATYRV TGEIFFASVE
RFTRVFQAED QAERVVIDVT RAHFWDISGV GALDKVVARL RRDGRQVEVI GYNQASADII
DRFALHDKTG VELGVVPH