Gene Saro_3436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3436 
Symbol 
ID5077585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp35890 
End bp37278 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content67% 
IMG OID640481160 
Productsulfatase 
Protein accessionYP_001165822 
Protein GI146275662 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGGCA TTCGGCGGCG CGAGGTCCTC GGCGGCATTT CGGCGACCGC GCTGCTTTCG 
GGTCAGGCGC TGGCCGTGAC CCGCAAGGCC GCGCCAGAGC GGCCCAATAT CGTTTTCATC
ATGGCCGACG ACCTCGGCTA TGCCGACACC TCGGCCACGG GTTCGCGTCA TATCCGCACG
CCGGCCATCG ACAGCATCGG CGCCGGTGGC GTCATGTTGC GCCAGGGCTA TTCCAGCACG
CCGATCTGTT CGCCGACGCG CACCGCGCTG CTGACCGGGT GCTACGCGCA GCGCTTTGCC
ATCGGGGTGG AGGAACCGCT CGGCCCCAAT GCCCCCGCGG GGATCGGCGT GCCGCTTGAC
CGGCCGACCA TCGCCTCGGT CATGAAGGCG CTTGGTTATC GCACCAGCCT TGTCGGCAAG
TGGCACCTCG GCGAACCGCC GGCGCACGGG CCCTTGAAGC ACGGCTACGA CCATTTCCTC
GGCATCGTCG AAGGCGGCGC CGACTATTTC GTGCACCGCA TGGTCATGAG CGGAAAGCCT
GCCGGTGTCG GCCTTGCCGA GGACGACGCG CAGACCGACC GCACTGGTTA TCTGACCGAC
ATCTTCGGCG ACGAGGCGGT GCGGGTGATC GAAGAGGGCG GCAACCAGCC CTTTTTCCTC
AGTCTCCACT TCACCGCGCC GCACTGGCCG TGGGAAGGGC GCGAGGACGA GAAGCTGGCA
CGCGCGCTGC CCAGTTCATT CCACTACGAA GGCGGCAATC TGGCGAAGTA TCGCGAGATG
GTCGAGACGA TGGACCAGAA CGTCGCCAAG GTGCTCGCCG CGATCGACCG CAGCGGCAAG
GCCGACAACA CCGTCGTCGT CTTCACCAGC GACAACGGCG GCGAGCGCTT CTCCGACACC
TGGCCTTTCG TCGGCCACAA GGGCGAAGTG CTGGAAGGTG GGGTGCGGGT GCCGCTAATG
GTGCGCTGGC CGCGCCGGAT CAAGGCGGGG AGCCGTTCCG AACAGGTCAT GGTCTCGATG
GACTTCCTGC CGACGCTGCT GGGCATGGCG GGCGGCGATG CGGCAAGGAT CGGTCGCTTC
GACGGCGCGG ACCTTTCCGC CCAGCTTGCC GGCGCCGCGC CGGTCACGCG CACGCTGTTC
TGGCGCTTCA AGGCCAGCGA GCAGGCGGCG GTGCGACAGG GCGACATGAA GTACTTGCGC
ATGGCGGGCA AGGAGTACCT TTTCGACCTG TCGCAGGACG AGCGGGAGCA GGCAAACCTC
GCCCCCGCGA ACCCGGACAA GGTCAACGCG ATGCGCGCGC TGTGGGACGA TTGGAACCGG
GAAATGATGC CCTACCGGGT CGACGGGTAC TCGCAGGACG CGCGCAAGAG TTTTTCCGAC
AGATACTGA
 
Protein sequence
MAGIRRREVL GGISATALLS GQALAVTRKA APERPNIVFI MADDLGYADT SATGSRHIRT 
PAIDSIGAGG VMLRQGYSST PICSPTRTAL LTGCYAQRFA IGVEEPLGPN APAGIGVPLD
RPTIASVMKA LGYRTSLVGK WHLGEPPAHG PLKHGYDHFL GIVEGGADYF VHRMVMSGKP
AGVGLAEDDA QTDRTGYLTD IFGDEAVRVI EEGGNQPFFL SLHFTAPHWP WEGREDEKLA
RALPSSFHYE GGNLAKYREM VETMDQNVAK VLAAIDRSGK ADNTVVVFTS DNGGERFSDT
WPFVGHKGEV LEGGVRVPLM VRWPRRIKAG SRSEQVMVSM DFLPTLLGMA GGDAARIGRF
DGADLSAQLA GAAPVTRTLF WRFKASEQAA VRQGDMKYLR MAGKEYLFDL SQDEREQANL
APANPDKVNA MRALWDDWNR EMMPYRVDGY SQDARKSFSD RY