Gene Saro_2470 details

Gene Information

Locus tag: Saro_2470
Symbol:
ID: 3916789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2667360
End bp: 2669720
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 63%
IMG OID: 640445225
Product: sulfatase
Protein accession: YP_497740
Protein GI: 87200483
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
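
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2667360
END_BP = 2669720
GENE_LENGTH_BP = 2361
PROTEIN_LENGTH_AA = 786


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate interval (assumed 1-based)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length are consistent")
```

Under these assumptions, the 2361 bp span corresponds to 787 codons, of which the last is the stop codon, giving the listed 786 aa.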


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGCAA TCCGTTCCGC CCTTCGTTTG TCCGCCAGAA AACGCCGGCT CTGCGGAGGT 
TCCGCCATCG CCGCCTTACT TGCGCCAACC TTTGTCCTTG CAGAAACCGC CCCTGCCGAT
CCGCTGGCCG GCAAGGTCGG GCGCACCGTG CAGGTAACAC ATGCTCCGGC ATGGCCTGCC
CAACCGCAGG CCCCCAAGGG CGCGCCCAAC GTACTGGTCA TCCTTACCGA CGACGTGGGC
TTTGGCGTGA CCAGCGCGTT TGGTGGGCCG GTGCCGACTG CCACTTTCGA CGCGCTTGCC
CAGACGGGCC TCCGCTACAA CCGTTTCAAC ACCACCGCGC TGTGCTCGCC CACGCGCGCC
TCGCTCCTGA CGGGCCGCCT GCCGCAAAAT GTGGACATGG GCAACGTCAC CAACCTGCCG
ACCGGGTTTG ACGGCTACAC CACCGTTATC CCGCAGTCCG CAGCCACCGT CGCCGAAGTG
CTGAAGGAAA ACGGCTTCAA CACCGCGATG TTCGGCAAGA GCCACCTGAC GCCCGAATGG
CAGACGAGCG CGGCGGGCCC CTTCGACCAG TGGCCCACGG GCCTGGGGTT CGAATACTTC
TACGGCTTCC TTTCGGCAGA CACCTCGATG TGGCAGCCGA GCATTGTCGA GAACACCCTT
CCGGTCGAGC CACCTCACGA CGATCCAAAC TACTTCTTCG AGAAGGACAT GGCGGATCAC
GCGATCAAGT GGATGCGCAC GCAGCAAGCC GCCGCGCCGG ACAAGCCGTT CTTCATGTAC
TACGCTCCCG GCATTGCCCA CACTCCGCAC CATGCGCCCA AGGAGTGGCT GGAGAAGTTC
CGGGGCAAGT TCGATCAGGG CTGGGACAAG CTGCGCGAGG AGACTTTCGC TCGGCAGAAG
CGCATGGGCA TCATCCCCGC GAACTCCCGG CTTTCGCCTC GCCCGGCTAC GTTGCCCGCC
TGGGATTCAT TGAATGCCGA CCAGAAGAAG CTCTATTCGC GCCTGATGGA GGCCTACGCA
GCGAGCGTCT CGTATTCGGA TCATCAGACT GGTCGCCTGA TCGAAGCGAT CCGCGAGACC
GGCGAACTGG ACAACACGCT GATCATCTAC ATCCAGGGCG ACAATGGCAG CAGCGCAGAG
GGCGGGCCGG AAGGACTGCT CTACGAACAG TCAACGATCA CCGGCCGCAA GGAAACCATG
GCCGAGAAGC TGTCGCACAT TGACGATATC GGCGGGCCGA AGCTGTACAA CCATTTCCCC
GCAGCATGGG CCTGGGCAAC CAACTCGCCC TTCCCCTGGT GGAAGCAGGT CGCTTCGCAG
GCAGGCGGCG TGCGCAACGG CATGGTCGTT TCCTGGCCCA AGCGCATCAC CGAGAGGGGC
GTGATCCGCT CGCAATATGC GCACGTCAGC GACATTGCGC CGACCGTGCT CGATGCGGTC
GGGATCAAGT CTCCCGACTT GATCAAGGGC ATCAGGCAGA AGCCGGTCGA CGGAATCAGC
CTAGCCTACA CCTTCCAGCA GGGTTCTGCC CCGTCGGCCA GGCGCATGCA GATCTACGAG
ATGATGGAGA ACTTCGGCAT CTACAAGGAC GGCTGGATGG CCGGCACGCT TCCCAAGCGC
GCCGCCTGGG AAGCCGGCGC GGCGGGCGAC CGCAAGCTCA GCGTCGGGCC CGACGAGCGC
GAATGGTCTC TGTTCAACCT CGATGCCGAC TTCACCACGG CCAAGGATCT CGCGAAGCAG
AACCCCGCCA AGCTCAAGGA AATGCAAGAT CTGTTCTGGG CGGAAGCCGC AAGAAACAAC
ATTCTGCCGA TCCACGACTA TAGCCAGGGA ACCGAAGGAC GGCCTTCGCT TGGCGCCTAT
CGCTCCAGCT TCACCTACCG CCCGGGTACA GCCACGATCG CGGAGGACGC AGCGCCGCAT
ACCATTGGCA AAAGTTTCCG CATCGACGCT GACGTGACTG CAGGCAGCAG CACGAACGGC
GTGATGATCG CGCAGGGTGG TCGCTTCGGC GGCTACAGCT TCTACCTCAA GGACGGGCGT
CCGACCTTCC ATTACAACGC CGTAGGCGCG GACGCCTTCA CCGTCGCCGC AGGGAGTGCC
CTTGCCGAGG GCAAGCACAC GCTTTCCGCA GAGTTCACCG CCGACAAGAC CGTGCCGGGA
ACGCCTGGAA CGCTGACGCT ATATGTCGAC GGCAAGGCGG TAGGTTCCAG CAGGCTGGGC
CGCACGGTGG CCGGGTGGAT GTCGCACACC GAAGGCCTCG ACGTCGGCCT GGACCGGATA
AGCGCGGTCA GTCCCGACTA CAGCGTGCAG GATAGTGCCT TTACCGGCGA GATCGACGAA
GTGCGGGTGT CGATCAAATG A
 
Protein sequence
MRAIRSALRL SARKRRLCGG SAIAALLAPT FVLAETAPAD PLAGKVGRTV QVTHAPAWPA 
QPQAPKGAPN VLVILTDDVG FGVTSAFGGP VPTATFDALA QTGLRYNRFN TTALCSPTRA
SLLTGRLPQN VDMGNVTNLP TGFDGYTTVI PQSAATVAEV LKENGFNTAM FGKSHLTPEW
QTSAAGPFDQ WPTGLGFEYF YGFLSADTSM WQPSIVENTL PVEPPHDDPN YFFEKDMADH
AIKWMRTQQA AAPDKPFFMY YAPGIAHTPH HAPKEWLEKF RGKFDQGWDK LREETFARQK
RMGIIPANSR LSPRPATLPA WDSLNADQKK LYSRLMEAYA ASVSYSDHQT GRLIEAIRET
GELDNTLIIY IQGDNGSSAE GGPEGLLYEQ STITGRKETM AEKLSHIDDI GGPKLYNHFP
AAWAWATNSP FPWWKQVASQ AGGVRNGMVV SWPKRITERG VIRSQYAHVS DIAPTVLDAV
GIKSPDLIKG IRQKPVDGIS LAYTFQQGSA PSARRMQIYE MMENFGIYKD GWMAGTLPKR
AAWEAGAAGD RKLSVGPDER EWSLFNLDAD FTTAKDLAKQ NPAKLKEMQD LFWAEAARNN
ILPIHDYSQG TEGRPSLGAY RSSFTYRPGT ATIAEDAAPH TIGKSFRIDA DVTAGSSTNG
VMIAQGGRFG GYSFYLKDGR PTFHYNAVGA DAFTVAAGSA LAEGKHTLSA EFTADKTVPG
TPGTLTLYVD GKAVGSSRLG RTVAGWMSHT EGLDVGLDRI SAVSPDYSVQ DSAFTGEIDE
VRVSIK