Gene Saro_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0474 
Symbol 
ID3918603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp516979 
End bp518454 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content66% 
IMG OID640443204 
Productsulfatase 
Protein accessionYP_495756 
Protein GI87198499 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTGCGGCC TTACGCGACC ACGCGCGGGT CCATTGCCAG CAGCAATCAC GGACCGCCAC 
ATGATCTCCT CCTTCAATCT CACCCGCCGC GCCACCCTTG GCGGAGCCGC CGCCACCATG
GTTCTCGGCG CCGCGCCAGC GATTGCCAGC AAGCGCGCAA GGCGGCCGAA CATTCTCTAC
ATCATGGCTG ACGACCTCGG TTACGCAGAC CTGTCCTGCT ATGGCCGGCG CGATTTCGAG
ACGCCGGTGC TCGACAAACT GGCAGCGCAG GGACTTCGCT TCACCAATGC TTATGCCAAC
AGCGCGGTCT GCACGGCTAC CCGTGTAGGT CTCATCACCG GGCGCTATCA GTATCGCCTG
CCTGTGGGCC TGGAAGAACC ACTCGCGTTC CGACCCAACA TCGGCCTGCC GCCCTCGCAC
CCGACACTGC CCTCGCTGCT CGCCAAGGCG GGCTATCGCA CTTCGCTCAT CGGCAAGTGG
CACCTTGGAA GCCTTCCCGA CTTCGACCCG CTCAAGAGCG GTTACCAGAC CTTCTGGGGC
ATCCGCAGCG GCGGCGTCGA CTATTACACC CACGCCACCA GCAACGGCCA GCCAGACCTG
TGGGACGGAC CGACGCCGGT GGAAAGGGCG GGCTACCTGA CCGACCTCCT CGCCGACCGT
GCCGTAAGCG AGATCCGCGA AGCCTCGTCT GGCGAGGCCC CATGGTTCAT GAGCCTGCAC
TTCACCGCAC CGCACTGGCC ATGGGAAGGC CCTGACGACG CCAGTGAGTC CGCCCGCATT
GCCAAGCTGA AGGACCCCAG CGCCCTGTTC CACTTCGATG GCGGCAGCGC GGCGATCTAT
GCCGCCATGG TTCGCCGTCT CGACTATCAG ATTGGCCGTG TCCTCGAAGC GCTGAAGGCG
AACCGGGCCG AACAGGACAC AATCGTCGTA TTCACCAGCG ACAACGGCGG CGAGCGCTTC
TCCGACACCT GGCCGTTCAG CGGTCGCAAG ACCGAACTGC TCGAAGGCGG CCTGCGCATC
CCCGCCATCG TGCGCTGGCC CGGCGTCACG AGAGCCGGCA CGACCAGCGA CGCACAGATC
ATCTCGATGG ACTGGTTGCC CACGTTCCTT GCCGCTGCCG GCTCCGCCCC CGATCCCGGC
CACCCCAGCG ACGGCGTCGA CGTTACGCCG GCTCTCGGTG GTGGATCGCT CGCCGAACGC
GCCTTGTTCT GGCGCTACAA GAACCGCGCC CAGCGTGCCG TGCGGCGGGG CAACCTGAAA
TATCTCAGGA TCGCCGAAAA CGAATTCCTG TTCGACGTGG CTGCCGACCC GCTCGAACGG
GCGAACCTGA AGGACCGCCA GCCCGAGGAC TTCGCCGCGC TCAAGGCAGC GTGGGAAAAG
TGGAACGCCA CCATGCTGCC GCTCGATCCC CAGTCCTACA CCCACGGCTT CCACGCCGAC
GAGTTGGCCG ACCGCTTCGG AGTGCAGCCG GATTAG
 
Protein sequence
MCGLTRPRAG PLPAAITDRH MISSFNLTRR ATLGGAAATM VLGAAPAIAS KRARRPNILY 
IMADDLGYAD LSCYGRRDFE TPVLDKLAAQ GLRFTNAYAN SAVCTATRVG LITGRYQYRL
PVGLEEPLAF RPNIGLPPSH PTLPSLLAKA GYRTSLIGKW HLGSLPDFDP LKSGYQTFWG
IRSGGVDYYT HATSNGQPDL WDGPTPVERA GYLTDLLADR AVSEIREASS GEAPWFMSLH
FTAPHWPWEG PDDASESARI AKLKDPSALF HFDGGSAAIY AAMVRRLDYQ IGRVLEALKA
NRAEQDTIVV FTSDNGGERF SDTWPFSGRK TELLEGGLRI PAIVRWPGVT RAGTTSDAQI
ISMDWLPTFL AAAGSAPDPG HPSDGVDVTP ALGGGSLAER ALFWRYKNRA QRAVRRGNLK
YLRIAENEFL FDVAADPLER ANLKDRQPED FAALKAAWEK WNATMLPLDP QSYTHGFHAD
ELADRFGVQP D