Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0474 |
Symbol | |
ID | 3918603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 516979 |
End bp | 518454 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443204 |
Product | sulfatase |
Protein accession | YP_495756 |
Protein GI | 87198499 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTGCGGCC TTACGCGACC ACGCGCGGGT CCATTGCCAG CAGCAATCAC GGACCGCCAC ATGATCTCCT CCTTCAATCT CACCCGCCGC GCCACCCTTG GCGGAGCCGC CGCCACCATG GTTCTCGGCG CCGCGCCAGC GATTGCCAGC AAGCGCGCAA GGCGGCCGAA CATTCTCTAC ATCATGGCTG ACGACCTCGG TTACGCAGAC CTGTCCTGCT ATGGCCGGCG CGATTTCGAG ACGCCGGTGC TCGACAAACT GGCAGCGCAG GGACTTCGCT TCACCAATGC TTATGCCAAC AGCGCGGTCT GCACGGCTAC CCGTGTAGGT CTCATCACCG GGCGCTATCA GTATCGCCTG CCTGTGGGCC TGGAAGAACC ACTCGCGTTC CGACCCAACA TCGGCCTGCC GCCCTCGCAC CCGACACTGC CCTCGCTGCT CGCCAAGGCG GGCTATCGCA CTTCGCTCAT CGGCAAGTGG CACCTTGGAA GCCTTCCCGA CTTCGACCCG CTCAAGAGCG GTTACCAGAC CTTCTGGGGC ATCCGCAGCG GCGGCGTCGA CTATTACACC CACGCCACCA GCAACGGCCA GCCAGACCTG TGGGACGGAC CGACGCCGGT GGAAAGGGCG GGCTACCTGA CCGACCTCCT CGCCGACCGT GCCGTAAGCG AGATCCGCGA AGCCTCGTCT GGCGAGGCCC CATGGTTCAT GAGCCTGCAC TTCACCGCAC CGCACTGGCC ATGGGAAGGC CCTGACGACG CCAGTGAGTC CGCCCGCATT GCCAAGCTGA AGGACCCCAG CGCCCTGTTC CACTTCGATG GCGGCAGCGC GGCGATCTAT GCCGCCATGG TTCGCCGTCT CGACTATCAG ATTGGCCGTG TCCTCGAAGC GCTGAAGGCG AACCGGGCCG AACAGGACAC AATCGTCGTA TTCACCAGCG ACAACGGCGG CGAGCGCTTC TCCGACACCT GGCCGTTCAG CGGTCGCAAG ACCGAACTGC TCGAAGGCGG CCTGCGCATC CCCGCCATCG TGCGCTGGCC CGGCGTCACG AGAGCCGGCA CGACCAGCGA CGCACAGATC ATCTCGATGG ACTGGTTGCC CACGTTCCTT GCCGCTGCCG GCTCCGCCCC CGATCCCGGC CACCCCAGCG ACGGCGTCGA CGTTACGCCG GCTCTCGGTG GTGGATCGCT CGCCGAACGC GCCTTGTTCT GGCGCTACAA GAACCGCGCC CAGCGTGCCG TGCGGCGGGG CAACCTGAAA TATCTCAGGA TCGCCGAAAA CGAATTCCTG TTCGACGTGG CTGCCGACCC GCTCGAACGG GCGAACCTGA AGGACCGCCA GCCCGAGGAC TTCGCCGCGC TCAAGGCAGC GTGGGAAAAG TGGAACGCCA CCATGCTGCC GCTCGATCCC CAGTCCTACA CCCACGGCTT CCACGCCGAC GAGTTGGCCG ACCGCTTCGG AGTGCAGCCG GATTAG
|
Protein sequence | MCGLTRPRAG PLPAAITDRH MISSFNLTRR ATLGGAAATM VLGAAPAIAS KRARRPNILY IMADDLGYAD LSCYGRRDFE TPVLDKLAAQ GLRFTNAYAN SAVCTATRVG LITGRYQYRL PVGLEEPLAF RPNIGLPPSH PTLPSLLAKA GYRTSLIGKW HLGSLPDFDP LKSGYQTFWG IRSGGVDYYT HATSNGQPDL WDGPTPVERA GYLTDLLADR AVSEIREASS GEAPWFMSLH FTAPHWPWEG PDDASESARI AKLKDPSALF HFDGGSAAIY AAMVRRLDYQ IGRVLEALKA NRAEQDTIVV FTSDNGGERF SDTWPFSGRK TELLEGGLRI PAIVRWPGVT RAGTTSDAQI ISMDWLPTFL AAAGSAPDPG HPSDGVDVTP ALGGGSLAER ALFWRYKNRA QRAVRRGNLK YLRIAENEFL FDVAADPLER ANLKDRQPED FAALKAAWEK WNATMLPLDP QSYTHGFHAD ELADRFGVQP D
|
| |