Gene Information |
Locus tag | Saro_3436 |
Symbol | |
ID | 5077585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 35890 |
End bp | 37278 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481160 |
Product | sulfatase |
Protein accession | YP_001165822 |
Protein GI | 146275662 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
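The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself. The function names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
START_BP = 35890
END_BP = 37278
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS (the record lists 'Is gene spliced: No').
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the table under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Because the gene is on the minus strand, the listed start and end describe the span on the replicon; the coding sequence itself reads from the end position back toward the start on the reverse complement.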
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGCA TTCGGCGGCG CGAGGTCCTC GGCGGCATTT CGGCGACCGC GCTGCTTTCG GGTCAGGCGC TGGCCGTGAC CCGCAAGGCC GCGCCAGAGC GGCCCAATAT CGTTTTCATC ATGGCCGACG ACCTCGGCTA TGCCGACACC TCGGCCACGG GTTCGCGTCA TATCCGCACG CCGGCCATCG ACAGCATCGG CGCCGGTGGC GTCATGTTGC GCCAGGGCTA TTCCAGCACG CCGATCTGTT CGCCGACGCG CACCGCGCTG CTGACCGGGT GCTACGCGCA GCGCTTTGCC ATCGGGGTGG AGGAACCGCT CGGCCCCAAT GCCCCCGCGG GGATCGGCGT GCCGCTTGAC CGGCCGACCA TCGCCTCGGT CATGAAGGCG CTTGGTTATC GCACCAGCCT TGTCGGCAAG TGGCACCTCG GCGAACCGCC GGCGCACGGG CCCTTGAAGC ACGGCTACGA CCATTTCCTC GGCATCGTCG AAGGCGGCGC CGACTATTTC GTGCACCGCA TGGTCATGAG CGGAAAGCCT GCCGGTGTCG GCCTTGCCGA GGACGACGCG CAGACCGACC GCACTGGTTA TCTGACCGAC ATCTTCGGCG ACGAGGCGGT GCGGGTGATC GAAGAGGGCG GCAACCAGCC CTTTTTCCTC AGTCTCCACT TCACCGCGCC GCACTGGCCG TGGGAAGGGC GCGAGGACGA GAAGCTGGCA CGCGCGCTGC CCAGTTCATT CCACTACGAA GGCGGCAATC TGGCGAAGTA TCGCGAGATG GTCGAGACGA TGGACCAGAA CGTCGCCAAG GTGCTCGCCG CGATCGACCG CAGCGGCAAG GCCGACAACA CCGTCGTCGT CTTCACCAGC GACAACGGCG GCGAGCGCTT CTCCGACACC TGGCCTTTCG TCGGCCACAA GGGCGAAGTG CTGGAAGGTG GGGTGCGGGT GCCGCTAATG GTGCGCTGGC CGCGCCGGAT CAAGGCGGGG AGCCGTTCCG AACAGGTCAT GGTCTCGATG GACTTCCTGC CGACGCTGCT GGGCATGGCG GGCGGCGATG CGGCAAGGAT CGGTCGCTTC GACGGCGCGG ACCTTTCCGC CCAGCTTGCC GGCGCCGCGC CGGTCACGCG CACGCTGTTC TGGCGCTTCA AGGCCAGCGA GCAGGCGGCG GTGCGACAGG GCGACATGAA GTACTTGCGC ATGGCGGGCA AGGAGTACCT TTTCGACCTG TCGCAGGACG AGCGGGAGCA GGCAAACCTC GCCCCCGCGA ACCCGGACAA GGTCAACGCG ATGCGCGCGC TGTGGGACGA TTGGAACCGG GAAATGATGC CCTACCGGGT CGACGGGTAC TCGCAGGACG CGCGCAAGAG TTTTTCCGAC AGATACTGA
|
Protein sequence | MAGIRRREVL GGISATALLS GQALAVTRKA APERPNIVFI MADDLGYADT SATGSRHIRT PAIDSIGAGG VMLRQGYSST PICSPTRTAL LTGCYAQRFA IGVEEPLGPN APAGIGVPLD RPTIASVMKA LGYRTSLVGK WHLGEPPAHG PLKHGYDHFL GIVEGGADYF VHRMVMSGKP AGVGLAEDDA QTDRTGYLTD IFGDEAVRVI EEGGNQPFFL SLHFTAPHWP WEGREDEKLA RALPSSFHYE GGNLAKYREM VETMDQNVAK VLAAIDRSGK ADNTVVVFTS DNGGERFSDT WPFVGHKGEV LEGGVRVPLM VRWPRRIKAG SRSEQVMVSM DFLPTLLGMA GGDAARIGRF DGADLSAQLA GAAPVTRTLF WRFKASEQAA VRQGDMKYLR MAGKEYLFDL SQDEREQANL APANPDKVNA MRALWDDWNR EMMPYRVDGY SQDARKSFSD RY
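The sequences above are printed in space-separated blocks of ten residues. As a rough sketch of how a reader might check the reported GC content against the gene sequence, the helper below strips the block spacing and counts G and C bases. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between blocks and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C in a block-formatted sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence listed above (30 bases).
example = "ATGGCTGGCA TTCGGCGGCG CGAGGTCCTC"
print(len(clean_sequence(example)))  # 30
print(gc_fraction(example))
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the 67% in the table; the database may round or compute the figure differently.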