Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0533 |
Symbol | |
ID | 3918663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 577801 |
End bp | 580191 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640443263 |
Product | sulfatase |
Protein accession | YP_495814 |
Protein GI | 87198557 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCATAT ACAAATTCGC CATGAGCCGC CGAAACCGGC ATAAAATGCG CGCTTTGCGT ATATTCACAG CCATATCATC GCCCCTGGCA CTTTGGTCGA GCACCGCGAC AGGACAAGAG GCCTTGCCCC AGGCGCCTCA GCCGTTTGCG GGCAGCATTG GCCGCACATA TGTCGATTCG GTGCCCTCCT TCCCGAAGCC TGTCACCGCT CCGGCCGGCG CGCCCAACGT CGTCCTGATA ATGACGGATG ACGTGGGATT CGGGGCCGCC TCGACCTTCG GCGGGCCGGT ACCCACCCCC AACCTCGACC GCCTCGCGTC GCGCGGAATC GTGTTCAATC GCTTCCACAC CAAGGCGATG TGCTCGCCGA CGCGGGCATC GTTGCTGACA GGTCGCAACC ACCATGCCGT CGACAACGGC ACGGTCGCCA ACCTGTCCAC CGGCTTTCCG GGATACGACA ACAACCTGCC GAAAAGCGCA GCCACGGTCG CCGAGATCCT TCGCCAGCAC GGGTGGAACA CGGCGATGAT CGGCAAGCAC CATAATACGC CGGAGCCGTT CGTCTCGCCC GCCGGACCGT TCGACCTTTG GCCCACCGGC CTCGGCTTCG AATATTTCTA CGGCTTCATG GCGGCCTCCA CGAACCAGTT CAGCCCCGCG CTCTATCGCA ACACCAGCCC CATCCCGACA TTGCGGGATG GCGTGCTCGA CAAGGCGCTG GCCGACGACG CGATCGGCTG GATTCACGCG CAGAAGGCCG CAGCGCCCGA CAAGCCGTTC TTCCTCTATT ACGCGACCGG TTCCGCCCAT AACCCGCTGC AGGCTCCGGC CGACTGGATT GCGAAGTTCC GCGGCCGGTT CGACAATGGC TGGGACGCCG TGCGCAAGGG CACGGTCGAC CGCCAGCGCA AGCTCGGCAT CGTTCCGCGC ACCACCAAGG ATACCACCCG GCCCGACGAA ATTCCGGCTT GGAGCACGCT TACGCCCGAG CAGCGGCGGG TCAACGCCCG GCTCATGGAA GTCTATGCCG GCATGCTGTC CTACCAGGAC GCGCAGATCG GCCGGATGCT CGACGAACTC GATCGCATGG GCGAGGCGGA CAACACGCTG GTCATGTTCA TCGAGGGCGA CAATGGCGCC GCGCCCGAGG CGGGACCGGA CGGGCAGTCG AATCCGATGG CGGTCTTCGC CAACGGATTC AAGGAGGACG CATCCTCGCT GGCAGCGCAG CTCGACAAGC TTGGCGGGCC GGATGCGGTT GCCGGCATGG GATGGGGCTG GGCCTGGGCG ACCAACGCGC CGTTCAAATG GTTCAAGCAA TACGGATCGC ACCTTGGCGG CACGCGCAAC CCGCTGGTGG TCTCGTGGCC AAAGGGCATT TCCGGGCGCG GCATCCGCTC GCAGTTCACC GATGTGGTCG ACGTGATGCC CACGATCCTC GATCTTGCCG GCGTGCAGAT CCCCGACAGC GTCAATGGCG TGAAGCAGCA AGCGGTCGAC GGCATAAGCT TCCGCTACAC GCTGGATGCT CCCGATGCGC CGGAACGCCG CCACACCCAG TACTTCGAGA TGATGGGCAA TCACGGCATC TACCACGATG GCTGGATGGC GAGCACCACG CCGGTCAACC GGTTGCGAAG CAAGCCGGAC CATCCAGTCC TGCCGACGGA CTACAAGTGG GAACTCTATA ACCTCACCCG CGATTATTCC CAGGCCAACG ACCTCGCTGC GAAACACCCG GAGAAGCTGG CGGAACTGAA GGCCCTCTTC GAAGTCGAAG CGCGGCGAAA CAATGTCTAC CCGCTGGACG ACAGGCTTGA CATGGCGCGC TTCAGCGCAT CAGCCGCACT CGTGCCGAAG CGCAAGCGGT ACGTCTATTG GGGCGAGGTC ACGCTTCCGG CGGCGACATC CGCACCGATC TTCAACCGGG GCTTCACGCT CGACGCGCAA GTCGACGTGG CATCGAGCCA GGGCACCGGT CCCCTTCTGG CAATCGGCGG GAAGTTCGCA GGGTGGTCGT TCTACCTGGT GGATGGCCGA CCGGCCGTGA CAGTCGCGAC GTCGCAGCGG CCCGAGGATC ATTTCAGGGT GGTCGCATCG CAGCCGGTCG CGCCGGGCGC GTCACGGATC GGGTTTTCCT TCCGTTACGA CGGTGGCCAC AACGCGGGCG GCGAGATGAT CATCACCGCC AACGGTAAGG AGATCGGGCG CGGTCGCATT CCCCGCACGC TGTCAAAGCT GGTGGAAATG ACCGACACCT TCGACATCGG TTTCGATGCC GATACACCGG TTACCGACGA CTACCCCAAG GGCAGTCATT TCCCCGGCAC CATCGCCAGG CTTGAAATCG TCCCCGGCGA TGCGGGTGCT CCGACGCCTG TGGAGCGGTA G
|
Protein sequence | MFIYKFAMSR RNRHKMRALR IFTAISSPLA LWSSTATGQE ALPQAPQPFA GSIGRTYVDS VPSFPKPVTA PAGAPNVVLI MTDDVGFGAA STFGGPVPTP NLDRLASRGI VFNRFHTKAM CSPTRASLLT GRNHHAVDNG TVANLSTGFP GYDNNLPKSA ATVAEILRQH GWNTAMIGKH HNTPEPFVSP AGPFDLWPTG LGFEYFYGFM AASTNQFSPA LYRNTSPIPT LRDGVLDKAL ADDAIGWIHA QKAAAPDKPF FLYYATGSAH NPLQAPADWI AKFRGRFDNG WDAVRKGTVD RQRKLGIVPR TTKDTTRPDE IPAWSTLTPE QRRVNARLME VYAGMLSYQD AQIGRMLDEL DRMGEADNTL VMFIEGDNGA APEAGPDGQS NPMAVFANGF KEDASSLAAQ LDKLGGPDAV AGMGWGWAWA TNAPFKWFKQ YGSHLGGTRN PLVVSWPKGI SGRGIRSQFT DVVDVMPTIL DLAGVQIPDS VNGVKQQAVD GISFRYTLDA PDAPERRHTQ YFEMMGNHGI YHDGWMASTT PVNRLRSKPD HPVLPTDYKW ELYNLTRDYS QANDLAAKHP EKLAELKALF EVEARRNNVY PLDDRLDMAR FSASAALVPK RKRYVYWGEV TLPAATSAPI FNRGFTLDAQ VDVASSQGTG PLLAIGGKFA GWSFYLVDGR PAVTVATSQR PEDHFRVVAS QPVAPGASRI GFSFRYDGGH NAGGEMIITA NGKEIGRGRI PRTLSKLVEM TDTFDIGFDA DTPVTDDYPK GSHFPGTIAR LEIVPGDAGA PTPVER
|
| |