Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2470 |
Symbol | |
ID | 3916789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2667360 |
End bp | 2669720 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640445225 |
Product | sulfatase |
Protein accession | YP_497740 |
Protein GI | 87200483 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCAA TCCGTTCCGC CCTTCGTTTG TCCGCCAGAA AACGCCGGCT CTGCGGAGGT TCCGCCATCG CCGCCTTACT TGCGCCAACC TTTGTCCTTG CAGAAACCGC CCCTGCCGAT CCGCTGGCCG GCAAGGTCGG GCGCACCGTG CAGGTAACAC ATGCTCCGGC ATGGCCTGCC CAACCGCAGG CCCCCAAGGG CGCGCCCAAC GTACTGGTCA TCCTTACCGA CGACGTGGGC TTTGGCGTGA CCAGCGCGTT TGGTGGGCCG GTGCCGACTG CCACTTTCGA CGCGCTTGCC CAGACGGGCC TCCGCTACAA CCGTTTCAAC ACCACCGCGC TGTGCTCGCC CACGCGCGCC TCGCTCCTGA CGGGCCGCCT GCCGCAAAAT GTGGACATGG GCAACGTCAC CAACCTGCCG ACCGGGTTTG ACGGCTACAC CACCGTTATC CCGCAGTCCG CAGCCACCGT CGCCGAAGTG CTGAAGGAAA ACGGCTTCAA CACCGCGATG TTCGGCAAGA GCCACCTGAC GCCCGAATGG CAGACGAGCG CGGCGGGCCC CTTCGACCAG TGGCCCACGG GCCTGGGGTT CGAATACTTC TACGGCTTCC TTTCGGCAGA CACCTCGATG TGGCAGCCGA GCATTGTCGA GAACACCCTT CCGGTCGAGC CACCTCACGA CGATCCAAAC TACTTCTTCG AGAAGGACAT GGCGGATCAC GCGATCAAGT GGATGCGCAC GCAGCAAGCC GCCGCGCCGG ACAAGCCGTT CTTCATGTAC TACGCTCCCG GCATTGCCCA CACTCCGCAC CATGCGCCCA AGGAGTGGCT GGAGAAGTTC CGGGGCAAGT TCGATCAGGG CTGGGACAAG CTGCGCGAGG AGACTTTCGC TCGGCAGAAG CGCATGGGCA TCATCCCCGC GAACTCCCGG CTTTCGCCTC GCCCGGCTAC GTTGCCCGCC TGGGATTCAT TGAATGCCGA CCAGAAGAAG CTCTATTCGC GCCTGATGGA GGCCTACGCA GCGAGCGTCT CGTATTCGGA TCATCAGACT GGTCGCCTGA TCGAAGCGAT CCGCGAGACC GGCGAACTGG ACAACACGCT GATCATCTAC ATCCAGGGCG ACAATGGCAG CAGCGCAGAG GGCGGGCCGG AAGGACTGCT CTACGAACAG TCAACGATCA CCGGCCGCAA GGAAACCATG GCCGAGAAGC TGTCGCACAT TGACGATATC GGCGGGCCGA AGCTGTACAA CCATTTCCCC GCAGCATGGG CCTGGGCAAC CAACTCGCCC TTCCCCTGGT GGAAGCAGGT CGCTTCGCAG GCAGGCGGCG TGCGCAACGG CATGGTCGTT TCCTGGCCCA AGCGCATCAC CGAGAGGGGC GTGATCCGCT CGCAATATGC GCACGTCAGC GACATTGCGC CGACCGTGCT CGATGCGGTC GGGATCAAGT CTCCCGACTT GATCAAGGGC ATCAGGCAGA AGCCGGTCGA CGGAATCAGC CTAGCCTACA CCTTCCAGCA GGGTTCTGCC CCGTCGGCCA GGCGCATGCA GATCTACGAG ATGATGGAGA ACTTCGGCAT CTACAAGGAC GGCTGGATGG CCGGCACGCT TCCCAAGCGC GCCGCCTGGG AAGCCGGCGC GGCGGGCGAC CGCAAGCTCA GCGTCGGGCC CGACGAGCGC GAATGGTCTC TGTTCAACCT CGATGCCGAC TTCACCACGG CCAAGGATCT CGCGAAGCAG AACCCCGCCA AGCTCAAGGA AATGCAAGAT CTGTTCTGGG CGGAAGCCGC AAGAAACAAC ATTCTGCCGA TCCACGACTA TAGCCAGGGA ACCGAAGGAC GGCCTTCGCT TGGCGCCTAT CGCTCCAGCT TCACCTACCG CCCGGGTACA GCCACGATCG CGGAGGACGC AGCGCCGCAT ACCATTGGCA AAAGTTTCCG CATCGACGCT GACGTGACTG CAGGCAGCAG CACGAACGGC GTGATGATCG CGCAGGGTGG TCGCTTCGGC GGCTACAGCT TCTACCTCAA GGACGGGCGT CCGACCTTCC ATTACAACGC CGTAGGCGCG GACGCCTTCA CCGTCGCCGC AGGGAGTGCC CTTGCCGAGG GCAAGCACAC GCTTTCCGCA GAGTTCACCG CCGACAAGAC CGTGCCGGGA ACGCCTGGAA CGCTGACGCT ATATGTCGAC GGCAAGGCGG TAGGTTCCAG CAGGCTGGGC CGCACGGTGG CCGGGTGGAT GTCGCACACC GAAGGCCTCG ACGTCGGCCT GGACCGGATA AGCGCGGTCA GTCCCGACTA CAGCGTGCAG GATAGTGCCT TTACCGGCGA GATCGACGAA GTGCGGGTGT CGATCAAATG A
|
Protein sequence | MRAIRSALRL SARKRRLCGG SAIAALLAPT FVLAETAPAD PLAGKVGRTV QVTHAPAWPA QPQAPKGAPN VLVILTDDVG FGVTSAFGGP VPTATFDALA QTGLRYNRFN TTALCSPTRA SLLTGRLPQN VDMGNVTNLP TGFDGYTTVI PQSAATVAEV LKENGFNTAM FGKSHLTPEW QTSAAGPFDQ WPTGLGFEYF YGFLSADTSM WQPSIVENTL PVEPPHDDPN YFFEKDMADH AIKWMRTQQA AAPDKPFFMY YAPGIAHTPH HAPKEWLEKF RGKFDQGWDK LREETFARQK RMGIIPANSR LSPRPATLPA WDSLNADQKK LYSRLMEAYA ASVSYSDHQT GRLIEAIRET GELDNTLIIY IQGDNGSSAE GGPEGLLYEQ STITGRKETM AEKLSHIDDI GGPKLYNHFP AAWAWATNSP FPWWKQVASQ AGGVRNGMVV SWPKRITERG VIRSQYAHVS DIAPTVLDAV GIKSPDLIKG IRQKPVDGIS LAYTFQQGSA PSARRMQIYE MMENFGIYKD GWMAGTLPKR AAWEAGAAGD RKLSVGPDER EWSLFNLDAD FTTAKDLAKQ NPAKLKEMQD LFWAEAARNN ILPIHDYSQG TEGRPSLGAY RSSFTYRPGT ATIAEDAAPH TIGKSFRIDA DVTAGSSTNG VMIAQGGRFG GYSFYLKDGR PTFHYNAVGA DAFTVAAGSA LAEGKHTLSA EFTADKTVPG TPGTLTLYVD GKAVGSSRLG RTVAGWMSHT EGLDVGLDRI SAVSPDYSVQ DSAFTGEIDE VRVSIK
|
| |