Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3812 |
Symbol | |
ID | 4899133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 944619 |
End bp | 946268 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640114416 |
Product | sulfatase |
Protein accession | YP_001045664 |
Protein GI | 126464551 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0194754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.186688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCCAT TTCCGCTCAG CGTCGGACGG GCTCCGGCAA CGGCTCCCGC ACGGGGTCGC GCGCTTCCCT CCGTCTTCCA GATCAATCTT GCGGTGACGA GCTTCCTGCT GATCGCGGAC AATGCGACCT TCTGGGCCCG TGCGCTCGGG ATCTTCGGGC CGGGGGCGGA GCTTCTGCTG TTCGGGACGG CGATCTGGGC ACTCACCTTC TTCATCGTCG CCCTGTTCGG AACGGGGCCG CTGCGCCGGC CGATGCTGGC ACTGCTGCTG CTGCTGGGCG CCGCGACCGG TTACTTCCAG GATCGGCTGG GCGTGACGAT GGACCGGGAC ATGATCGAGA ACGTGATGAC GACGACGCTC TCCGAGGGGC GCCATCTCGT CACGCCGGGC TTCCTGATCC ATCTGGCGAT CTTCGGCCTC TTGCCCGCGC TGGTCGTGCT GCGCCTGCCC GTGCGCGAGG GCGGCTGGCG GGCGGCGGCG CGCAGCGGGC TCTCGGCGGT CATGGCGCTC GCCCTCGGGG CGGGTCTCGT GATGGCCGAC TTCAAGACCC TCTCGGCGGT CCTGCGCGAG CACAAGGAGC TGGTGTCGGC CTGGCAGCCC GCGATGCCGC TTGGCTCGGC GCTCCGCTAT GCCAAGCTGC GCGTGCGCAC CCATGATCTG ACGGTGGCCG CTCTCGGCAC CGATGCGCAG AAGGGGCCGC TCCTCGCGGC GGCGCCGAAG CCGGTTCTGA CCGTCCTCGT GGTGGGCGAG ACGGCGCGGG CGCAGAACTG GGGGCTGAAC GGCTACGAGC GCGACACGAC GCCCGAGCTG CGCGCCCGCG GCGTGGTGAA TTTTTCCGAC GTCGAGAGCT GCGGCACGGC GACGGCCGTG TCGATGCCCT GCATGTTCTC GAACCTCACG CGCAAGAGCT ACAGCCACGA GAAGGGCCTC GCGCAGGAGA ACCTGCTCGA CGTGCTGGCC CATGCGGGCG TGGCGGTGGA GTGGTGGGAC AACAACACGG GCGACAAGGA CATCGCGGCG CGCCTGCCGT CGGCGCGGGT GCCCGAGACG GTGGACGCCT GCGGCGAGGG CGAATGCACC GATGCGGCCT TCCTGCCGCT CCTCGACCGG ACGCTCGCCG GCATGAAGGA CGATACGGTG TTGGTGCTGC ACCAGATCGG CAGCCACGGG CCCGCCTATC ACCTGCGCTA TCCCAAGGCC TTCGAGCGGT TCTCGCCCGC CTGCCAGAGC GCGGAATTCT CGCGCTGCAC GGACGAGGAG ATCCGCAATG CCTATGACAA CAGCCTGGCC TTCACCGATC ATATCCTCGC CGCGATGATC GACCGGCTGG CCGCGCAGGA CCGCGTGATC CCGGCGCTGG TCTATGTCTC GGACCACGGC GAGTCGCTGG GCGAGAACGG GCTCTATCTG CATGGCGCGC CGCGCTTCAT GGCGCCCGAC ACGCAGACCC ATGTGCCGAT GGTGATGTGG CTCTCCGAGG CGTTCCGGTC GGCCATGCAC CTCGATGTGG GCTGCCTGCA GGCACAGGCG GCCGAGCCGG CGAGCCATGA CAACCTGTTC CATTCGGTGC TCGGGCTGAT GGACATCCGC ACCGAGGTGC GCGACACGAG CCTCGACCGT GTCTCGTCCT GCCGCGCTTC CGCTTCCTGA
|
Protein sequence | MFPFPLSVGR APATAPARGR ALPSVFQINL AVTSFLLIAD NATFWARALG IFGPGAELLL FGTAIWALTF FIVALFGTGP LRRPMLALLL LLGAATGYFQ DRLGVTMDRD MIENVMTTTL SEGRHLVTPG FLIHLAIFGL LPALVVLRLP VREGGWRAAA RSGLSAVMAL ALGAGLVMAD FKTLSAVLRE HKELVSAWQP AMPLGSALRY AKLRVRTHDL TVAALGTDAQ KGPLLAAAPK PVLTVLVVGE TARAQNWGLN GYERDTTPEL RARGVVNFSD VESCGTATAV SMPCMFSNLT RKSYSHEKGL AQENLLDVLA HAGVAVEWWD NNTGDKDIAA RLPSARVPET VDACGEGECT DAAFLPLLDR TLAGMKDDTV LVLHQIGSHG PAYHLRYPKA FERFSPACQS AEFSRCTDEE IRNAYDNSLA FTDHILAAMI DRLAAQDRVI PALVYVSDHG ESLGENGLYL HGAPRFMAPD TQTHVPMVMW LSEAFRSAMH LDVGCLQAQA AEPASHDNLF HSVLGLMDIR TEVRDTSLDR VSSCRASAS
|
| |