Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3133 |
Symbol | |
ID | 4075005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 108763 |
End bp | 110277 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638004636 |
Product | sulfatase |
Protein accession | YP_611369 |
Protein GI | 99078111 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTGC CAAACATCCT CATTTTTATG GTCGATCAGT TGAACGGGAC CCTGTTCCCG GATGGCCCCG CAGAATGGCT GCACGCACCA AACATGAAGA AACTGGCCGC GCGGTCTACC CGGTTTCGCA ATTGCTATAC CGCCAGCCCG CTCTGTGCGC CGGGTCGGGC CAGTTTCATG TCCGGGCAGC TGCCGTCTGC CACGGGCGTC TACGACAACG CGGCGGAATT CGCCTCTTCA ATCCCGACGT ATGCCCATCA TCTGCGCCGC GCAGGCTATT ACACCTGCCT ATCGGGCAAG ATGCATTTTG TCGGCCCAGA TCAGCTTCAT GGCTTTGAAG AACGTCTGAC AACCGATATC TACCCACCCG ATTTCGGTTG GACCCCGGAC TATCGCAAAC CCGGCGAGCG CATCGACTGG TGGTATCACA ACATGGGGTC GGTCACCGGC GCCGGGGTGG CGGAGATTTC GAACCAGATG GAGTTTGATG ACGAGGTCGC CTTTCACGCG ACCCAAAAGA TCTACGACCT GGCGCGCGGC AAGGACGCCC GGCCGTGGTG CCTCACCGTC AGCTTTACGC ACCCCCATGA TCCCTATGTG ACTCGTAAAA AATACTGGGA TCTATACGAG GATTGCCCGC ATCTTATGCC GGAGGTCGCG GATCTCGGCT ATGAGAACCA GGATCCGCAC TCGAAACGGA TCTTTGACGC AAATGACTGG CGCAACTTTG ACATCACCGA AGAAGACATC CGCAGGTCGC GTCGCGCGTA TTTCGGCAAT ATCTCCTATC TCGACGACAA GATCGGCGAG GTCATGGAAG CGCTGGAAGG AACGCGTCAG GACAAGGATA CGATCATTCT CTTTGTCTCG GATCACGGCG ACATGCTGGG AGAGCGCGGC CTGTGGTTCA AGATGAGCTT TTATGAGGGG TCCTCACGCG TTCCGATGAT GATTTCAGCG CCCAATATGA CCCCTGGCCT GGTTTGCGAT CCGGTCTCCA ACATCGATGT CTGTCCAACG CTTTGCGATC TGGCAGGTGT GAGCATGTCC GAGGTAATGC CTTGGACCGC TGGGGAAAGC CTGGTCCCGC TTGGCCAAGG TGGCACGCGC AGCACGCCGG TGGCGATGGA ATATGCAGCC GAAGCCTCTT ATGCCCCGAT GGTCTCCTTG CGGTCGGGGC GCTACAAGCT CAATCTTTGT GCGCTTGATC CGGACCAGCT GTTTGATCTG GACGCCGACC CACATGAACG GGTGAATCTC GCCAAAGATC CCACCCACCA CGAGGCTTAT CAGGCGCTCA AGGCGATTGC GGCCGAGCGC TGGGATCTGG ATCGATTTGA CGCCGATGTG CGCGCCAGCC AGGCGCGGCG CTGGGTGGTA TATGAGGCGC TCCGCCAGGG CGGCTATTTC CCGTGGGATT ATCAACCCCT GCAAAAAGCG TCCGAACGCT ACATGCGCAA CCATATGGAT TTGAATGTGG TCGAAGACCA AGCCCGCTAC CCGCGCGGAG AATAA
|
Protein sequence | MTLPNILIFM VDQLNGTLFP DGPAEWLHAP NMKKLAARST RFRNCYTASP LCAPGRASFM SGQLPSATGV YDNAAEFASS IPTYAHHLRR AGYYTCLSGK MHFVGPDQLH GFEERLTTDI YPPDFGWTPD YRKPGERIDW WYHNMGSVTG AGVAEISNQM EFDDEVAFHA TQKIYDLARG KDARPWCLTV SFTHPHDPYV TRKKYWDLYE DCPHLMPEVA DLGYENQDPH SKRIFDANDW RNFDITEEDI RRSRRAYFGN ISYLDDKIGE VMEALEGTRQ DKDTIILFVS DHGDMLGERG LWFKMSFYEG SSRVPMMISA PNMTPGLVCD PVSNIDVCPT LCDLAGVSMS EVMPWTAGES LVPLGQGGTR STPVAMEYAA EASYAPMVSL RSGRYKLNLC ALDPDQLFDL DADPHERVNL AKDPTHHEAY QALKAIAAER WDLDRFDADV RASQARRWVV YEALRQGGYF PWDYQPLQKA SERYMRNHMD LNVVEDQARY PRGE
|
| |