Gene Information |
Locus tag | RPD_2516 |
Symbol | |
ID | 4023007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2815177 |
End bp | 2817564 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637962709 |
Product | sulfatase |
Protein accession | YP_569647 |
Protein GI | 91976988 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
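The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
START_BP = 2815177
END_BP = 2817564

length = gene_length_bp(START_BP, END_BP)
residues = protein_length_aa(length)
print(length, residues)  # prints "2388 795"
```

Under these assumptions the result matches the reported Gene Length (2388 bp) and Protein Length (795 aa).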
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.556421 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGACAGC GGCGCACAAC GACATGCAGC CTCGTCGCCA GCGTGGCGCT ACTTGCGCTG ATGGGGCCGG CCCTCGCGCA ACGCATTACC ACCGCTGCCG CCGCCGATGG CTCCGTGTTG CCATTTCCGG GCTCGCCGTC GGCCAGCATC GCGGCTCCCC GTTTGCAGGA TTCCAAACAC GTGCGGCGGG TCGAGCCGAG CCATCTGCGC AAGGATGCGC CGAATGTCCT GATCATCCTG CTGGATGACG TCGGCTTCGG TCAGGCCGCG ACCTTCGGCG GCGAGGTCAA CACGCCGACG CTGAGCAAGC TCGCCGAGCA GGGCGTGAGC TACAACGCCT TCCACACCAC GGCGATCTGC TCGCCGACCC GCGCGGCGCT GTTGACCGGC CGCAACCATC AGCGCGTCGG CAACGGCACC ATCGCCGAGC GCGCGGTCGA TTGGGACGGC TACACTGGCG TGATCCCGAA AAGCTCCGCC ACCATGGCTG AGGTGATGCG GCACTATGGT TACAAGACGG CGGCGATCGG TAAGTGGCAC AACACTCCCG CCGACCAGAC CACCTCGATG GGCCCGTTCG ATCGCTGGCC GACCGGACAC GGCTTCGACT ACTTCTATGG CTTCCTCGCC GGCGAGACCT CGCAGTGGGA GCCGCGGTTG GTCGAGAACA CCAACCAGAT CGAGCCGCCG CACAGTGAGA CGTATCACCT CAGCGAGGAC CTCGCGCAGC GCGGCATCGA TTGGCTGCGT CGCCACCAGG CGTTTGCTCC CGACAAGCCG TTCCTGCTGT ATTGGGCGCC CGGCGCCGGC CACGGGCCGC ATCAGATATT CAAGGAATGG GCCGACAAGT ACAAAGGCAA GTTCGACAAT GGCTGGGATG CCTATCGCGA CCGCGTGTTC GCGCGGCAGA AACAGCTCGG CTGGATTCCT GCCGACACCC AGCTGACGCC GCGCACCGCC TCGATGCCGT CCTGGGACAG CATTCCGGAA GCGCAGCGGC CGTTCCAGCG GCGGCTGATG GAAATCTTCG CCGGTTTCGT CGAGCATGTC GACGTGCAGG CGGGCAGGGT GGTGGACGAG CTGGAGCGTC TCGGCATTCG CGACAACACC ATCGTCATCT ACATATTCGG CGACAACGGC GCCAGCGCCG AGGGCCAGAA CGGCACGATC AGCGAATTGC TGGCGCAGAA CGGCATTCCG AACACGGTCG AGCAACAGCT TGCCGCGCTG GACCGGTTGG GCGGGCTCGA AGCGCTCGGT GGTCCGAAGA CTGACAGCAT GTATCACGCC GGCTGGGCCT GGGCCGGCAA CACGCCGTTC CAGCACACCA AGCTGGTCGC CTCGCATTTC GGCGGCACGC GAAATCCGAT GGTGATCTCC TGGCCGAAGG GCATCAAGCC GGACAAGACG CCGCGGCCGC AGTTCCACCA TGTCAACGAC ATTGCGCCGA CGATCTACGA ACTCGTCGGA ATCAAGCCGC CGAAAATCGT CGATGGCGTC GTGCAGGATC CGATCGATGG CGTGAGCCTC GCCTATACTT TCAATGATCC GAAGGTGCCG CCGCGCAAGA CCTCGCAGTA CTTCGACAAC AACGGCAGCC GGGCTATGTA TCAGGACGGA TGGATCGCGG CGACCTTCGG TCCGCTGGTG CCGTGGCTGC CCGGCGCGCC CGGCCTCGCC GAATGGGACT CGGCCAAAGA CAAGTGGGAA CTCTACCAGA TCGGCAAGGA TTTCTCCGAA GCCAACGATC TCGCCACGAA GGAGCCGCAG CGCTTAGCGA AGTTGCAGAA GGCCTTCGAT 
CAGCAGGCCA AGGCCAACAA GGTCTATCCG CTCGGCGCCG GCATCTGGCT TCGCCTGCAT CCGGAGGACC GGATCAAGAC GCCGTATACG CGCTGGCGGT TCGATGCCAC CACCACGCGG ATGCCGGAAT TCACCGCACC CGGCATCGGC CACGACAACA ACACCGTCAT CATCGACGCG GAGATCGGCG ACAACGCGTC GGGCGTGCTC TATGCGCTCG GTGGCGCGGG CGGCGGAGTC ACGCTCTACA TGGACCAGGG AGATCTGGTC TACGAATACA ACATGATGAT CATCGAGCGC TACATCGCAC GCTCCGCGAC CAAGATCACG CCCGGCAAGC ACCGCATCGA GGTGACGACC AGGCTCGAAA GCGCCAAGCC GCTGTCGGGA GCGGACGTCG TTATCAAGGT CGACGGCCAA GAGGTGGGGC GCACCACGGT GAAACGCACG GTGCCCGCCG CCTTCTCCGC CAGCGAGACC TTCGATGTCG GCGTCGATCT CGGCTCGACG GTGTCGACCG ACTATTTCGA CCGGCGGCCG TTCCGCTTCG ACGGCAAGAT CGAGAAGGTC GAGGTCAACT TGCAGTAA
|
Protein sequence | MRQRRTTTCS LVASVALLAL MGPALAQRIT TAAAADGSVL PFPGSPSASI AAPRLQDSKH VRRVEPSHLR KDAPNVLIIL LDDVGFGQAA TFGGEVNTPT LSKLAEQGVS YNAFHTTAIC SPTRAALLTG RNHQRVGNGT IAERAVDWDG YTGVIPKSSA TMAEVMRHYG YKTAAIGKWH NTPADQTTSM GPFDRWPTGH GFDYFYGFLA GETSQWEPRL VENTNQIEPP HSETYHLSED LAQRGIDWLR RHQAFAPDKP FLLYWAPGAG HGPHQIFKEW ADKYKGKFDN GWDAYRDRVF ARQKQLGWIP ADTQLTPRTA SMPSWDSIPE AQRPFQRRLM EIFAGFVEHV DVQAGRVVDE LERLGIRDNT IVIYIFGDNG ASAEGQNGTI SELLAQNGIP NTVEQQLAAL DRLGGLEALG GPKTDSMYHA GWAWAGNTPF QHTKLVASHF GGTRNPMVIS WPKGIKPDKT PRPQFHHVND IAPTIYELVG IKPPKIVDGV VQDPIDGVSL AYTFNDPKVP PRKTSQYFDN NGSRAMYQDG WIAATFGPLV PWLPGAPGLA EWDSAKDKWE LYQIGKDFSE ANDLATKEPQ RLAKLQKAFD QQAKANKVYP LGAGIWLRLH PEDRIKTPYT RWRFDATTTR MPEFTAPGIG HDNNTVIIDA EIGDNASGVL YALGGAGGGV TLYMDQGDLV YEYNMMIIER YIARSATKIT PGKHRIEVTT RLESAKPLSG ADVVIKVDGQ EVGRTTVKRT VPAAFSASET FDVGVDLGST VSTDYFDRRP FRFDGKIEKV EVNLQ
|
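The sequences in this record are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The sketch below is one hypothetical way to do that in plain Python: it strips the whitespace, computes the GC fraction, and checks for a terminal stop codon. The helper names are illustrative, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11. The demonstration input is only the first two blocks of the gene sequence, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop_codon(seq: str) -> bool:
    """True if the sequence length is a multiple of 3 and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, used only as a small demo.
demo = clean_sequence("ATGAGACAGC GGCGCACAAC")
print(len(demo), f"{gc_fraction(demo):.0%}")
```

Running the same helpers on the full gene sequence, after joining its wrapped lines, would let the reported GC content and gene length be checked against the sequence itself.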