Gene RPD_2516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2516 
Symbol 
ID4023007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2815177 
End bp2817564 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content64% 
IMG OID637962709 
Productsulfatase 
Protein accessionYP_569647 
Protein GI91976988 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.556421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACAGC GGCGCACAAC GACATGCAGC CTCGTCGCCA GCGTGGCGCT ACTTGCGCTG 
ATGGGGCCGG CCCTCGCGCA ACGCATTACC ACCGCTGCCG CCGCCGATGG CTCCGTGTTG
CCATTTCCGG GCTCGCCGTC GGCCAGCATC GCGGCTCCCC GTTTGCAGGA TTCCAAACAC
GTGCGGCGGG TCGAGCCGAG CCATCTGCGC AAGGATGCGC CGAATGTCCT GATCATCCTG
CTGGATGACG TCGGCTTCGG TCAGGCCGCG ACCTTCGGCG GCGAGGTCAA CACGCCGACG
CTGAGCAAGC TCGCCGAGCA GGGCGTGAGC TACAACGCCT TCCACACCAC GGCGATCTGC
TCGCCGACCC GCGCGGCGCT GTTGACCGGC CGCAACCATC AGCGCGTCGG CAACGGCACC
ATCGCCGAGC GCGCGGTCGA TTGGGACGGC TACACTGGCG TGATCCCGAA AAGCTCCGCC
ACCATGGCTG AGGTGATGCG GCACTATGGT TACAAGACGG CGGCGATCGG TAAGTGGCAC
AACACTCCCG CCGACCAGAC CACCTCGATG GGCCCGTTCG ATCGCTGGCC GACCGGACAC
GGCTTCGACT ACTTCTATGG CTTCCTCGCC GGCGAGACCT CGCAGTGGGA GCCGCGGTTG
GTCGAGAACA CCAACCAGAT CGAGCCGCCG CACAGTGAGA CGTATCACCT CAGCGAGGAC
CTCGCGCAGC GCGGCATCGA TTGGCTGCGT CGCCACCAGG CGTTTGCTCC CGACAAGCCG
TTCCTGCTGT ATTGGGCGCC CGGCGCCGGC CACGGGCCGC ATCAGATATT CAAGGAATGG
GCCGACAAGT ACAAAGGCAA GTTCGACAAT GGCTGGGATG CCTATCGCGA CCGCGTGTTC
GCGCGGCAGA AACAGCTCGG CTGGATTCCT GCCGACACCC AGCTGACGCC GCGCACCGCC
TCGATGCCGT CCTGGGACAG CATTCCGGAA GCGCAGCGGC CGTTCCAGCG GCGGCTGATG
GAAATCTTCG CCGGTTTCGT CGAGCATGTC GACGTGCAGG CGGGCAGGGT GGTGGACGAG
CTGGAGCGTC TCGGCATTCG CGACAACACC ATCGTCATCT ACATATTCGG CGACAACGGC
GCCAGCGCCG AGGGCCAGAA CGGCACGATC AGCGAATTGC TGGCGCAGAA CGGCATTCCG
AACACGGTCG AGCAACAGCT TGCCGCGCTG GACCGGTTGG GCGGGCTCGA AGCGCTCGGT
GGTCCGAAGA CTGACAGCAT GTATCACGCC GGCTGGGCCT GGGCCGGCAA CACGCCGTTC
CAGCACACCA AGCTGGTCGC CTCGCATTTC GGCGGCACGC GAAATCCGAT GGTGATCTCC
TGGCCGAAGG GCATCAAGCC GGACAAGACG CCGCGGCCGC AGTTCCACCA TGTCAACGAC
ATTGCGCCGA CGATCTACGA ACTCGTCGGA ATCAAGCCGC CGAAAATCGT CGATGGCGTC
GTGCAGGATC CGATCGATGG CGTGAGCCTC GCCTATACTT TCAATGATCC GAAGGTGCCG
CCGCGCAAGA CCTCGCAGTA CTTCGACAAC AACGGCAGCC GGGCTATGTA TCAGGACGGA
TGGATCGCGG CGACCTTCGG TCCGCTGGTG CCGTGGCTGC CCGGCGCGCC CGGCCTCGCC
GAATGGGACT CGGCCAAAGA CAAGTGGGAA CTCTACCAGA TCGGCAAGGA TTTCTCCGAA
GCCAACGATC TCGCCACGAA GGAGCCGCAG CGCTTAGCGA AGTTGCAGAA GGCCTTCGAT
CAGCAGGCCA AGGCCAACAA GGTCTATCCG CTCGGCGCCG GCATCTGGCT TCGCCTGCAT
CCGGAGGACC GGATCAAGAC GCCGTATACG CGCTGGCGGT TCGATGCCAC CACCACGCGG
ATGCCGGAAT TCACCGCACC CGGCATCGGC CACGACAACA ACACCGTCAT CATCGACGCG
GAGATCGGCG ACAACGCGTC GGGCGTGCTC TATGCGCTCG GTGGCGCGGG CGGCGGAGTC
ACGCTCTACA TGGACCAGGG AGATCTGGTC TACGAATACA ACATGATGAT CATCGAGCGC
TACATCGCAC GCTCCGCGAC CAAGATCACG CCCGGCAAGC ACCGCATCGA GGTGACGACC
AGGCTCGAAA GCGCCAAGCC GCTGTCGGGA GCGGACGTCG TTATCAAGGT CGACGGCCAA
GAGGTGGGGC GCACCACGGT GAAACGCACG GTGCCCGCCG CCTTCTCCGC CAGCGAGACC
TTCGATGTCG GCGTCGATCT CGGCTCGACG GTGTCGACCG ACTATTTCGA CCGGCGGCCG
TTCCGCTTCG ACGGCAAGAT CGAGAAGGTC GAGGTCAACT TGCAGTAA
 
Protein sequence
MRQRRTTTCS LVASVALLAL MGPALAQRIT TAAAADGSVL PFPGSPSASI AAPRLQDSKH 
VRRVEPSHLR KDAPNVLIIL LDDVGFGQAA TFGGEVNTPT LSKLAEQGVS YNAFHTTAIC
SPTRAALLTG RNHQRVGNGT IAERAVDWDG YTGVIPKSSA TMAEVMRHYG YKTAAIGKWH
NTPADQTTSM GPFDRWPTGH GFDYFYGFLA GETSQWEPRL VENTNQIEPP HSETYHLSED
LAQRGIDWLR RHQAFAPDKP FLLYWAPGAG HGPHQIFKEW ADKYKGKFDN GWDAYRDRVF
ARQKQLGWIP ADTQLTPRTA SMPSWDSIPE AQRPFQRRLM EIFAGFVEHV DVQAGRVVDE
LERLGIRDNT IVIYIFGDNG ASAEGQNGTI SELLAQNGIP NTVEQQLAAL DRLGGLEALG
GPKTDSMYHA GWAWAGNTPF QHTKLVASHF GGTRNPMVIS WPKGIKPDKT PRPQFHHVND
IAPTIYELVG IKPPKIVDGV VQDPIDGVSL AYTFNDPKVP PRKTSQYFDN NGSRAMYQDG
WIAATFGPLV PWLPGAPGLA EWDSAKDKWE LYQIGKDFSE ANDLATKEPQ RLAKLQKAFD
QQAKANKVYP LGAGIWLRLH PEDRIKTPYT RWRFDATTTR MPEFTAPGIG HDNNTVIIDA
EIGDNASGVL YALGGAGGGV TLYMDQGDLV YEYNMMIIER YIARSATKIT PGKHRIEVTT
RLESAKPLSG ADVVIKVDGQ EVGRTTVKRT VPAAFSASET FDVGVDLGST VSTDYFDRRP
FRFDGKIEKV EVNLQ