Gene RPD_2522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2522 
Symbol 
ID4023013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2821921 
End bp2824290 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content59% 
IMG OID637962715 
Productsulfatase 
Protein accessionYP_569653 
Protein GI91976994 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCT TGCGAAAGAC GCTAGCAGGC GCCGTCAGCC TATCGCTGGC GGTATTGTTC 
TCGCCGATCG TCAATGCCCA GGAAATCCTG CCGTTTCCTC CGGCGCCATC GGGCTCCGTT
GCCGGCCTGA CGATCCAGGA CTCGGTTTAC AGCAAACGCG TCGAACCAAA ACGGCTCGCC
GATGGTGCAC CGAACATTCT GATCATCCTG ATGGACGATG TCGGCCCGGC GACGCCGTCG
ACCTATGGCG GCGAAATCAA TACGCCAACG CTTGATCGCG TCGCCAGCAT GGGTGTCTCC
TATAATCGCT TCCACTCCAC GGCGATGTGC TCGCCGACGC GTGCGTCTCT GCTCACCGGA
CGCAACCACA CCTTCGTCGG AAATGGCCAG ATCGCCGCGA TTGCCAATGA TTTTGACGGG
TTCAGCGGGA CTATCCCAAA ATCATCCGCG ACGGTTGCAG AGGTGCTGAA GAACTACGGC
TACAATACCG GAGCTTGGGG CAAGTGGCAC AACACTCCCG AGGAGCAGAT CACTTCCAAA
GGCCCCTTTG AATATTGGCC GACCGGTTAT GGCTTCGAAT ATTTCTACGG TTTCCTCGCC
GGCGAAGCCT CACAGTATGA ACCGACCCTG GTGCGCAACA CTACTCAGGT CACCGAAGAG
CCGCGAAAAG GCTACCACCT GACTGACGAT ATCGCGGCTG ACGCAATCAA ATGGCTGCGC
GAGCAGAAGG CTTACGCACC GGACAAGCCG TTCTTCATGT ACTGGGCGCC AGGCGCTTCG
CACGGCCCGC ATCAGATCAT GAAGGAGTGG GCCGATAAGT ATAAAGGCAA GTTCGACGAC
GGCTGGGACG CTTACCGCGA GCGCGTGTTC AAGCGCGCCA GGGAGAAGGG CTGGATCCCG
CAAAATTCTC AATTGACGCC GCGCCCTGAA TCGATGGCCT CCTGGGCTTC GATCCCCGAG
GATGAAAAGC CATTCCAGAG CCGCCTCATG GAAGTCTTCG CCGGTTTTAC CGAGCACGCT
GACTACAACG CAGGCCGTGT GATCGATGAG ATCAAGCGGC AGGGCAGGCT CGACAACACA
TTGATCTTCT ACATCTGGGG CGACAACGGA TCCTCGGCAG AGGGGCTTAA CGGCACCATC
AGCGAGCAAC TGGCACAGAA CGGCATTCCT ACCAAGATTT CGCAACACCT CGAGGCGTTG
AAGGAGCTTG GCGGGCTTGA AGCACTGGGT GGTCCAAAGA CCGACAATAT GTACCACGCG
GCGTGGGCCT GGGCCGGAAG CACGCCCTAT AAGTCGACCA AACTCGTCGG AGCACATTTC
GGAGGCACAC GCCAACCGAT GGCGGTTGCA TGGCCGAAGG GCATCAAGCC GGATCCGACC
CCTCGACCCC AATTCCATCA TGTGATCGAC ATCGTCCCAA CTATCTACGA TCAGCTCAAG
ATCACTCCGC CCCGCGTGGT CAATGGATTC GAGCAAGATC CGATCCACGG CGTGAGCATG
AGCTATACGC TCGCCGACGC CACGGCGCCG GGGCGACGGA AGACGCAGTT CTTTGACATC
ATGGCGAGCC GCGGAATCTA TCATGACGGC TGGTTCGCAA GCGCCCCCGG ACCGCGCGAG
CCTTGGGTAG GCGGGATACC CAAGGGCATT CGGGAATGGT CGCCTCTGAC CGACAAATGG
GAGCTTTACA ATCTCGACAA AGACTGGAGC CAGGCCAACG ATCTTGCTGC CGCCGAACCG
CAAAAACTGA CGGAGATGAA GTCGTTGTTT CTGATCGAAT CCACCAAGAA CAAGAACCTG
CCGATTGGCG GCGGTCTGTG GTCCACGGCG CTGTACCATC CGGAAGATGC TCCGGCCTCA
AATCTCACCG AATGGACGTT CGATGGTCCG ATGATGCGGA TGCCGGAATC CGCCGCGCCC
AAACTCGGCA AGGTGGACAG CCTTGTCAGC ATGGAGGTGG ACCTGCCAGC GAACCCGAAC
GGCGTGCTCT ATGCGCTGGC CGGATTCTCC GGCGGCGTCA CATGCTACGT CAAAGACGGC
ATTCTCAGCT ACGAGTTCAA CCTGTTCGAG ATCACGCGCA CCAAGATCAG GGCGAAGGAA
AAGCTGGCCG CCGGCAAGGC GAAGATCGAG GTCGAGTCGA AACTCGTCGA CAAAATCGGC
GGGGCGATGA ACGTCACGCT GAAGGTCAAC GGGAAGGCGG TAGCGCAGGG CCAAGTGCCA
GCGGCGATGT CGCTTCACTT CACGTCGAAT GCCACCTTCG ACATCGGTAG CGATCTCGAT
TCGCCGGTGT CGCTCGACTA TTTCGACAAG GCACCGTTTG CCTTCAACGG CACGATCGGA
ACGACGAAGG TCACCTATTT GAAGAAGTAG
 
Protein sequence
MEFLRKTLAG AVSLSLAVLF SPIVNAQEIL PFPPAPSGSV AGLTIQDSVY SKRVEPKRLA 
DGAPNILIIL MDDVGPATPS TYGGEINTPT LDRVASMGVS YNRFHSTAMC SPTRASLLTG
RNHTFVGNGQ IAAIANDFDG FSGTIPKSSA TVAEVLKNYG YNTGAWGKWH NTPEEQITSK
GPFEYWPTGY GFEYFYGFLA GEASQYEPTL VRNTTQVTEE PRKGYHLTDD IAADAIKWLR
EQKAYAPDKP FFMYWAPGAS HGPHQIMKEW ADKYKGKFDD GWDAYRERVF KRAREKGWIP
QNSQLTPRPE SMASWASIPE DEKPFQSRLM EVFAGFTEHA DYNAGRVIDE IKRQGRLDNT
LIFYIWGDNG SSAEGLNGTI SEQLAQNGIP TKISQHLEAL KELGGLEALG GPKTDNMYHA
AWAWAGSTPY KSTKLVGAHF GGTRQPMAVA WPKGIKPDPT PRPQFHHVID IVPTIYDQLK
ITPPRVVNGF EQDPIHGVSM SYTLADATAP GRRKTQFFDI MASRGIYHDG WFASAPGPRE
PWVGGIPKGI REWSPLTDKW ELYNLDKDWS QANDLAAAEP QKLTEMKSLF LIESTKNKNL
PIGGGLWSTA LYHPEDAPAS NLTEWTFDGP MMRMPESAAP KLGKVDSLVS MEVDLPANPN
GVLYALAGFS GGVTCYVKDG ILSYEFNLFE ITRTKIRAKE KLAAGKAKIE VESKLVDKIG
GAMNVTLKVN GKAVAQGQVP AAMSLHFTSN ATFDIGSDLD SPVSLDYFDK APFAFNGTIG
TTKVTYLKK