Gene RPD_1640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1640 
Symbol 
ID4022120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1841631 
End bp1844099 
Gene Length2469 bp 
Protein Length822 aa 
Translation table11 
GC content60% 
IMG OID637961835 
Productsulfatase 
Protein accessionYP_568778 
Protein GI91976119 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.139745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.861297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAT TGGCTTTGAA GCGATTGGTT TTGGTTGTTA CGCCACTGTT GTTGCTGGCA 
TCGTGGATCG GCTACGCAAG CGCTCAGCAG GCGGCTCCGA TCGACCCGGC TCCGATCAAC
CCGTCGATGC CCACCGATCG CTCGGTATTG CCGATTCCTG AACCGCAGTA CCCGCACAGC
ACGGTGTTCG ACGTCCGCAA CGCAACGCCG CCGCCACGGT TCGAAGTCAA GGCTCCGGCG
GGCGCGCCCA ACGTAGTCAT CGTTCTGATC GACGATATGG GTTTTGGCCA ATCCAGCGCA
TTCGGCGGAC CCGTCAGGAT GCCGACAGTC GAAGGCCTGG CCAATCAGGG ACTCCGCTAC
AATGAGTTTC ACACCACGGC GCTTTGCTCG CCGACACGCG CCGCGCTGCT CAGCGGACGC
AATCACCACA TCAACAACAT GGGCTCGATT ACGGAAACGG CGACGTCGTT TCCCGGGCAA
ACCGGGCAGC GCCCGAACAG CGTGGCATCG GTCGCAGAGA TTCTGCGGCT GAACGGCTAC
AGCACCGCCC ACTTCGGCAA GAACCACGAA ACCGCGGCAT GGGAGGTCAG TCCGTCCGGT
CCGACCGACC GCTGGCCGAC GCGTCAAGGG TTCGACAAGT TCTACGGATT CATGGGGGGC
GAGACCAACC AATGGGCGCC CCTTCTGTAT GACGGCATGG CCCAGGTCGA ACTACCAAAA
GACCCGAACT ATCATTTTAT GACCGACATG ACGAATCATG CGGTCGACTG GATGAAGTCG
CAGAAGGCGC TGACCCCGGA CAAGCCGTTC TTCATCTACT TCGCGCCCGG CGCCACACAC
GCGCCGCATC AGGTGCCGAA GGAATGGATC GCTAAATACA AAGGCAAATT CGATCAGGGC
TGGGACCAGC TTCGCGAAGA GACCCTGGCG CGGCAGATCA GGCTCGGCGT GGTGCCGGCC
GGCACCAAGC TCGCCCCGAA GCCCGAGCCG ATCAAGGATT GGGCCACGCT GAACGCAGAC
GAGAAGAAGC TGTTCGCGCG CCAGATGGAA GTCTTCGCCG GTTTCGGCGA ATACGCCGAC
ACCGAAATCG GCCGTCTGAT CGAGGCCATC AGGCAAACGG GTCAGCTCGA CAATACGCTG
ATCTTCTACA TCGTCGGCGA CAACGGCGCG AGCGCTGAAG GCGGCATGAA CGGCCTGTTC
AACGAGATGA CCTATTTCAA TGGCGCTCAG GAAACCGTTC AGGATGTTCT GAAGCACTAC
GACGAGCTGG GCGGCCCGAA CACCTATGGC CACTATGCAG CCGGTTGGGC GATTGCAGGC
GATACGCCGT TCACCTGGAC CAAGCAGGTG GCCTCCAGCT ACGGCGGTAC CCGTAACGGC
ATGGTGATCC ACTGGCCCAA GGGCATCTCC GCAAAGGGTG AGGTGCGCTC GCAATGGCAT
CACGTCATCG ATGTCGTGCC CACAATTCTC GAAGCGGCGA GTTTGCCGGA GCCGAGCGCC
GTCAACGGCA CGCCTCAACT GCCGATCGTC GGCAACAGCA TGGTGTATAC GTTCGCCGAC
CCGAAAGCGG CGAGCACGCA CAAGACCCAG TACTTTGAGA TCTTCGGCAA TCGCGCGATC
TACAGCGACG GATGGCTGGC CGGAACGGTT CACCGAGCGG CATGGGAAAC CAAGCCCCGC
AGGGCGCTCG AACAGGATGT CTGGGAACTC TACGATACGC GGTCGGATTT CAGCCTCGTC
AACGATCTGG CGGCGACGAA TCCCGACAAG CTGAAGGAGT TGCAGGATCT GTTCATGAAG
GAGGCGGAGA AGAACTCCGT CCTGCCACTC GACGATCGAA CCCTGGAGCG CACCAACGCA
GCTCTGGTCG GACGCCCGGA TCTGATGGCC GGTCGAACCA CGCTGACGGT TTACGAGGGA
ATGATCGGGA TGTCTGAAAA CGTCTTCATC AATCTCAAGA ACCGGTCTCA CACGGTCACC
GCCGAGGTGG ACGTTCCGAA GGCCAACGCC AACGGCGTCC TGATGGCCCA GGCCGGGCGA
TTTGGCGGCT GGAGCCTGTA TGTGAAGAAC GGCAAACCGG TTTACACCTA CAACTGGCTC
GGCCTGAAAC GGTTCAGTAT CGCCGGCAAA CAGCCGATAC CGGCCGGCAA AGCAACGATC
CGTTTCGAGT TTGTCTACGA CGGCGGGGGG CTCGGAAAGG GCGGCCTTGG CACCCTTCTG
GTCAACGGAA AACCCGCTGC TTCAGGTCGC ATCGATCAGA CCCAATGCTG CTTCTACTCG
GCCGACGAAG GCGCCGATGT CGGCGCCGAC GAAGGAACGC CCGTGACCGA AGACTACAAG
TCGCCGTTCA AATTCACCGG AAAGATCTCG TCAGTGACGA TCGAGCAGAA AGAGATGAAG
AAGACCGAAA GCGAGGACGC CGTTCAGGCT CGCAAGGCGG CGCTGTTGAA GAAGGGACTG
TCGGATTGA
 
Protein sequence
MKRLALKRLV LVVTPLLLLA SWIGYASAQQ AAPIDPAPIN PSMPTDRSVL PIPEPQYPHS 
TVFDVRNATP PPRFEVKAPA GAPNVVIVLI DDMGFGQSSA FGGPVRMPTV EGLANQGLRY
NEFHTTALCS PTRAALLSGR NHHINNMGSI TETATSFPGQ TGQRPNSVAS VAEILRLNGY
STAHFGKNHE TAAWEVSPSG PTDRWPTRQG FDKFYGFMGG ETNQWAPLLY DGMAQVELPK
DPNYHFMTDM TNHAVDWMKS QKALTPDKPF FIYFAPGATH APHQVPKEWI AKYKGKFDQG
WDQLREETLA RQIRLGVVPA GTKLAPKPEP IKDWATLNAD EKKLFARQME VFAGFGEYAD
TEIGRLIEAI RQTGQLDNTL IFYIVGDNGA SAEGGMNGLF NEMTYFNGAQ ETVQDVLKHY
DELGGPNTYG HYAAGWAIAG DTPFTWTKQV ASSYGGTRNG MVIHWPKGIS AKGEVRSQWH
HVIDVVPTIL EAASLPEPSA VNGTPQLPIV GNSMVYTFAD PKAASTHKTQ YFEIFGNRAI
YSDGWLAGTV HRAAWETKPR RALEQDVWEL YDTRSDFSLV NDLAATNPDK LKELQDLFMK
EAEKNSVLPL DDRTLERTNA ALVGRPDLMA GRTTLTVYEG MIGMSENVFI NLKNRSHTVT
AEVDVPKANA NGVLMAQAGR FGGWSLYVKN GKPVYTYNWL GLKRFSIAGK QPIPAGKATI
RFEFVYDGGG LGKGGLGTLL VNGKPAASGR IDQTQCCFYS ADEGADVGAD EGTPVTEDYK
SPFKFTGKIS SVTIEQKEMK KTESEDAVQA RKAALLKKGL SD