Gene RPC_2738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2738 
Symbol 
ID3970291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2973052 
End bp2974335 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content67% 
IMG OID637925848 
Productsulfatase 
Protein accessionYP_532605 
Protein GI90424235 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.15562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGCTC AGCTGGCCAG CCTTCTGCAG CAGCACGCCT CGCGGCCCAT CGAAGCAGAG 
AACCAGATGT CGCTCAGATC GATCATCACC CTCGGTGTTG CCGTCGCGCT CGCCACGCTG
ATGGCTGTCC CGAACGTTGC GGCGCAGGCG CGCCCCGATC CCCGTCCGGC GCGTGTCATC
ATCTTCCACA TGGACAGCAT GCTCGCCGAC GCGCCGGAGC GCCTCAGCCT GAGCAACTGG
CTCGCGGTCG CGGCCGAGGG CACCCGCGCG AGCGAGATGA CCACCGTGAT TCCGTACCAT
CCGACGGATT CCGGCTACTT CGTGCTCAGC ACGACCTCGT TTCCCAACCC GACGACGGCC
GCCGGCACGC TCTTCCTCGA GCCAGCCATC GAACAGACCT ATCTCCAGCA TCGCTTCAAA
GGCCACACCG CCTTGATCGC CGGCTCGACG GCCTACCGGT CGATCGGCGA AGGATTCACT
TACACCAATC TGTCTCAGGC ACTCACCGAC GAGCGGGTTG TCGAGGAGGG GCTGCGGCAA
TTGCGCGAGC ACCCCGACCT GAGCTTCATG CGCCTGATAC TGCAGGACGC CAACGCCGTG
TTGCAGCGTG TCGGCTTCAC CCGGGAGAAT GTGCCCTGGC GCGGCGACGC CTACGGGGAG
GGCTCGCCCT ACTTTGCATC GCTGCGGCGG GCAGACGCGC TGCTCGGCCG CTTCGTCGAC
GAGTTGAAGC GGATGGACAA GTACGAGGAC ACGCTGCTGG TGCTGATGCC CGATGGCGCC
GCCCGCGGCG GCTGGCACGG CCCGCAGCAG GAAGAGAGCT GGCGCTTGCC CTTCGCACTA
CGCGGTCCGG GAATCGCGAA ACGGCGTGTG ATCGGCTATG CCGAGAATAT CGACGTGGCG
CCGACCATCG CCGCCATGAT GGGTGTCGAG CCGCCGAACG CCGACGGAGC CTCGGGACGC
GTCCTGACCG AGGTCATGGC GGGGCAACCC GCGACGGCGG CCGGCGGCGC GCGCCGCATC
GAGCGCCTCA ACCGCCAACA CAAAGAATAC TTGCGTCTCA CCGGCTGGAT GCAGGTTCAT
GCCGGCCGCT ACCCGCTGCT CGACCTGGCG TGGATGGCGT CGCACAACCG GCTGGTACAA
CCGACGCGGT TCTGGGACCT TTCCAGCATC GACGAGTGGC GCCGCGCCGG GAGCTTCGAT
CGAATGCTCG CGGACAACGA GGCTGCGCTC GTCGCTCTTC GCGATGCTCT CGCGCGATCG
GGCGCGCCGG CTCTGCCGGA TTGA
 
Protein sequence
MSAQLASLLQ QHASRPIEAE NQMSLRSIIT LGVAVALATL MAVPNVAAQA RPDPRPARVI 
IFHMDSMLAD APERLSLSNW LAVAAEGTRA SEMTTVIPYH PTDSGYFVLS TTSFPNPTTA
AGTLFLEPAI EQTYLQHRFK GHTALIAGST AYRSIGEGFT YTNLSQALTD ERVVEEGLRQ
LREHPDLSFM RLILQDANAV LQRVGFTREN VPWRGDAYGE GSPYFASLRR ADALLGRFVD
ELKRMDKYED TLLVLMPDGA ARGGWHGPQQ EESWRLPFAL RGPGIAKRRV IGYAENIDVA
PTIAAMMGVE PPNADGASGR VLTEVMAGQP ATAAGGARRI ERLNRQHKEY LRLTGWMQVH
AGRYPLLDLA WMASHNRLVQ PTRFWDLSSI DEWRRAGSFD RMLADNEAAL VALRDALARS
GAPALPD