Gene RPC_3246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3246 
Symbol 
ID3971912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3594302 
End bp3596008 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content66% 
IMG OID637926357 
Productsulfatase 
Protein accessionYP_533107 
Protein GI90424737 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.320036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCTGC CGCCCAATCC AGTGCCATCG GTGGGCGCGG CGACCGCAGC CGCGCCGGCC 
AGCGTCTGGC GGATGCTGGC GGTGGTGCCG CCGCATCTCG CGGCATTGGC GTTGATGCTG
CACACCGAAA CCGATTTCAA CGGCCGGCTC GGCTTCCTGC TGGCCTGGGG GCTGCTGAAT
TTCGCCTGGA TCGCGGCGCT GCGCCGGCCG GCACTGTCCG GCGCGTTGTC GCTGACCATG
GTGGTGATGC TGGTGTTGCT GTCGCGGCTG AAGCACGACA TCGTGCAGAT GACCGCGAAC
TTCGTCGACC TGATGGTGAT CGACCGCGAC ACCATCGCCT TCCTGTTCAC GATCTTTCCC
GAGCTGCGCT GGTCGGTGAT CGGCACCGCG CTGCTGATCG TTCCCTTGAT GTACGCGCTG
TGGTGGCTCG ACCCGTTCCG GATCCGCCGC CAGCCGGCGC TGGCCGGCAC CATGCTGTTT
CTGGCCGGTC TGGTCGGCTA TGCGACGGCC TGGCCGGACG AGGCCTGGCG CGGCTATTAC
GACGACGGCT ACCTGTCGAA ATTCGCCCGC TCCGGGGTTA CCGCGGTGTC GGATTTCTTC
AACTACGGCT TCATGGAATC CGATGCGGTG GTCGCCGACC AGTTGAAGCT GCCGCTGGAG
GAGGCCTGCC ATCCGGTCGG GCGGCGGCCG AACATCATCA TGATCCACGA CGAGTCGTCG
TTCGACATTC GCGCCGCCGG GCAGGTGAAG GTGCCGGCGG GTTACGGCGC GCATTTCCAA
TCGTTCGACG GCAAGGCGCG CAAATTTCTC GCCGAGAGCA ATGGCGGGCC GAGCTGGTTC
ACCGAATACA ACGTGCTGGC CGGGCTGTCG TCGCGCTCGT TCGGGCGGTT TTCCTATTTC
GTCACCCGGA TCGCCTCCGG CCGGGTGGAG CGCGGCCTGC CGCTAGCGCT GCGCCGCTGC
GGCTACACCA CGACGGCGCT GTATCCCGCC AACGGCGCCT TCATGAGCGC GCGCAATTTC
CAGACCACCA CCGGCATGGA GCGGTTCTTC GACGCCCGTG ATCTCGGCTC CAGCCATGTC
GAGCCGGACA GCTTCTTCTA CGACAAGGCG CTCGGCTTGA TGCCGCAGCA CGGCGCGCCA
AAGCCGTTCT TCATGTTCGT CTATCTCGCC GCCAATCATT TCCCCTGGCA GACCAAATTC
CGCCCCGAGC TGACGCCTTC CTGGCGCGCG CTCGGCAACG CCCCGGTGGT CGAGGAATAT
CTGCGCCGCC AGGCCTTGAG CGTGACCGAC TACGCAGCCT TCCTGGCCGG CTTGAAGAAA
CACTATCCGG CGCAGCCGTT CCTGATCGTG CGGTTCGGCG ATCATCAGCC GGAATTCTCG
CCGCAACTGC TCGACCCCGA GCTCGACGAG GCCGGCCTCG GCAAGAAGCT GATGGCCTAT
GACCCGCGCT ACTACGCCAC CTACTACGCG ATCGACGCGA TCAACTTTCA GCCGGTGGAG
AGCCCGGCGA TCATGGACAC GATCGATGCC GCCTATCTGC CGCTGGTGAT CCAGGAGGCG
GCGGGGCTGC CGCTCGACCC CTCCTTTGCC GAGCAGAAGG CCATCATGCT CCGCTGCAGC
GGGCTGTTCT ACGGCTGCCG CAACGGCGCC GAGGCGCGGC GCTTCAACCG GCTGCTGATC
GACGCCGGCA TGATCAAGAA TCTGTAA
 
Protein sequence
MGLPPNPVPS VGAATAAAPA SVWRMLAVVP PHLAALALML HTETDFNGRL GFLLAWGLLN 
FAWIAALRRP ALSGALSLTM VVMLVLLSRL KHDIVQMTAN FVDLMVIDRD TIAFLFTIFP
ELRWSVIGTA LLIVPLMYAL WWLDPFRIRR QPALAGTMLF LAGLVGYATA WPDEAWRGYY
DDGYLSKFAR SGVTAVSDFF NYGFMESDAV VADQLKLPLE EACHPVGRRP NIIMIHDESS
FDIRAAGQVK VPAGYGAHFQ SFDGKARKFL AESNGGPSWF TEYNVLAGLS SRSFGRFSYF
VTRIASGRVE RGLPLALRRC GYTTTALYPA NGAFMSARNF QTTTGMERFF DARDLGSSHV
EPDSFFYDKA LGLMPQHGAP KPFFMFVYLA ANHFPWQTKF RPELTPSWRA LGNAPVVEEY
LRRQALSVTD YAAFLAGLKK HYPAQPFLIV RFGDHQPEFS PQLLDPELDE AGLGKKLMAY
DPRYYATYYA IDAINFQPVE SPAIMDTIDA AYLPLVIQEA AGLPLDPSFA EQKAIMLRCS
GLFYGCRNGA EARRFNRLLI DAGMIKNL