Gene RPB_4333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4333 
Symbol 
ID3912146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4917658 
End bp4919295 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content71% 
IMG OID637886237 
Productalkaline phosphatase 
Protein accessionYP_487931 
Protein GI86751435 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3540] Phosphodiesterase/alkaline phosphatase D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0499289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGATG TACCGGCGGC AACTGCTTGC CGAATGGTAC CGTCGATGTC CCCTCACCCA 
CCCTCCTCCG GCCTCGCCTT CGACCGCCGC ATGCTGCTGC GCGGCGCACT CGGGGCCGGC
CTTGCCGGCG CGCTGCCGGC CGCCGCCGGC GCGGCGGAGC GCGTCGCCTT CAAGTCCGAC
CCGTTCACGC TCGGGATCGC GTCGGGTGAT CCGGCGCCCG ACGGCTTCGT GCTGTGGACC
CGACTGGCGC CGGAGCCGCT CGAGGCACGT GGCGGCATGA CGCCCGCGCC GGTCGAGGTC
ACCGTCGAGA TCGCCGACGA TGCGGCGATG GCGCGCGTGA TACGCACCGT CACGGCCACC
GCCCGGCTCG AACTCGCGCA CGCGGTCCAT GCCGAAATCG AAGGCCTCGC CCCGGGGCGC
GATTACTTCT ATCGCTTCCG CGCCGGCGAC GCCGAAAGCC CGATCGGCCG CGCCCGCACC
CTGCCCGCGC CCGGCGAGAC GCCGGCGCGG CTGCGTTTCG CTGCGGCCGG CTGCCAGCGC
TGGGAAGGCG GTTACTACAC CGCCTGGCGG GCGATCGCCG ACGATCAGCT CGACTTCGTA
TTCCACTACG GCGACTACAT CTACGAATAT GCCTTCGCGG CGCACGACAG GGACGGCAAG
CCCTACCCGC GCACGATGCC GCCGGACTTT CCGGTCTGCT TCACGTTGAC CGACTATCGC
CGCCGCTACG CGCTGTACAA GGGCGACCCG GACCTGCAGG CGGCGCACGC CTCCTGTCCG
TTCCTGTCGA GTTTCGACGA CCACGATGTC GTCAACAACT GGGCCGCCGA CAGCGACCCG
AAGCAGACCC CGCCCGACGC CTTTCTGTTC CGCCGGGCGA TGGCGCTGCA GGCCTGGTAC
GAGCATATGC CGGTGCGCCG CGCGCAGCTC CCGCGCGGCC CCGACGTGCG CGCCTATCGC
GGCTTCCGTT TCGGCACACT CGCCGACATC GCCGTGCTCG ACACGAGGCA GTATCGCTCG
CGCCAGCCCT GCGGCGACGG CTTCCGCGCG CATTGCGACG AGGCCGATGC GGCCGGTCGC
ACCATGCTCG GCGCAGCGCA GGAGCAGTGG CTGGCGCAGC GGCTGAAGCA GAGCAAAGCG
ACCTGGCAGG TACTGGCGCA GCAAGTGCAG TTCGCGCCGT TCGACTGGCG CGGCTTTCCG
TTCGTCAAGG AGACCGACGC GCCGGTACTC GATCTCGACA CCTGGAGCGG CGCCAGCGCC
GCGCGCGACC GCGTGACGGC GATGCTCGCC GAAGCGAACA TCGCCAATCC GGTGGTGCTG
ACCGGCGACC TGCACCGGGC GATGGCGCTG GACTTGCGCC GCGACTGGCG CGATCCCGAC
TCGCCGCGCA TCGGCGTCGA ATTCCTGTCG ACCTCGATCT CGTCGCCCGG TGACGGCCCG
GCGACGGCCG AGAAGCTGGC GGCGCTGTAT CGCAACAACC CGCATCTGAA ATTCTTCAGC
GACCGGCGCG GCTACACCCG CCACACCGTG ACGCCGGCGC GCTGGCAGGC CGATTTCCGC
ACGGTGGACA GCGTCGCGAC CCGCGGAGAG CCGGTCACAA CCGCCCAAAC CCTGATGGTC
GAAGCCGGCC GGGGCTGA
 
Protein sequence
MHDVPAATAC RMVPSMSPHP PSSGLAFDRR MLLRGALGAG LAGALPAAAG AAERVAFKSD 
PFTLGIASGD PAPDGFVLWT RLAPEPLEAR GGMTPAPVEV TVEIADDAAM ARVIRTVTAT
ARLELAHAVH AEIEGLAPGR DYFYRFRAGD AESPIGRART LPAPGETPAR LRFAAAGCQR
WEGGYYTAWR AIADDQLDFV FHYGDYIYEY AFAAHDRDGK PYPRTMPPDF PVCFTLTDYR
RRYALYKGDP DLQAAHASCP FLSSFDDHDV VNNWAADSDP KQTPPDAFLF RRAMALQAWY
EHMPVRRAQL PRGPDVRAYR GFRFGTLADI AVLDTRQYRS RQPCGDGFRA HCDEADAAGR
TMLGAAQEQW LAQRLKQSKA TWQVLAQQVQ FAPFDWRGFP FVKETDAPVL DLDTWSGASA
ARDRVTAMLA EANIANPVVL TGDLHRAMAL DLRRDWRDPD SPRIGVEFLS TSISSPGDGP
ATAEKLAALY RNNPHLKFFS DRRGYTRHTV TPARWQADFR TVDSVATRGE PVTTAQTLMV
EAGRG