Gene RPB_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1014 
Symbol 
ID3909138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1163194 
End bp1164159 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content69% 
IMG OID637882907 
ProductPDZ/DHR/GLGF 
Protein accessionYP_484635 
Protein GI86748139 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCCT TGCCCGAATG GAACGTGCCG GCCGCGATCC GGCCGCGCGC TGCGGACTTT 
CCGTTCGATC TCGATCGCAC GCTGTCGGCG GTGCTCGGCG TGCACGCGAT CATTCCGCCC
GACGCCTTCA CCGCGAATAC GCTCGGCACC GAACGCGCCG GCAACGCCGT GCTGATCGAC
GACGGCCTGC TGCTGACCAT CGGCTATCTG ATCACCGAAG CAGAGACCGT CTGGCTGCAT
CTCGGCGACG GCCGGGCGGT CGAGGGGCAC GCGCTCGGCA TCGATTCCGA CAGCGGCTTC
GGCCTGGTGC AGGCGCTCGG CGCGATCGAC CTGCCGCCGC TGCGGCTCGG CCATTCGAGC
GCGGCCAAGA CCGGCGATCG CGTGATCGTC GGCGGCGTCG GCGGACGTAT CCGCTCGGTC
GCGGGGCGGA TCGCGGCGCG GCAGCCCTTC GCCGGCTATT GGGAATATCT GATCGACGAC
GCGATCTTCA CCGAGCCGTC GCACCCGAAC TGGGGTGGGG CCGGGCTGAT CTCCGCGACC
GGCGAACTGA TCGGCATCGG CTCGCTGCAG ATCGAGCGCA GCGGCAGCGA CGAGCACTAC
AACATGATGG TGCCGATCGA TCTCTTGAAG CCGGTGCTCG GCGATCTGCG CAAATTCGGC
CGGGTCGACA GACCGCCGCG GCCGTGGCTC GGGCTGTATT CGACCGAGAT CGAGGACCGG
ATCGTCGTGG TCGGGATCGC GCCGAAGGGC CCCGCCGCGC GCGCCGAGCT GAAATCGGGC
GATGTCATTC TCGCGGTCGC CGGCGACAAG GTCACGAGCG AGGCGGAATT CTATCGCAAG
GTCTGGGCGC TCGGCGCCGC CGGCGTCGAG GTCCCGCTGA CGCTGTTCAG CGGCGGCGCC
ACCTTCGACG TCGTGCTGCA TTCGGCGGAC CGCGCCAAAT TCCTCAAGGG ACCGCGGATG
CATTGA
 
Protein sequence
MPSLPEWNVP AAIRPRAADF PFDLDRTLSA VLGVHAIIPP DAFTANTLGT ERAGNAVLID 
DGLLLTIGYL ITEAETVWLH LGDGRAVEGH ALGIDSDSGF GLVQALGAID LPPLRLGHSS
AAKTGDRVIV GGVGGRIRSV AGRIAARQPF AGYWEYLIDD AIFTEPSHPN WGGAGLISAT
GELIGIGSLQ IERSGSDEHY NMMVPIDLLK PVLGDLRKFG RVDRPPRPWL GLYSTEIEDR
IVVVGIAPKG PAARAELKSG DVILAVAGDK VTSEAEFYRK VWALGAAGVE VPLTLFSGGA
TFDVVLHSAD RAKFLKGPRM H