Gene RPD_1117 details


Gene Information

Locus tag: RPD_1117
Symbol:
ID: 4021593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1271529
End bp: 1272494
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 67%
IMG OID: 637961309
Product: PDZ/DHR/GLGF
Protein accession: YP_568256
Protein GI: 91975597
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
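
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1271529
end_bp = 1272494
gene_length_bp = 966
protein_length_aa = 321


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # prints "966 321"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.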


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0129707
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCCT TGCCCGAATG GAATGTGCCG GCCGCGATCC GGCCGCGTGC CGCTGACTTT 
CCGTTCGATC TCGAACGCGC GTTGTCGTCG GTGATCGGGT TGCATTCGAT CATTCCGTCG
GACGCCTTCA CGGCGAACAC GCTCGGCACC GAGCGCGCCG GCAATTGCGT GCTGATCGAC
GACGGCCTGC TGCTGACCAT CGGCTATCTG ATCACCGAGG CGGAGACGGT CTGGCTTCAT
CTCGGCGACG GGCGGGTGGT CGAGGGCCAT GCGCTCGGCT TCGATGCGGA GAGCGGGTTC
GGTCTCGTGC AGGCGCTCGG CCCGATCGAT CTGCCGCCGC TGGCGCTCGG CAATTCCGGT
GCGGCCAAAG CCGGCGATCG CGTGGTGATC GCCGGCGCCG GCGGACGAAC GCGATCGGTC
GCGGGTCGGA TCGCCACAAG GCAGGAATTC GCCGGCTATT GGGAATATCT GCTCGACGAC
GCGATCTTCA CCGAACCGTC GCACCCGAAT TGGGGCGGCG CCGGGCTGAT TTCGGCGACG
GGCGAACTCA TCGGCATCGG CTCGCTGCAG ATCGAGCGCA GCGGCACCGA CGCGCATTAC
AACATGATCG TGCCGATCGA TCTGTTGAAG CCGGCGCTCT CCGATCTGCG CAAATTCGGT
CGCGTCGACA AGCCGCCGCG GCCGTGGCTC GGCCTGTATT CGACCGAGAT CGAGGGCAAG
ATCGTCGTGG TCGGAATCGC GCCGAAGGGC CCGGCCGCCC GCGCCGAACT GAAGTCCGGC
GACGTCATCC TCGCGGTCGC CGGCGAGAAG GTCAGCAGCG AGGGCGAGTT CTATCGCAAG
ATCTGGGCGC TCGGCACCGC GGGCGTCGAG GTCCCGCTGA CGCTGTTCAG CGGCGGCGCG
ACCTTCGACG TCGTGCTGCA CTCATCCGAC CGCGCCAAAT TCCTCAAGGC CCCGCGGCTG
CACTGA
 
Protein sequence
MPSLPEWNVP AAIRPRAADF PFDLERALSS VIGLHSIIPS DAFTANTLGT ERAGNCVLID 
DGLLLTIGYL ITEAETVWLH LGDGRVVEGH ALGFDAESGF GLVQALGPID LPPLALGNSG
AAKAGDRVVI AGAGGRTRSV AGRIATRQEF AGYWEYLLDD AIFTEPSHPN WGGAGLISAT
GELIGIGSLQ IERSGTDAHY NMIVPIDLLK PALSDLRKFG RVDKPPRPWL GLYSTEIEGK
IVVVGIAPKG PAARAELKSG DVILAVAGEK VSSEGEFYRK IWALGTAGVE VPLTLFSGGA
TFDVVLHSSD RAKFLKAPRL H
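
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases of the gene. This is a minimal sketch rather than the method used to produce the record: it builds the codon table by hand (the amino acid assignments of translation table 11 are the same as those of the standard code) and, as a simplifying assumption, does not treat alternative start codons specially, which is harmless here because the gene starts with ATG. The helper names `translate` and `gc_fraction` are hypothetical.

```python
# First line of the gene sequence in this record, with spacing removed.
dna_prefix = (
    "ATGCCTTCCT TGCCCGAATG GAATGTGCCG GCCGCGATCC GGCCGCGTGC CGCTGACTTT"
).replace(" ", "")

# Codon table in the conventional TCAG ordering: first base varies slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = dna.upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate(dna_prefix))  # prints "MPSLPEWNVPAAIRPRAADF"
print(gc_fraction(dna_prefix))
```

The translated prefix matches the first 20 residues of the protein sequence listed above. Applying the same two functions to the full 966 bp sequence would allow the Protein Length and GC content fields to be checked in the same way.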