Gene RPB_2723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2723 
Symbol 
ID3910516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3107292 
End bp3108320 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content68% 
IMG OID637884623 
ProductTPR repeat-containing protein 
Protein accessionYP_486336 
Protein GI86749840 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.26129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.431849 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCTT GGCGCGGTTG CGACACCGTG AACCGCAACC GCGCAGCCTG CGACAAAGCC 
GTCAACGCAG GCGCGAGGGA AACCGACATG CGATCTCACA TGGTCTCGTC GAATGTCGTG
GCGGTCGCGC TGTTCGCGTT GCTGTCGCAA AGTGCCGCGG CCGAGGACGC CGCGTGGAAA
GGCTGTGTCG GACTGCAGGG CTCTCCGGCG GAGCGCGTCG CCGCCTGCAG CACGGTGATC
GAGACCAACG CCGAAACTGG CCGGCGGCTG GCGGCGGCGT ATTGCAACCG GGGCCACGGC
CTGACCGAAC AGCGCAAGCT CGACGAGGCG ATGGCCGATC TCGAAGCGGC GGTGCGGCTC
GATCCGGGAT TCGCCTGCGC CTACAACAAT CGGGGCCGGG TCTATGCGTT CAAGGGAGAG
GGCGATCGCG CGTTGGCCGA CTACGACGAA GCAATCAGGC TCGATGCGAA ATTCGCGCTG
GCCTACAACA ACCGCGCCAT GATCTGGCTC GCCCGGCGCG ACCCCGACCG CGCACTGGAC
GACCTCTCCG CGGCGATCAC AGCGGACCCG GGGCTCGCCG TCGCTTACGG CAATCGCGGC
CACATCTACT ATCAGCAGCG CGACATGGCT CGTGCGCTGG CGGATTTCGA CGCCGAGATC
GCCTTGCGGC CCAACGTGCT CGCCTACATC AATCGCGGCA ATGTCCATCG CGACACCGAG
CAACTCGACC GCGCCGCCGC GGATTACGGC GAGGCGATCC GGCTGGCGCC GGAGGACGCC
CGCGGCTGGC GCAATCGGGC GCTGATCAAG CTGTACCAGG GCGACAACAA GGGCGGCCTC
GCCGACTACG ACAAGGCGCT ACGCTACGAT CCGGCCGACG TGTTCTCCTG GAACAACCGC
GCCCAGGCCA GGATGCGGCT CGGCGACCGC AGCGGCGCGA TCGCGGATTT CCGCAAGGCG
CTGGAATTGC GGCCGGGCCT GCAGACCGCG CGGGATTCGC TGAAGCGGCT CGGCGCTGCG
GTGAACTGA
 
Protein sequence
MPPWRGCDTV NRNRAACDKA VNAGARETDM RSHMVSSNVV AVALFALLSQ SAAAEDAAWK 
GCVGLQGSPA ERVAACSTVI ETNAETGRRL AAAYCNRGHG LTEQRKLDEA MADLEAAVRL
DPGFACAYNN RGRVYAFKGE GDRALADYDE AIRLDAKFAL AYNNRAMIWL ARRDPDRALD
DLSAAITADP GLAVAYGNRG HIYYQQRDMA RALADFDAEI ALRPNVLAYI NRGNVHRDTE
QLDRAAADYG EAIRLAPEDA RGWRNRALIK LYQGDNKGGL ADYDKALRYD PADVFSWNNR
AQARMRLGDR SGAIADFRKA LELRPGLQTA RDSLKRLGAA VN