Gene RPB_1085 details


Gene Information

Locus tag: RPB_1085
Symbol:
ID: 3910171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1245973
End bp: 1247751
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 64%
IMG OID: 637882978
Product: TPR repeat-containing protein
Protein accession: YP_484706
Protein GI: 86748210
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
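
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1245973
END_BP = 1247751
GENE_LENGTH_BP = 1779
PROTEIN_LENGTH_AA = 592


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from Start bp to End bp gives 1779 bp, and 1779 / 3 = 593 codons, which leaves 592 amino acids once the stop codon is excluded.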


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTTCTTC ACCGCTTCCG TCGTTCGATG TTTGTCGCCG TCACCGTCGC GGCGCTGCCG 
ATCGCGGGGC AGGCGCTGGC GCAGACTCCG GACCATCCGG GCGACAATTC CGCGCAGTTT
CCGACCAGCC AGGATCTGCG GTCGATGACC ACGGCGGGCA GCTATCTCGC CGCCCGCCAC
GCCAGCGTCG AGCGCGACGC CGCCTCGGCC GCGGCGTTCT ATCGGTCGGC GCTGCGCACC
GACCCGAAGA ACAACGAATT GCTCGACCGC GCCTTCATCT CTTCGCTGGC CGAAGGCAAT
ATCGAGGAGT CGGTCAAGCT CGCCGACCGG ATTCTCAAGA TCGACAAGAC CAACCGCGTG
GCGCGGCTGG TGATCGGCGT GCGCGATCTG AAGACCAAGA AATACGCCGC AGCGGTCCAG
AACGTGAATC TGTCGGTCCG CGGCCCGATC ACCGATCTGG TCGCGACGCT GCTGTCGAGC
TGGGCGATGG AGGGCGCCGG CGACGTCAAG GGCGCCGTCG CCAATATCGA CAAGCTCGCC
GGTCCGGAAT GGTATCCGAT CTTCAAGGAT CTGCATTCCG GCATGATGCT CGAGCTCGCC
AACAAGCAGA AGGACGCCGG CGTCCGCTTC GAGCGGGCCT ACAAGCTCGA CGATTCCGCG
CTTCGGGTGA CGGATGCCTA TGCGCGCTGG CTGTCGCGCA ACAAGGACGA CGGCTCCGCG
GTCGCGATCT ACGAGGGCTT CGACAAGAAG CTGTCGCGCC ATCCGTTGGT GTTGGAGGGA
TTGCGTGACG CCAAGGCCGG CAAGAAGCTG CCGCCGCTGG TCGACAGCCC GCAGGCCGGC
GCTGCCGAAG CGCTGTACGG TATCGGAGCG TCGCTGACCC GCCGTGGCGG CGAGGACCTC
GCGCTGGTCT ATCTGCAGCT CGCGCTGTAT CTGAAGCCCG ATCACGCGCT GGCGCTGCTG
GCGCTCGGCG ATCTGTACGA ATCGGTGAAG AAGCCGCAGA TGGCGGTGAA GGTCTACGAG
CGCGTGCCGG CGGATTCGCC GCTCAAGCGC AACGCCCAGA TCCAGCTCGC CACCGATCTC
GACGCGATCG ACCGCAGCGA GGAAGCGATC AAGATCCTGA AGACGGTTAT CGCCGAGGAC
GGCAAGGACC TCGAGGCGAT CATGGCGCTC GGCAACATCG AGCGCGGCCG CAAGAAGTTC
GCCGATTGCG CGGTCACCTA CAGCCAGGGC ATCGATGCGC TCACCGGCAC CGAGAAGAAC
AGCTGGGTCT ATTATTATTT CCGCGGCATC TGCGAGGAGC GTTCCAAGCA GTGGGCCAAG
GCCGAGGTCG ACATGAAGAA GGCGCTGCAG CTGCAGCCCG AGCAGCCGCA TGTTCTGAAC
TATCTCGGCT ATTCCTGGAT CGACCAGGGC ATCAATCTCG ACGAAGCGAT GAAGATGATC
AAGCGCGCCG TCGATCAGCG CCCCGACGAC GGCTACATCG TCGACTCGCT CGGCTGGGCT
TATTTCCGCA TCGGCAATTA CGAAGAGGCG GTGAAGACGC TGGAGCGCGC CATCGATCTG
AAGCCGGAAG ATCCGACCAT CAACGATCAC CTCGGCGACG CCTATTGGCG CGTCGGGCGA
ACGCTGGAGG CGCGCTTCCA GTGGGCGCAC GCCCGCGATC TCAAGCCGGA TCCGGAAGAG
TTGCCGAAGA TCGAGGCCAA GCTCGCCAAC GGTCTCCCGG AGGACACCTC GTCGGCGGCG
TCGGCGGACA AGAAAAAAGA CGACGACAAG GGCGGCTGA
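
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation such as GC content. The sketch below shows one way to do that; the helper names are hypothetical. No result for this gene is asserted here, since the record does not say how its reported GC content was computed or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    example = clean_sequence("atgc ggcc\n")
    print(example, gc_fraction(example))
```

To apply it to this record, pass the full gene sequence text to `clean_sequence` and then the result to `gc_fraction`.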
 
Protein sequence
MLLHRFRRSM FVAVTVAALP IAGQALAQTP DHPGDNSAQF PTSQDLRSMT TAGSYLAARH 
ASVERDAASA AAFYRSALRT DPKNNELLDR AFISSLAEGN IEESVKLADR ILKIDKTNRV
ARLVIGVRDL KTKKYAAAVQ NVNLSVRGPI TDLVATLLSS WAMEGAGDVK GAVANIDKLA
GPEWYPIFKD LHSGMMLELA NKQKDAGVRF ERAYKLDDSA LRVTDAYARW LSRNKDDGSA
VAIYEGFDKK LSRHPLVLEG LRDAKAGKKL PPLVDSPQAG AAEALYGIGA SLTRRGGEDL
ALVYLQLALY LKPDHALALL ALGDLYESVK KPQMAVKVYE RVPADSPLKR NAQIQLATDL
DAIDRSEEAI KILKTVIAED GKDLEAIMAL GNIERGRKKF ADCAVTYSQG IDALTGTEKN
SWVYYYFRGI CEERSKQWAK AEVDMKKALQ LQPEQPHVLN YLGYSWIDQG INLDEAMKMI
KRAVDQRPDD GYIVDSLGWA YFRIGNYEEA VKTLERAIDL KPEDPTINDH LGDAYWRVGR
TLEARFQWAH ARDLKPDPEE LPKIEAKLAN GLPEDTSSAA SADKKKDDDK GG
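
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below is illustrative, not the database's own pipeline. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping a trailing stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence.
    print(translate("ATGCTTCTTC ACCGCTTCCG TCGTTCGATG"))
    # Last nine bases of the gene sequence, ending in the stop codon TGA.
    print(translate("GGCGGCTGA"))
```

The first 30 bases translate to MLLHRFRRSM and the final nine bases to GG, which matches the start and end of the protein sequence listed in the record.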