Gene RPC_4357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4357 
Symbol 
ID3970834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4857395 
End bp4859176 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content65% 
IMG OID637927466 
Producttetratricopeptide TPR_2 
Protein accessionYP_534199 
Protein GI90425829 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0312227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0147446 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCAA CTCGACTTCG TCGCTCGATG ATCGCTGCCG TCGCGCTGGC CGCGCTGCCG 
CTGTCCAGCC CGGTGCTGGC GCAGACGCCG GATCACCAGA ACGACAGCTC GGTGCCGTTT
CCGACCAGCC GCGATCTGAA ATCGCTGACC ACCGCGGGCA GCTATCTGGC GGCGCGCCAC
GCCAGCGTGC AGCGCGACGC CAGTGCGGCG GCGACCTTCT ATCGCTCGGC GCTGCGCGCC
GATCCGAAGA ACAACGAATT GTTGGATCGC GCCTTTATTT CCTCGCTGGC CGACGGCGAC
ATCGACGAGG CGGTCAAGCT CGCCGATCGC ATTCTGGCAA TCGACAAGAC CAACCGGGTG
GCGCGCCTGG TGGTCGGCAT TCGTGACCTG AAGCAGAAGA AATACGCCGC CGCCCAGCAG
AACATCAATC AATCGGTGCG CGGTCCGATC ACCGATCTGG TCGCGACGCT GATCTCCGGC
TGGGCGAGCT ACGGCGCCGG CGACGTCAGG ACCGCGGTCG GCAATATCGA CAAGCTGGCC
GGTCCGGAAT GGTATCCGAT TTTCAAGGAC CTGCATTCCG GCATGATGCT GGATCTCGCG
GGCAAGCAGA AGGACGCCGG GGTCCGGCTG GAACGCGCCT ACAAGCTCGA CGACTCCGCG
TTGCGGGTGG TCGACGCTTA CGGGCGCTGG CTGTCGCGCA ACAAGGACGA CGCCGCGGCG
AGCGCGGTCT ACGAGGCGTT CGACAAGAAG CTGGCGCGGC ATCCGTTGGT GCTGGAAGGC
ATCCGCGAGA CCAAGGCCGG CAAGAAATTG CCGCCGCTGG TCGATTCGGC GCAGGCCGGC
GCCGCCGAGG CGCTGTACGG CATCGGCGCC TCATTGACCC GGCGCGGCGG CGAGGATCTG
GCGCTGGTCT ATCTGCAGCT CGCTCTGTAC CTGCAGCCGC AGCATTCGCT GGCGCTGTTG
TCGCTCGGCG ATCTCTACGA GTCGGTGAAG AAGCCGCAGA TGGCGATCAA GGCCTACGAG
CGCGTGCCGG CCAATTCGCC GCTGAAGCGC AACGCCCAGA TCCAGCTCGC CACCGATCTC
GACGCCGCCG ACCGCAGCGA GGAGGCGATC AAGATCTTGA AGGACGTCAC CGCCGAAGAT
CCGAAGGATC TCGAGGCGAT CATGGCGCTC GGCAACATCG AGCGCGGCCG CAAGAAGTTC
GCCGATTGCG CGGTGACCTA CTCGCAGGGC ATCGACGTGC TGTCCGGCGC CGAGAAGAAC
AACTGGGTGT ACTACTATTT CCGCGGCATC TGCGAGGAGC GCTCCAAGCA ATGGAGCAAG
GCCGAAGCCG ACATGAAGAA GGCGCTCGAG CTGCAGCCGG AGCAGCCGCA CGTCTTGAAC
TATCTCGGCT ACTCCTGGAT CGATCAGGGG ATCAATCTCG ACGACGCGAT GAAGATGATC
CGGCGCGCGG TCGATCAGCG CCCGGACGAC GGCTACATCG TCGACTCGCT CGGCTGGGCC
TATTACCGCA TCGGCAATTA CGAGGAGGCG GTGAAGAACC TCGAGCGCGC CATCGACCTG
AAGCCGGAAG ATCCCACCAT CAACGATCAT CTCGGCGACG CCTATTGGCG GGTCGGGCGC
ACCCTGGAGG CGAAATTCCA GTGGGTGCAT GCCCGCGATC TGAAGCCCGA GGCGGAAGAA
CTGCCGAAGA TCGAGGCCAA GATCGAGAAC GGGCTGCCGG ACGACGCCGG CGCGTCGGCG
GCCTCGGCGG ACAAGAAAAA AGAAGATGGC AAGGGCGGCT GA
 
Protein sequence
MLSTRLRRSM IAAVALAALP LSSPVLAQTP DHQNDSSVPF PTSRDLKSLT TAGSYLAARH 
ASVQRDASAA ATFYRSALRA DPKNNELLDR AFISSLADGD IDEAVKLADR ILAIDKTNRV
ARLVVGIRDL KQKKYAAAQQ NINQSVRGPI TDLVATLISG WASYGAGDVR TAVGNIDKLA
GPEWYPIFKD LHSGMMLDLA GKQKDAGVRL ERAYKLDDSA LRVVDAYGRW LSRNKDDAAA
SAVYEAFDKK LARHPLVLEG IRETKAGKKL PPLVDSAQAG AAEALYGIGA SLTRRGGEDL
ALVYLQLALY LQPQHSLALL SLGDLYESVK KPQMAIKAYE RVPANSPLKR NAQIQLATDL
DAADRSEEAI KILKDVTAED PKDLEAIMAL GNIERGRKKF ADCAVTYSQG IDVLSGAEKN
NWVYYYFRGI CEERSKQWSK AEADMKKALE LQPEQPHVLN YLGYSWIDQG INLDDAMKMI
RRAVDQRPDD GYIVDSLGWA YYRIGNYEEA VKNLERAIDL KPEDPTINDH LGDAYWRVGR
TLEAKFQWVH ARDLKPEAEE LPKIEAKIEN GLPDDAGASA ASADKKKEDG KGG