Gene Rpal_5052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5052 
Symbol 
ID6412746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5434744 
End bp5436306 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content66% 
IMG OID642714937 
Productprotease Do 
Protein accessionYP_001994016 
Protein GI192293411 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.610685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGGCG ACCATTTCGA CCACGTATCA GCTCAAACCG AGGCTCACAT CCGCAAGGTG 
TTGAGGCCGC GGCGTCTCTC GCTGTTGGCT TCCGCTGCCG GCTTGAGCAT GGTGGTGGCG
CTCGGCGGCG CCGGATTTCT CACCCGCGAG ATGCCGTCGC TGACGTCGCC GGCCTATGCG
GCTGAAAACG GCAAGCCGGC GCCCGCCGGG TTCGGCGATC TGGTCGACAA GGTCAAGCCG
GCAGTGATTT CGGTGCGGGT CAAGATCGAC GACGATCGCC AGACCACGCC GCTGGTGCGT
GACGATGGTG ACGGCGATCA GATGCAGACG CCGCGGGGCC TCGCGCCGTT CCAGCAGTTC
GAGCGTCAGT TCGGCTTCCG CGGTCCTGAA GGCATGCCGA AGCGGCACCA GATGATCACC
GGCGAAGGCT CCGGCTTCTT CATCACCGCC GACGGCTACG CGGTCACCAA TAATCACGTG
GTCGATCACG CCAAGTCGGT GCAGGTGACG ACCGACGACG GCACTATCTA CACCGCCAAG
GTCGTCGGCA CCGACGACAA GACCGATCTG GCGCTGATCA AGGTCGACGG CAAAACCGAT
TTTCCGCACG TCAACTTCGC CGATGCGCCG GCGCGGGTCG GCGATTGGGT GATCGCTGTC
GGCAATCCGT TCGGCCTCGG CGGCACGGTG ACGGCGGGCA TCGTCTCGGC GCGCGGTCGT
GACATCGGTT CGGGCCCCTA TGACGACTAC GTGCAGATCG ACGCGCCTAT CAACAAGGGC
AACTCCGGCG GTCCGGCGTT CGACACCAAT GGCAATGTGA TCGGCGTCAA CACCGCGATC
TATTCGCCAT CGGGCGGCTC GGTCGGCATC GGCTTCGATA TTCCGGCGGC GACCGCGAAG
CTGGTGGTGT CGCAGCTCAA GGACAAGGGC TACGTCACCC GCGGCTGGCT CGGCGTGCAG
GTGCAGCCGG TCACGGCGGA GATCGCCGAC AGTCTCGGCA TGAAGCAGGC CCGCGGCGCG
CTGGTCGATA GTCCGCAGGA CGGCAGCCCG GCCGCGAAGG CGGGCATCAA GGCCGGCGAT
GTGATCACCG CGGTCGACGG CAAGGAGGTC AAGGACTCCC GCGCGCTCGC CCGCACCATC
AGCACGCTGG CACCGGGCTC CTCGGTGAAG CTCGACGTGC TGCACAACGG CCAGTCCAAG
ACGATGGATC TGACGCTCGC CGAAATGCCC GGTGATCATC AGAAGGTCGC CGACAGCAGC
GGCGATCGCG ACGCTACCCG TCCGTATCTC GGCCTGCGCG TGGCACCGGC CAGCGAAGTC
GACGGTGCCG GCAAGAACGG GGTGGTCGTT ACCGGTGTCG ATCCGGACGG GCCGGCCGCC
GACAAGGGCC TGCGCACGGG TGATGTCATC CTCGACGTCG GCGGCAAGGC GGTGACCAAC
ACCGGCGATG TCCGCAACGC GCTCACACAG GCCGGCAAGG ACGGCAAGAA GACCGTGCTG
ATGCGGGTGA AGACGGCGGA TTCGGCGGCG CGCTTTGTCG CGGTGCCGAT CGCGAAGGGC
TGA
 
Protein sequence
MEGDHFDHVS AQTEAHIRKV LRPRRLSLLA SAAGLSMVVA LGGAGFLTRE MPSLTSPAYA 
AENGKPAPAG FGDLVDKVKP AVISVRVKID DDRQTTPLVR DDGDGDQMQT PRGLAPFQQF
ERQFGFRGPE GMPKRHQMIT GEGSGFFITA DGYAVTNNHV VDHAKSVQVT TDDGTIYTAK
VVGTDDKTDL ALIKVDGKTD FPHVNFADAP ARVGDWVIAV GNPFGLGGTV TAGIVSARGR
DIGSGPYDDY VQIDAPINKG NSGGPAFDTN GNVIGVNTAI YSPSGGSVGI GFDIPAATAK
LVVSQLKDKG YVTRGWLGVQ VQPVTAEIAD SLGMKQARGA LVDSPQDGSP AAKAGIKAGD
VITAVDGKEV KDSRALARTI STLAPGSSVK LDVLHNGQSK TMDLTLAEMP GDHQKVADSS
GDRDATRPYL GLRVAPASEV DGAGKNGVVV TGVDPDGPAA DKGLRTGDVI LDVGGKAVTN
TGDVRNALTQ AGKDGKKTVL MRVKTADSAA RFVAVPIAKG