Gene RPB_3063 details


Gene Information

Locus tag: RPB_3063
Symbol:
ID: 3910864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3494146
End bp: 3495732
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 62%
IMG OID: 637884970
Product: ATPase
Protein accession: YP_486675
Protein GI: 86750179
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID:
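
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3494146
end_bp = 3495732

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed 1587 bp and 528 aa.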


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.454376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATCGGAA ACGATGATGA CGGTGACGTT CCGCCGTCTC CCGATGCCGT CGATCTGGAC 
GCTACCGTCG ACGGTGCCGC AGCAGACTCG GCCGATAGCG ACCAAGAGCA CTTTCAAGGA
CTGCCGAGGC TGCTGCGCTA CGCAATGCTG GCCAGGGACA ACGTCGACGA CACCGTCGTG
CATCGCCTGA TCGCGGAGAT CGATGAGCTT TCGCCGAAGC TGCCCCAGAA CGCCGACTGG
GCGGCCGCGC AGAATTCCGA AACCGGCTTT GCCCTTGCTG CTGAACTCGA TCAGCTGGCA
GTGGTCCGTG ACCTGCCCTT ATTGCGCAGC CTCAGTGACT GCGTTCGGTT GCTCACCCTG
ACGCTGCCAT GCGACCCGAA ACGGTTCGGG GCGTATCGGC GCGTGGCGCG GGCCATGATC
TTTGCCTTCC GCCAGATCGA GGCGAGCCTC GACGACGAGC GCTGCGCCGA GCTTGAGCGA
ATTGTCTATG GCTTTGCGGC ATTGCCGGCA GCAATCGGGG CAAACGACAG CGCCCGCTCA
TGGCCGGCTA TTTCCAGCGC CGCGCATCTG GGCGAGCAAT CGGTGCGTCA CCGCCTGCGG
GCAGTTGTCG TGAGAACCTC CCAGAGCGTC CGACGTCAGA TAGAACGGCA AACGGAGCCG
AAGAAACCAA AAGATACTCC CTCGATCGAC GCCACCACGC CCCATCCGCC AGTCAACGAA
CCAATAGCGC CCAACCACGT CATTGTGGCC CGGATCGAAC AGACGGAGCT CAAGAACCTC
AAGTTCATCG AGCCGTTCCG ACATGTGCTC AACAATGCCC TGCCGCTGGT CGAGGCAACC
GCTCTCGACC GGGTCCGCAC CACCTTGGCG GCCGAGTTTC CTTATGCTGT CGAAGTGGTG
GACTTCATGC TCACCGACTT GATTGGCCGT CCGACTATAC GGCTGCGACC ATTGCTGCTG
GTCGAAGCAC CCGGCAGTGG AAAATCACGT TTCGCACGCA GACTTGGCGA ACTGCTGGGA
GTCGCAATCT GGCGGACCGA CGCATCTCAA TCCGACGGGA ACGTATTTGC CGGCACAGAC
CGGCGTTGGA ATTCGGCCGA ACCCTGCCAT CCGCTGCTCG CAATCGCACG CGGCAAGATT
GCGAACCCGA TCGTCATAAT CGACGAGATC GAAAAAGCCG GCACGCGCAG CGACAATGGT
AGGCTTTGGG ACTGCCTGCT CGGCCTTCTG GAGCCAGAGA CCAACGCGCG ATATCCCGAT
CCAGCGCTGC AAACTCCCCT CGACCTCAGC CACGTCAGCT ACGTCGCCAC GGCAAACACT
CTCGATCCAT TGCCATCACC GCTGCTGGAC CGTCTCCGGA TCATCGCGTT CCCGAAGCCG
ACCCTGGACG ATCTCAATGC CCTGTTGCCG GGTTTGATTG AGGCCATTGC TGAGGACCGT
GGTGTCGACG GACGTTGGAT TGCCCCCCTG GACGCCCATG ATTGCGCTGC AATCGCCGCT
GTTTGGCCCG GCGGATCGGT GCGCCGGCTA CGCCGGGCTG TCGAATTTAT CCTTCAGCAG
CGAGATCGCA CCGCGCCGAG ACATTGA
 
Protein sequence
MIGNDDDGDV PPSPDAVDLD ATVDGAAADS ADSDQEHFQG LPRLLRYAML ARDNVDDTVV 
HRLIAEIDEL SPKLPQNADW AAAQNSETGF ALAAELDQLA VVRDLPLLRS LSDCVRLLTL
TLPCDPKRFG AYRRVARAMI FAFRQIEASL DDERCAELER IVYGFAALPA AIGANDSARS
WPAISSAAHL GEQSVRHRLR AVVVRTSQSV RRQIERQTEP KKPKDTPSID ATTPHPPVNE
PIAPNHVIVA RIEQTELKNL KFIEPFRHVL NNALPLVEAT ALDRVRTTLA AEFPYAVEVV
DFMLTDLIGR PTIRLRPLLL VEAPGSGKSR FARRLGELLG VAIWRTDASQ SDGNVFAGTD
RRWNSAEPCH PLLAIARGKI ANPIVIIDEI EKAGTRSDNG RLWDCLLGLL EPETNARYPD
PALQTPLDLS HVSYVATANT LDPLPSPLLD RLRIIAFPKP TLDDLNALLP GLIEAIAEDR
GVDGRWIAPL DAHDCAAIAA VWPGGSVRRL RRAVEFILQQ RDRTAPRH
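
The relationship between the gene sequence, the protein sequence, and the "Translation table 11" and "GC content" fields can be illustrated with a short sketch. It is a minimal example, not the database's own pipeline: the helper names `translate_cds` and `gc_content` are hypothetical, the amino-acid string is the standard NCBI layout for table 11 (codons ordered T, C, A, G), and the first codon is read as methionine when it is one of the table 11 start codons, which is why the leading TTG appears as M in the protein.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Amino acids are listed for codons in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; each is translated as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Drop the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS up to (not including) the first stop codon.

    Assumes unambiguous bases (A, C, G, T only); other symbols raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence shown above.
print(translate_cds("TTGATCGGAA ACGATGATGA CGGTGACGTT"))  # MIGNDDDGDV
```

Passing the full gene sequence to `translate_cds` should reproduce the protein sequence, and `gc_content` on the same input can be compared with the GC content field.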