Gene RPD_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2122 
Symbol 
ID4022605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2372063 
End bp2374084 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content68% 
IMG OID637962316 
ProductATPase 
Protein accessionYP_569258 
Protein GI91976599 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.317051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACA GCGATTCGAG AAGCAAGACG GAAGAATACT GGCTGACAAC CGTTGAGGAG 
GACCGCGCAG AGTCCGCGCC GGCGCCGGCT GATCCGGCTG ATCTCGACAT CCTCGAGCTC
CCCACCGCGG CCATCGACGC CATGCTGTGC CCGACGGAGC GCGAGGCGGT CGACTGCGCC
ATCGTGCGCG CCGAGGTCGC GCTGCGCGGC CGGACGGATC GCCTGCAGGT CGACTTAGCC
AAGATGCGGA CGCCAAAGGC CGATGCCATC GATCGCCTGC GGTGGTCGTC CGAGTGCCGC
GCGATAGCCT CTGAACTCGA AGCGATCGGC GGCGCCGCAG CTAACGAAGC CGCCCTCCTC
TGGTGGATGC TGTCGTCATC GCCGCGAGAC CCCGGGACGT GCGCGGTCGC TGTCGACCTG
GCGAAGATCG CCCCGCGGAT CCGGTGGCCG GAGAAGGCCG ACGAGATCAA GGCGCGACTC
AGTGTCTGGA GGCGCGCCTA CGCCGGCAAG TACGTGGACG ACAACCGCGA CCTTTTCGCC
ATCGCCGAAG AAGAGATTAT GAGATCGAAG AATCCGACGA TGCGGAAGAA CTACGGGTCA
GTTCCGCTCT CGGTGCTACT CTGCGACGAC GAGAAGTCTG CCGTCGCGCT GGTCGTCAAA
TACACGCGGA AGATCTCGGG ACCACCGTAC GTCGATGTTC CTGACTTCCT GGTGAGCACG
CCGCCGAAGG ACGGGGATCG GCGCATCTGG TGGCGCACTG AGTGCCGCCG GATGTCGTCC
CGGCTATCCG AGCGCGCCGA CGATGGCCTC GCCGGAGCGA CCGCGCTCGC GTGGCGGATG
CTCGCCGCCG ACCCGGCCGA TACGCGGACA TTTTTGTCGG TCCTGCCGGC TCTGAGGCAT
CTCGCGGACC GGTGCACATG GTCGGATGAC CAGAAGTCCA AGCTGACGGC GCGCCTAGCT
ATATGGACCC TGGCAGCCTC CGGCGAGTAC GGCCGCGATC CGCGCGACAT GTTCGTGATC
GCCGCGCAGC AGTCCGAGTC CCTGCACGCC GACGAGGGTC ATGACGATCC GGCGGTCGAA
GAGCTCTGCA ATCGCGCCCA TGCAGCGCTG AGTCGCCGGA AAGTGCGCCT CCCGTCGTCG
CCGGCCAGGC CGAGGCCGGC GCCGGATGGC CCTGCTGTCG TCGTGATGAC CGAGGCGCCG
GTCGAGAAGA AGCACATGCC GGACGTCTGG AAGCGCCTGG AAGGCGCCGA CGTGCCGCTG
GTAGTCTGCC GCGACGCTGC CGTCGTCCGG CAGGCACTGG AGGCCGAGTA CCCGCACGCG
CGCGCTGCCG TGGCGATGCT GACGCAGGAT CTGCGGGACG GCGAGCCGGT CCGGATGCGG
CCGACCCTCC TCGTTGGTCC CCCAGGCTCC GGCAAGTCGC GCCTGGTGCG CCGGATCGGG
GAACTACTCG GCGTCTATGT TTACAGGCTC GACGCAACCG CCAGCGCCGA CGGATTTTTC
GCCGGCACCA ACCGCGCATG GCACTCGTCG GCCCCCTCCG TGCCCGCGCG CGCGATCGCG
GCGGCGATGC GCGCAAACCC AATCGTGATG ATCGACGAGA TCGAGAAGGC CGCCGAGAGC
ACTATGAACG GAAACCTATG GTCGGCGATG GCGCCTCTTC TGGAGCGCGA GACCTCGCGT
AGCTACAGAG ACTGCGGCCT CGACGCCCAG CTCGATCTCT CGCACGTCAA CCATGTCGCC
ACCGCGAACA CGACAGAGCG GTTGCCGTCG TTCCTGCGCG ACCGCTTCAG GCTGATCCGC
GTCCCGTCCC CCACGCTGGC GCACCTGCCG GCGTTGGCGG CGCTGGTCCT GCAGGACATC
GCGCGGGACG ACGACGCGCG CGCCGGGGCA CCACCACTGG CGCCCGACGA GCTCGACGTG
ATCGGCCGGG CGTGGGCCCG GGAGAAATTC TCGATGCGCA AGCTGGCGCG CCTGGTCGAG
GCGACCCTCG AGGCGCGCGA CGCCTGCGCG CCGCGGCACT GA
 
Protein sequence
MSDSDSRSKT EEYWLTTVEE DRAESAPAPA DPADLDILEL PTAAIDAMLC PTEREAVDCA 
IVRAEVALRG RTDRLQVDLA KMRTPKADAI DRLRWSSECR AIASELEAIG GAAANEAALL
WWMLSSSPRD PGTCAVAVDL AKIAPRIRWP EKADEIKARL SVWRRAYAGK YVDDNRDLFA
IAEEEIMRSK NPTMRKNYGS VPLSVLLCDD EKSAVALVVK YTRKISGPPY VDVPDFLVST
PPKDGDRRIW WRTECRRMSS RLSERADDGL AGATALAWRM LAADPADTRT FLSVLPALRH
LADRCTWSDD QKSKLTARLA IWTLAASGEY GRDPRDMFVI AAQQSESLHA DEGHDDPAVE
ELCNRAHAAL SRRKVRLPSS PARPRPAPDG PAVVVMTEAP VEKKHMPDVW KRLEGADVPL
VVCRDAAVVR QALEAEYPHA RAAVAMLTQD LRDGEPVRMR PTLLVGPPGS GKSRLVRRIG
ELLGVYVYRL DATASADGFF AGTNRAWHSS APSVPARAIA AAMRANPIVM IDEIEKAAES
TMNGNLWSAM APLLERETSR SYRDCGLDAQ LDLSHVNHVA TANTTERLPS FLRDRFRLIR
VPSPTLAHLP ALAALVLQDI ARDDDARAGA PPLAPDELDV IGRAWAREKF SMRKLARLVE
ATLEARDACA PRH