Gene RPD_3055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3055 
Symbol 
ID4023558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3400601 
End bp3403006 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content64% 
IMG OID637963254 
ProductATPase AAA-2 
Protein accessionYP_570182 
Protein GI91977523 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.800011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACAT TTTCCCAGAG TCTTGAGCAA TCCCTCCACC GCGCGCTCGC GATCGCCAAC 
GAGCGACACC ATCAATACGC GACGCTCGAG CATCTGTTGC TCTCGCTGGT CGACGATTCC
GACGCCGCCG CGGTGATGCG GGCTTGCAGC GTCGACCTCG ATAAGCTCCG CGCCAGCCTC
GTCAACTATC TCGAGACCGA ATTCGAGAAC CTCATCACCG ATGGGTCGGA AGACGCCAAG
CCGACCGCCG GGTTCCAGCG TGTGATCCAG CGCGCGGTAA TTCATGTGCA GTCGTCCGGC
CGCGAAGAGG TGACCGGCGC CAACGTGCTG ATCGCGATCT TCGCCGAACG CGAGAGCCAT
GCCGCGTATT TCCTGCAGGA GCAGGACATG ACGCGCTACG ACGCGGTCAA CTACATCAGC
CATGGCATCG CCAAGCGGCC CGGCGTGTCC GAGGCGCGGC CGGTGCGCGG CGTCGACGAA
GAGACCGAGA CCAAGAGCGG CGAGGACTCC AAGAAGAAGG GAGACGCGCT CGAGACCTAT
TGCGTCAACC TGAACAAGAA GGCGCGCGAC GGCAAGATCG ATCCGGTGAT CGGCCGCAAT
GCCGAGATCA ACCGCGCGAT TCAGGTGCTG TGCCGCCGGC AGAAGAACAA CCCGCTGTTC
GTCGGCGAAG CCGGCGTCGG CAAGACCGCG ATCGCCGAGG GCCTCGCCAA GCGTATCGTC
GACAGCGAAG TGCCGGAAGT GCTCGCGGCG GCGACGGTGT TCTCGCTCGA CATGGGCACG
CTGCTCGCCG GCACGCGCTA TCGCGGCGAC TTCGAGGAGC GCCTGAAGCA GGTGCTCAAG
GAGCTCGAAG CGCATCCCAA CGCGATTCTG TTCATCGACG AGATCCACAC CGTGATCGGC
GCTGGCGCGA CCTCCGGCGG CGCGATGGAT GCCTCGAATT TGCTCAAGCC TGCCTTGGCT
TCGGGCACGA TCCGCTGCAT GGGCTCGACG ACCTATAAGG AATACCGTCA GCACTTCGAG
AAGGATCGCG CGCTTGTGCG CCGGTTCCAG AAGATCGACG TCAACGAGCC GACGGTGGCG
GATGCGATCG CGATCCTGAA GGGGCTGAAG CCCTATTTCG AGGACTATCA CAAGCTCAAA
TACACCAACG AGGCGATCGA ATCCGCGGTC GAGCTTTCGT CGCGCTACAT CCATGACCGG
AAATTGCCGG ACAAGGCGAT CGACGTGATC GATGAATCCG GCGCGGCGCA GATGCTGGTG
GCGGAGAACA AGCGCAAGAA GACGATCGGC ATCAAGGAAA TCGAGGCCAC CGTCGCGACC
ATGGCTCGGA TCCCGCCGAA GAGCGTGTCG AAGGACGACG CCGAGGTGCT GATGCATCTC
GAGCAGACCC TGAAGCGCGT CGTGTTCGGG CAGGACAAGG CGATCGAATC GCTCGCCGCG
TCGATCAAGC TCGCCCGCGC CGGCCTGCGG GAACCGGAGA AGCCGATCGG CTGCTATCTG
TTCTCGGGTC CGACCGGCGT CGGCAAGACC GAAGTCGCCA AGCAATTGGC GCTGACGCTC
GGCGTCGAGC TGCTGCGCTT CGACATGTCG GAATACATGG AACGCCACAC CGTCTCGCGG
CTGATCGGCG CGCCGCCCGG CTATGTCGGC TTCGATCAGG GCGGCCTTTT GACCGACGGC
GTCGACCAGC ATCCGCATTG CGTCGTGCTG CTCGACGAAA TCGAGAAGGC CCATCCCGAT
CTGTACAACG TGCTGCTGCA GATCATGGAT CACGGCCGGC TGACCGATCA CAACGGCAAG
CAGGTCAACT TCCGCAACGT CATCCTGATC ATGACGACGA ACGCGGGCGC GGCCGATCTG
GCCCGGCAGG CGTTCGGCTT CACCCGCAAC AAGCGGGAAG GCGACGACCA CGAGGCGATC
AACCGGCAGT TCGCTCCGGA ATTCCGCAAC CGGCTCGACG CGATCGTGTC GTTCGCGCAT
CTCAATGCCG ATGTCATCGG CATGGTGGTC GAGAAGTTCG TGCTGCAGCT CGAGGCTCAG
CTCGCCGATC GCGACGTGAC GATTGAGCTG TCCGACGCCG CCAAGGCGTG GCTGGTTCAG
CACGGCTATG ACGAGCAGAT GGGGGCGCGG CCGATGGCGC GCGTGATCCA GGAGCACATC
AAGAAACCGC TCGCCGACGA GGTGCTGTTC GGTCAGCTCA AGGGCGGCGG CCATGTCCGG
GTCGTTTTGG TCAAGGACGA GGCGGTCGCC GGCGTCGAGC TGGAGAAGAT CGCCTTCGAG
TTCCTCGATG GCCCGGTGAC GCCCAAGCCG GAGAAGCTGC CCAACGCCAA AAAGCGCGGC
GGCGCAGCGC GGAAACCGAA GTCAGGCCCG AAGGGGTCGG CAAAGGATCC GCTGGTCAAG
GCCTGA
 
Protein sequence
MPTFSQSLEQ SLHRALAIAN ERHHQYATLE HLLLSLVDDS DAAAVMRACS VDLDKLRASL 
VNYLETEFEN LITDGSEDAK PTAGFQRVIQ RAVIHVQSSG REEVTGANVL IAIFAERESH
AAYFLQEQDM TRYDAVNYIS HGIAKRPGVS EARPVRGVDE ETETKSGEDS KKKGDALETY
CVNLNKKARD GKIDPVIGRN AEINRAIQVL CRRQKNNPLF VGEAGVGKTA IAEGLAKRIV
DSEVPEVLAA ATVFSLDMGT LLAGTRYRGD FEERLKQVLK ELEAHPNAIL FIDEIHTVIG
AGATSGGAMD ASNLLKPALA SGTIRCMGST TYKEYRQHFE KDRALVRRFQ KIDVNEPTVA
DAIAILKGLK PYFEDYHKLK YTNEAIESAV ELSSRYIHDR KLPDKAIDVI DESGAAQMLV
AENKRKKTIG IKEIEATVAT MARIPPKSVS KDDAEVLMHL EQTLKRVVFG QDKAIESLAA
SIKLARAGLR EPEKPIGCYL FSGPTGVGKT EVAKQLALTL GVELLRFDMS EYMERHTVSR
LIGAPPGYVG FDQGGLLTDG VDQHPHCVVL LDEIEKAHPD LYNVLLQIMD HGRLTDHNGK
QVNFRNVILI MTTNAGAADL ARQAFGFTRN KREGDDHEAI NRQFAPEFRN RLDAIVSFAH
LNADVIGMVV EKFVLQLEAQ LADRDVTIEL SDAAKAWLVQ HGYDEQMGAR PMARVIQEHI
KKPLADEVLF GQLKGGGHVR VVLVKDEAVA GVELEKIAFE FLDGPVTPKP EKLPNAKKRG
GAARKPKSGP KGSAKDPLVK A