Gene Rpal_3556 details


Gene Information

Locus tag: Rpal_3556
Symbol:
ID: 6411230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3804607
End bp: 3806991
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table: 11
GC content: 63%
IMG OID: 642713434
Product: ATP-dependent Clp protease, ATP-binding subunit clpA
Protein accession: YP_001992531
Protein GI: 192291926
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA
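
The start, end, gene length and protein length fields are related by simple arithmetic, which the sketch below checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 3804607
end_bp = 3806991


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2385
print(protein_length(length_bp))  # 794
```

Both results agree with the Gene Length and Protein Length fields of the record.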


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGACAT TTTCCCAAAG CCTTGAGCAA TCCCTCCATC GTGCGCTCGC CATCGCCAAC 
GAGCGTCACC ACCAGTACGC AACCCTCGAA CATCTGCTGT TGTCTCTGGT CGACGATTCC
GACGCGGCCG CTGTGATGCG CGCCTGCAGC GTCGATCTCG ACAAGCTGCG CACCAGCCTG
GTCAACTATC TCGAAACCGA ATTCGAGAAT CTGGTGACCG ACGGCTCTGA GGACGCCAAG
CCGACCGCCG GCTTCCAGCG CGTGATTCAG CGCGCCGTGA TCCATGTGCA ATCCTCGGGG
CGTGAGGAAG TGACCGGCGC TAATGTGCTG GTTGCGATCT TCGCCGAGCG TGAAAGCCAC
GCCGCGTACT TCCTGCAGGA GCAGGACATG ACGCGCTATG ACGCGGTCAA CTACATCAGC
CACGGCATCG CCAAGCGTCC CGGCGTGTCC GAAGCTCGCC CGGTGCGTGG TGTCGACGAA
GAGACCGAGA CCAAGGGCGG CGACGACTCC AAGAAGAAGG GCGACGCGCT CGAGACCTAT
TGCGTCAACC TCAACAAGAA GGCCCGCGAC GGCAAGATCG ATCCGGTGAT CGGGCGCAAT
GCCGAGATCA GCCGCGCGAT CCAGGTGCTG TGCCGCCGGC AGAAGAACAA CCCGCTGTTC
GTCGGCGAAG CCGGCGTCGG CAAGACGGCG ATTGCGGAAG GTCTCGCCAA GCGCATCGTC
GACGGCGAGG TTCCGGAGGT GCTGTCCGCC GCCACCGTGT TCTCGCTCGA CATGGGCACG
CTGCTGGCAG GTACGCGCTA TCGCGGTGAC TTCGAAGAAC GCCTCAAGCA AGTCTTGAAG
GAACTCGAAG CGCACCCGAA CGCCATCCTG TTCATCGACG AGATCCACAC CGTCATCGGC
GCCGGTGCCA CTTCCGGCGG CGCAATGGAT GCCTCGAATT TGCTCAAGCC TGCCTTGGCT
TCGGGCACTA TTCGCTGCAT GGGCTCGACC ACCTACAAGG AATACCGTCA GCACTTCGAG
AAGGACCGCG CGCTGGTGCG GCGCTTCCAG AAGATCGATG TCAACGAACC GACCGTCGAG
GATGCGATCG CGATCCTGAA GGGGCTGAAG CCTTACTTCG AGGATTACCA CAAGCTGAAG
TACACCAACG AGGCGATCGA ATCCGCGGTG CAGCTGTCGT CGCGCTACAT CCATGACCGG
AAGTTGCCGG ACAAGGCGAT CGACGTGATC GACGAATCCG GCGCGGCGCA GATGCTGCTG
TCGGAGAACA AGCGCAAGAA GACGATCGGC ATCAAGGAGA TCGAAGCCAC GGTTGCGACC
ATGGCGCGGA TCCCGCCGAA GAGCGTGTCG AAGGACGACG CCGAGGTGCT CAAGCATCTC
GAGCAGACCC TGAAGCGCGT GGTGTTCGGT CAGGACAAAG CGATCGAGGC GCTGTCGGCG
TCGATCAAGC TGGCGCGTGC CGGCCTGCGT GAACCGGAGA AGCCGATCGG CTGCTACCTG
TTCTCGGGCC CGACCGGCGT CGGCAAGACC GAAGTCGCCA AGCAACTCGC CTCGACGCTC
GGTGTCGAGC TGCTGCGCTT CGACATGTCG GAGTACATGG AGCGGCACAC CGTGTCGCGT
CTGATCGGCG CGCCTCCCGG CTATGTCGGC TTCGACCAGG GCGGCCTGCT GACCGACGGC
GTCGATCAGC ATCCGCATTG CGTGGTGCTG CTCGACGAAA TCGAGAAGGC GCATCCGGAT
CTGTACAACG TGCTGCTGCA GATCATGGAT CATGGCCGGC TCACCGATCA CAACGGCAAG
CAGGTCAACT TCCGTAACGT GATCCTGATC ATGACCACGA ACGCGGGCGC GGCGGATCTG
GCTCGTCAGG CGTTCGGCTT CACCCGCAAC AAGCGGGAAG GTGACGACCA CGAGGCGATC
AACCGGCAGT TCGCGCCGGA ATTCCGCAAC CGTCTCGATG CGATCGTGTC GTTCGCGCAC
CTCAATGCCG ACGTCATCGG CATGGTGGTG GAGAAGTTCG TGCTGCAGCT CGAGGCTCAG
CTCGCGGACC GCGACGTCAC CATTGAGCTG TCCGAGCCCG CCAAGGCGTG GCTGGTGCAG
CACGGCTACG ACGAGCAGAT GGGCGCCCGG CCGATGGCCC GGGTGATCCA GGAGCACATC
AAGAAGCCGC TCGCCGACGA GGTGCTGTTC GGCAAGCTGA AGGGCGGCGG CCACGTCAAG
GTCGTGCTGG TCAAGGACGA CGCCGTCGCC GGCGTCGAGC TGGAGAAGAT CGGCTTCGAA
TTCCTCGACG GCCCTGTGAC GCCGAAGCCG GAAAATCTAC CCGGCAAGAA GCGTGGCGGC
TCGGCCCGCA AGGCCAAGGT GAAGGATCAG GTCGAGAAAG CCTGA
 
Protein sequence
MPTFSQSLEQ SLHRALAIAN ERHHQYATLE HLLLSLVDDS DAAAVMRACS VDLDKLRTSL 
VNYLETEFEN LVTDGSEDAK PTAGFQRVIQ RAVIHVQSSG REEVTGANVL VAIFAERESH
AAYFLQEQDM TRYDAVNYIS HGIAKRPGVS EARPVRGVDE ETETKGGDDS KKKGDALETY
CVNLNKKARD GKIDPVIGRN AEISRAIQVL CRRQKNNPLF VGEAGVGKTA IAEGLAKRIV
DGEVPEVLSA ATVFSLDMGT LLAGTRYRGD FEERLKQVLK ELEAHPNAIL FIDEIHTVIG
AGATSGGAMD ASNLLKPALA SGTIRCMGST TYKEYRQHFE KDRALVRRFQ KIDVNEPTVE
DAIAILKGLK PYFEDYHKLK YTNEAIESAV QLSSRYIHDR KLPDKAIDVI DESGAAQMLL
SENKRKKTIG IKEIEATVAT MARIPPKSVS KDDAEVLKHL EQTLKRVVFG QDKAIEALSA
SIKLARAGLR EPEKPIGCYL FSGPTGVGKT EVAKQLASTL GVELLRFDMS EYMERHTVSR
LIGAPPGYVG FDQGGLLTDG VDQHPHCVVL LDEIEKAHPD LYNVLLQIMD HGRLTDHNGK
QVNFRNVILI MTTNAGAADL ARQAFGFTRN KREGDDHEAI NRQFAPEFRN RLDAIVSFAH
LNADVIGMVV EKFVLQLEAQ LADRDVTIEL SEPAKAWLVQ HGYDEQMGAR PMARVIQEHI
KKPLADEVLF GKLKGGGHVK VVLVKDDAVA GVELEKIGFE FLDGPVTPKP ENLPGKKRGG
SARKAKVKDQ VEKA
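
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the method used to produce this record: it assumes the sequence is pasted in the space-separated blocks shown above, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acid to each codon as the standard code, so one lookup table serves; the sketch does not handle alternative start codons, which does not matter here because the gene starts with ATG.

```python
# Codon lookup built from the usual TCAG ordering.
# Table 11 (bacterial) uses the same codon-to-amino-acid assignments
# as the standard code; only the permitted start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean("ATGCCGACAT TTTCCCAAAG CCTTGAGCAA")
print(translate(fragment))  # MPTFSQSLEQ
```

The output matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value to compare with the GC content field.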