Gene RPD_4141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4141 
Symbol 
ID4024663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4609730 
End bp4611646 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content66% 
IMG OID637964349 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_571261 
Protein GI91978602 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCA ATTTGCGGAA TTTCGCCCTC TGGGTCATTA TCGTCTTGCT GCTTTTGGCG 
CTGTTTACAC TCTTCCAGAA TCCCGGCCAG CGCGCCGCTT CGCAGGACAT TTCGTTCTCG
CAGTTGTTGA GCGATGTCGA TCAGAATCGC GTTCGCGACG TGGTGATCCA GGGCCAGGAA
ATTCACGGCA CCTTCACCAA CGGCTCGACC TTCCAGACCT ACGCGCCGAA CGATCCGTCG
CTGGTCACGC GCCTGTACAA CGGCAAGGTT GCGATCACCG CGAAGCCGCC GGGCGACAAC
GTGCCGTGGT TCGTTTCGCT GCTCGTCTCC TGGCTGCCAT TCATCGCGCT GATCGGCGTC
TGGATTTTCC TGTCGCGGCA GATGCAGGGC GGCGCCGGCA AGGCGATGGG CTTTGGCAAG
TCGCGCGCCA AGATGCTGAC CGAGGCGCAT GGCCGCGTCA CCTTCGAGGA CGTCGCCGGC
GTCGACGAGG CCAAGCAGGA CCTGCAGGAG ATCGTCGAGT TCCTGCGCGA CCCGGGCAAG
TTCCAGCGCC TCGGCGGCCG CATTCCGCGC GGCGTGCTGC TGGTCGGCCC GCCCGGCACC
GGTAAGACGC TGATCGCCCG CGCCGTGGCC GGCGAAGCCA ATGTGCCGTT CTTCACGATT
TCGGGTTCGG ACTTCGTCGA AATGTTCGTC GGCGTCGGCG CCAGCCGCGT CCGCGACATG
TTCGAGCAGG CCAAGAAGAA CGCGCCGTGC ATCATCTTCA TCGACGAAAT CGACGCGGTC
GGCCGTCATC GTGGCGCCGG TCTTGGCGGC GGTAACGACG AGCGCGAGCA GACCCTCAAC
CAGCTGCTGG TCGAGATGGA CGGCTTCGAG GCCAATGAGG GCGTGATCCT GATCGCCGCC
ACCAACCGGC CCGACGTGCT CGATCCGGCG CTGCTGCGTC CCGGCCGCTT CGACCGCCAG
GTGGTGGTGC CGAATCCGGA CGTGGTCGGC CGCGAGCAGA TCCTCAAGGT CCATGTCCGC
AAGGTGCCGC TGGCGCCGGA TATCAATCTG AAGACCATCG CGCGCGGCAC GCCCGGCTTC
TCGGGCGCCG ACCTGATGAA CCTCGTCAAC GAGGCGGCGC TGATGGCGGC CCGGCGCAAC
AAGCGCATGG TCACCCAGGC CGAGTTCGAA GACGCCAAGG ACAAGGTGAT GATGGGCGCC
GAGCGCAAGT CGCTGGTGAT GACGGAGGAG GAGAAGCTCC TCACCGCCTA TCACGAAGGC
GGCCACGCCA TCGTCGGCCT GAACGTCGTC GCCACCGATC CGATCCACAA GGCGACCATC
ATTCCGCGCG GCCGAGCGCT CGGCATGGTG ATGCAGCTGC CCGAGCGCGA CAAGCTGTCG
ATGTCGCTCG AGCAGATGAC CTCGCGCCTC GCGATCATGA TGGGCGGCCG CGTCGCCGAA
GAGATGATCT TCGGTCGCCA GAAGGTGACC TCGGGCGCGT CGTCCGACAT CGAGCAGGCC
ACCCGCCTGG CCCGGATGAT GGTGACCCGC TGGGGCCTTT CGGAGGAACT CGGCACGGTG
TCGTATGGCG AGAACCAGGA CGAGGTGTTC CTCGGGATGT CGGTCTCGCG CACCCAGAAC
GCCTCCGAGG CGACGGTCCA GAAGATCGAC GCCGAGATCA AGCGGCTGGT TCAGGAGGGT
TACGACGAGG CCGAGCGCAT CCTCACCGAA AAGCGCGCCG ACCTCGAAGC GCTCGCCAAG
GGCCTGCTGG AGTTCGAGAC GCTGACCGGC GACGAGATCA CCGATCTCAT CAACGGCAAG
AAGCCGAACC GCGAATCGGT GCTGGAGCCG TCTGGCCCGC GCACCTCGGC TGTCCCGCCG
GCCGGCAAGC CGCGGCCGCG GCCCGATCCC GGCCTGGAGC CGCAGCCGCA GGCCTGA
 
Protein sequence
MNANLRNFAL WVIIVLLLLA LFTLFQNPGQ RAASQDISFS QLLSDVDQNR VRDVVIQGQE 
IHGTFTNGST FQTYAPNDPS LVTRLYNGKV AITAKPPGDN VPWFVSLLVS WLPFIALIGV
WIFLSRQMQG GAGKAMGFGK SRAKMLTEAH GRVTFEDVAG VDEAKQDLQE IVEFLRDPGK
FQRLGGRIPR GVLLVGPPGT GKTLIARAVA GEANVPFFTI SGSDFVEMFV GVGASRVRDM
FEQAKKNAPC IIFIDEIDAV GRHRGAGLGG GNDEREQTLN QLLVEMDGFE ANEGVILIAA
TNRPDVLDPA LLRPGRFDRQ VVVPNPDVVG REQILKVHVR KVPLAPDINL KTIARGTPGF
SGADLMNLVN EAALMAARRN KRMVTQAEFE DAKDKVMMGA ERKSLVMTEE EKLLTAYHEG
GHAIVGLNVV ATDPIHKATI IPRGRALGMV MQLPERDKLS MSLEQMTSRL AIMMGGRVAE
EMIFGRQKVT SGASSDIEQA TRLARMMVTR WGLSEELGTV SYGENQDEVF LGMSVSRTQN
ASEATVQKID AEIKRLVQEG YDEAERILTE KRADLEALAK GLLEFETLTG DEITDLINGK
KPNRESVLEP SGPRTSAVPP AGKPRPRPDP GLEPQPQA