Gene Information |
Locus tag | RPB_1818 |
Symbol | |
ID | 3908977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2079680 |
End bp | 2081596 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637883712 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_485437 |
Protein GI | 86748941 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.192895 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.442759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCA ATCTGCGGAA TTTCGCCCTC TGGGTCATTA TCGTCTTGCT GCTTTTGGCG CTGTTTACGC TCTTCCAGAA TCCCGGCCAG CGCGCCGCAT CGCAGGACAT CTCGTTCTCG CAGCTGCTCA GCGAGGTCGA TCAGAATCAC GTTCGCGACG TGGTGATCCA GGGCCAGGAA ATTCACGGCA CCTTCACCAA CGGCTCGAGC TTCCAGACCT ACGCGCCGAA CGATCCGTCG CTCGTTACCC GCCTGTACAA CGGTAAGGTC GCGATCACCG CGAAGCCGCC GGGCGACAAT GTGCCGTGGT TCGTGTCGCT GCTGGTGTCG TGGCTGCCGT TCATCGCGCT GATTGGCGTC TGGATCTTCC TGTCGCGGCA GATGCAGGGC GGCGCCGGCA AGGCGATGGG CTTCGGCAAA TCGCGCGCCA AGATGCTGAC CGAGGCGCAC GGCCGCGTCA CCTTCGAGGA CGTCGCCGGC GTCGACGAGG CCAAGCAGGA CCTGCAGGAG ATCGTCGAGT TCCTGCGCGA CCCGGGCAAG TTCCAGCGGC TCGGCGGACG GATTCCGCGC GGCGTGCTGC TGGTCGGTCC GCCTGGCACG GGTAAGACGC TGATCGCGCG TGCGGTCGCC GGCGAAGCCA ACGTGCCGTT CTTCACCATC TCCGGTTCGG ACTTCGTCGA AATGTTCGTC GGCGTCGGCG CCTCTCGGGT GCGCGACATG TTCGAGCAGG CCAAGAAGAA TGCCCCCTGC ATCATCTTCA TCGACGAAAT CGACGCGGTC GGTCGTCATC GTGGCGCCGG TCTCGGCGGC GGCAATGACG AGCGCGAGCA GACCCTCAAC CAGTTGCTGG TCGAGATGGA CGGGTTCGAG GCCAATGAGG GCGTGATCCT GATCGCGGCC ACCAACCGGC CCGACGTGCT CGATCCCGCG CTGCTGCGTC CCGGCCGTTT CGACCGTCAG GTCGTGGTGC CGAACCCGGA CGTCGTCGGC CGCGAGCAGA TCCTCAAGGT GCATGTCCGC AAGGTGCCGC TGGCGCCGGA TATCAACCTC AAGACCATCG CGCGCGGCAC GCCGGGATTC TCCGGCGCCG ACCTGATGAA CCTCGTCAAC GAAGCGGCGC TGATGGCGGC CCGGCGTAAC AAGCGCATGG TCACCCAGGC CGAATTCGAA GACGCCAAGG ACAAGGTGAT GATGGGCGCC GAGCGCAAGT CGCTGGTGAT GACGGAGGAG GAGAAGCTTC TCACCGCCTA TCACGAGGGC GGCCACGCGA TCGTCGGCCT CAATGTTCCG GCCACCGACC CGATCCACAA GGCCACCATC ATCCCGCGCG GCCGCGCGCT CGGCATGGTG ATGCAGCTTC CCGAGCGCGA CAAGCTGTCG ATGTCGCTGG AGCAGATGAC CTCGCGCCTC GCCATCATGA TGGGCGGCCG CGTCGCCGAA GAAATGATCT TCGGCCGCCA GAAGGTGACG TCGGGCGCTT CGTCCGACAT CGAGCAGGCG ACCCGATTGG CCCGCATGAT GGTCACGCGC TGGGGTCTGT CGGAAGAGCT CGGCACCGTG TCGTATGGCG AGAACCAGGA CGAAGTATTC CTCGGCATGT CGGTGTCGCG CACCCAGAAC GCGTCGGAAG CGACGGTTCA GAAGATCGAC GCCGAGATCA AGCGGTTGGT CGAAGAGGGC TACAAGGAAG CCGAGCGTAT TCTCACCGAG AAGCGCGCGG ACCTCGAAGC CCTCGCCAAG GGTCTGCTCG AGTTCGAGAC GCTGACCGGC GACGAGATCA CCGATCTCAT GAACGGCAAG 
AAGCCGAACC GCGAGTCGGT GCTGGAGCCC TCGGGCCCGC GCACCTCGGC TGTCCCGCCG GCCGGCAAGC CGCGGCCGCG TCCCGATACT GGCCTGGAGC CGCAGCCCCA GGCGTAA
|
Protein sequence | MNANLRNFAL WVIIVLLLLA LFTLFQNPGQ RAASQDISFS QLLSEVDQNH VRDVVIQGQE IHGTFTNGSS FQTYAPNDPS LVTRLYNGKV AITAKPPGDN VPWFVSLLVS WLPFIALIGV WIFLSRQMQG GAGKAMGFGK SRAKMLTEAH GRVTFEDVAG VDEAKQDLQE IVEFLRDPGK FQRLGGRIPR GVLLVGPPGT GKTLIARAVA GEANVPFFTI SGSDFVEMFV GVGASRVRDM FEQAKKNAPC IIFIDEIDAV GRHRGAGLGG GNDEREQTLN QLLVEMDGFE ANEGVILIAA TNRPDVLDPA LLRPGRFDRQ VVVPNPDVVG REQILKVHVR KVPLAPDINL KTIARGTPGF SGADLMNLVN EAALMAARRN KRMVTQAEFE DAKDKVMMGA ERKSLVMTEE EKLLTAYHEG GHAIVGLNVP ATDPIHKATI IPRGRALGMV MQLPERDKLS MSLEQMTSRL AIMMGGRVAE EMIFGRQKVT SGASSDIEQA TRLARMMVTR WGLSEELGTV SYGENQDEVF LGMSVSRTQN ASEATVQKID AEIKRLVEEG YKEAERILTE KRADLEALAK GLLEFETLTG DEITDLMNGK KPNRESVLEP SGPRTSAVPP AGKPRPRPDT GLEPQPQA
|
| |
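The coordinate and length fields in the Gene Information table are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; both assumptions are consistent with the values listed but are not stated by the record itself. The function names (`span_length`, `expected_protein_length`, `gc_fraction`) are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table in this record.
START_BP = 2079680
END_BP = 2081596
GENE_LENGTH_BP = 1917
PROTEIN_LENGTH_AA = 638


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence field (with its spacing intact) to `gc_fraction` would give a value to compare against the listed GC content; the helper strips whitespace so the ten-base blocks can be pasted as-is.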