Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0860 |
Symbol | |
ID | 3909118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 984404 |
End bp | 985372 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637882753 |
Product | hypothetical protein |
Protein accession | YP_484482 |
Protein GI | 86747986 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGGGG TCGCAGGGGC GGATTTCCAG CAGGACCTTC TGTTCGCGTT CGGTGGCGGA GATCCGCAGA TCACCATCGC GGTGCTGGAT GGGCCGGTGG ATCGCACACA TGATTGTTTT CGCGGCGCGC GGCTGGCTCC GCTCGACACG CCGGCTGCGG CGCGATGCGC CGAGGAATTC GCCAGCGCGC AGGGCACGCA TCTGGCGAGT CTGATTTTCG GCCAGCCCTG CAGTTCGGTC GAAGGCGTCG CGCCGCTCTG CCGCGGGCTG ATCGCACCGA TCTTCGGCGA CGACCGGCCG GGCTGCTCCC AGACCGATCT GGCGCAGGCG ATCGTGATGG CGCTGGATCA CGGCGCCCAC ATCCTCCACA TCAGCGGCGG ATTGTTCGAC GGGCGGCGGC AGCCGACCGC CGAACTGAGC GCGGCCGTGG CGCTGTGCAA CGACCACAAC GCGCTGATCG TCGCCGGCGC CGGCCGCGAT GGCTGCGCCA GCCTGCTGCG CCGCTGCGGC GCGGTGAATC TTCTCCCGGT GGGCGCGATC GACAGAAACG GCCGGCTGAT CGGCGGCAGC GACGCCAGCC TGCACGAGAT CGGCATCGCG GTCCCCGGCG CCAGGCTGAT CGGCGCGACG CTGGAAGGCG GCATCGGCGA GCGCCGCGGC GCGAACTACT CGGCCGCTTT GATCGCCGGC ATTGCCGGGC TGCTGCTCGG GACGCAGACC GCCACCGGCC GGGCCCACGA CCCCGGCGCG GCCATCGAGG CGATCCTCGG CAGCGCGACG CCGCGGCTCG CGGCGCGGGC CGACGAATGC CAGCGCGCCT GGATCGGCCG GGCAAATGTC GAAGCCGCCG CGGCCCGCCT CACCGGCGTG ATGTTCGACA GTGCCGCGAC GTCTCCGTTC GGACTGTGGC ATCGGGCGCT GACGAATGCG CAGCCGTCGG ACCACGCCAC GTTCCGGCCG CGGGCGTGA
|
Protein sequence | MVGVAGADFQ QDLLFAFGGG DPQITIAVLD GPVDRTHDCF RGARLAPLDT PAAARCAEEF ASAQGTHLAS LIFGQPCSSV EGVAPLCRGL IAPIFGDDRP GCSQTDLAQA IVMALDHGAH ILHISGGLFD GRRQPTAELS AAVALCNDHN ALIVAGAGRD GCASLLRRCG AVNLLPVGAI DRNGRLIGGS DASLHEIGIA VPGARLIGAT LEGGIGERRG ANYSAALIAG IAGLLLGTQT ATGRAHDPGA AIEAILGSAT PRLAARADEC QRAWIGRANV EAAAARLTGV MFDSAATSPF GLWHRALTNA QPSDHATFRP RA
|
| |