Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0415 |
Symbol | |
ID | 3970867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 445890 |
End bp | 447551 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637923530 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_530309 |
Protein GI | 90421939 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.774408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTGA CCCAACGCTT AGAATTCCGC CAGTCGCAAT CGCTGGTGAT GACGCCGCAG CTGATGCAGG CGATCAAGCT GCTGCAACTG TCGAATCTCG ACCTCGCCAG CTTTGTGGAG GACGAGCTCG AGCGGAACCC GCTGCTCGAT CGCGCCAATG AGAACGGCGA ACCGCCGGTT CCGGGCGAGC TCGCGCCGGA GCGCGCCGAA TTCTCCGACC GCGACGACGC CGCGTCGGGA TCTGGCGAGG ATGCGGGGGC GGATGGCTCG GATTTTTCCG ACCGCGCAGC CGGCGATGGC TTTGAAGCCT CGGCCGAGGA CTGGATGCAG CGCGACCTCG GCAGCCGCGC CGAGATCGAA CAGACCCTCG ACACCGGCCT CGACAATGTA TTTTCCGAAG AGCCGGCCGA GACCGCTGCG CGCACCGCGC AGGACGCAGC CCCAACCGCC TTCACCGAAT GGGGCGGCGG CTCGTCGAAC GACGACGGCT ACAATCTGGA AGCCTTCGTG GCCGCCGAAA TCACGCTGGG CGGCCATCTG GCCGAACAGC TGGCGGTGGC GTTCACCGAT CCCAAGCAGC GGCTGATCGG GCAGTACCTG GTCGACCTGG TCGACGACGC CGGCTATCTG CCGCCGGATC TCGGCGACGC CACCGAACGG CTCGGCGCCA GCGCCGAGCA GGTCGAGGCG GTGGTCGCAG TGCTGCAGAA ATTCGACCCC GCCGGGGTCT GCGCGCGCAA CCTCAGCGAA TGCCTGGCGA TCCAGCTGCG CGATCGCGAC CGCTACGATC CGGCGATGCA GGCCTTGGTC GAGCATCTCG ATCTGTTGGC CAAGCGCGAC ATCGGCGCGT TGCGCCGGAT CTGCGGCGTC GACGACGAGG ACCTCGCCGA CATGATCGGG GAGATCCGCC ATCTCGATCC GAAGCCGGGG CTGAAATTCG GCACCGCGCG GGTGCAGACC GTGGTGCCCG ACGTCTATGT GCGGCCCGGG CCGGACGGCG GCTGGCATGT CGAGCTGAAC AGCGAGACGC TGCCGAAGGT CCTGGTCAAT CAGGTGTATT ATTCCGAACT TTCGAAGACG ATCCGCAAGG ACGGCGACAA GGCCTACTTC ACCGATTGCC TGCAGAACGC CACCTGGCTG GTGCGCGCGC TGGATCAGCG CGCCCGCACC ATCCTCAAGG TCTCCACCGA GATCGTGCGC CAGCAGGACG GCTTCTTCAC CCAGGGCGTC GCCCATCTGC GGCCGCTCAA TCTGAAAGCC GTCGCAGATG CCATTCAGAT GCACGAGTCG ACAGTCTCGC GCGTGACCGC CAACAAGTAT ATGGCGACCA ATCGCGGGAT CTTCGAACTG AAATACTTCT TCACCGCGTC GATCGCCTCG GCAGACGGCG GCGAAGCCCA TTCCGCCGAG GCGGTGCGCC ATCACATCAA GCAATTGATC GACGGGGAAA ACCCGGCGAT TATTCTGTCC GACGACACCA TCGTTGAAAA ACTGCGCGAG GCTGGTATTG ACATCGCCCG GCGCACCGTC GCCAAGTACC GCGAAGCGAT GCGGATTCCT TCCTCCGTCC AGCGTCGCCG AGACAAGCAA AGCATGCTTG GAAATGCACT CACAGCGCCG GCAACAACGG CAGACCGGTC CCGCGACACC GCACCGGCTT GA
|
Protein sequence | MALTQRLEFR QSQSLVMTPQ LMQAIKLLQL SNLDLASFVE DELERNPLLD RANENGEPPV PGELAPERAE FSDRDDAASG SGEDAGADGS DFSDRAAGDG FEASAEDWMQ RDLGSRAEIE QTLDTGLDNV FSEEPAETAA RTAQDAAPTA FTEWGGGSSN DDGYNLEAFV AAEITLGGHL AEQLAVAFTD PKQRLIGQYL VDLVDDAGYL PPDLGDATER LGASAEQVEA VVAVLQKFDP AGVCARNLSE CLAIQLRDRD RYDPAMQALV EHLDLLAKRD IGALRRICGV DDEDLADMIG EIRHLDPKPG LKFGTARVQT VVPDVYVRPG PDGGWHVELN SETLPKVLVN QVYYSELSKT IRKDGDKAYF TDCLQNATWL VRALDQRART ILKVSTEIVR QQDGFFTQGV AHLRPLNLKA VADAIQMHES TVSRVTANKY MATNRGIFEL KYFFTASIAS ADGGEAHSAE AVRHHIKQLI DGENPAIILS DDTIVEKLRE AGIDIARRTV AKYREAMRIP SSVQRRRDKQ SMLGNALTAP ATTADRSRDT APA
|
| |