Gene Information |
Locus tag | RPB_4131 |
Symbol | |
ID | 3911939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4703578 |
End bp | 4705722 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637886035 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_487734 |
Protein GI | 86751238 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
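
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(4703578, 4705722)
print(length)                     # 2145
print(protein_length_aa(length))  # 714
```

Both values match the Gene Length and Protein Length rows of the record.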
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.138863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCA AGGCGAAGAC GGTTCAGTTG AAAGACAAGG AAAAGGACGA CAAGGCCGAC GCGCCGGAGA AGGACTCCGC CGACGCTCCC TCGCCGTTGC TCGACCTGTC GGACGCGGCC GTCAAGAAGA TGATCAAGCA GGCCAAGAAG CGCGGCTTCG TGACCTTCGA TCAGCTCAAC GAAGTGCTGC CGTCCGACAC CACGTCGCCG GAGCAGATCG AGGACATCAT GTCGATGCTC TCCGACATGG GCATCAACGT GTCCGAGGCG GAAGAGAGCG ACAGCGAGGA CGAGGACGCC AAGGACGAGG CCGAGGAAGA GCCCGATAAC GACCTCGTCG AGGTCACCCA GAAGGCCGTC ACCGAGACCA AGAAGTCCGA GCCCGGCGAG CGCACCGACG ATCCCGTCCG GATGTATCTG CGCGAGATGG GCACCGTCGA GCTGCTGTCG CGCGAGGGCG AAATCGCCAT CGCCAAGCGG ATCGAGGCCG GCCGCGAGGC GATGATCGCC GGACTCTGCG AAAGCCCGCT GACCTTCCAG GCGATCATCA TCTGGCGCGA CGAGCTCAAC GAAGGCAAGA TCTTCCTTCG CGACATCATC GATCTCGAAG CCACCTATGC GGGCCCCGAC GCCAAGGCCA ACATGAACCC TGCGATGGCC GAGGGCGCCG GCGAGGAAGC CAGCGCCGAA GGCGAGGCCG ACGCCGGAGC GCCCGCGCAT GTCGCGCCGC CCGCCGCGCC GCCGACGGCG ACGCCGTTCC GTCCCGCGCA GCAGCGCTCG GCGCCCGCCG CGGCCCCGGC CGCCGGCGAG GGCGGCGGTG AAGGCGCCGC CGAAGGCGAC ATGGACGACG ACGAGTTCGA GAACCAGATG TCGCTCGCCG CCATCGAGGC CGAACTCAAG CCGAAGGTGG TCGAGACCTT CGACAAGATC GCCGACAACT ACAAGAAGCT GCGCAAGCTG CAAGAGCAGG ACATCGCCAA CCAGCTCGAA AGCGCGTCGC AGGGGCCGTC GCTGTCGCCC TCGCAGGAGC GCAAATACAA GAAGCTCAAG GACGAAATCA TCGTCGAGGT GAAGTCGCTG CGGCTCAATC AGGCGCGTAT CGACTCACTG GTCGAGCAGC TCTACGACAT CAACAAGAAG CTGGTGTCGT TCGAAGGCCG GCTGCTGCGC CTCGGCGACA GCCACGGCGT GGCGCGCGAG GACTTCCTGC GCAACTATCA GGGCTCCGAG CTCGATCCGC GCTGGCTCAA CCGCGTCTCG AAACTCTCCG CCAAGGGCTG GAAGAACTTC GTCCATTTCG AGAAGGACCG GATCCGCGAG CTGCGCCAGG AAATCCAGTC GATGGCCGCG CTCACCGGGC TCGAGATCGG CGAATTCCGC AAGATCGTGC ATTCGGTGCA GAAGGGCGAG CGCGAAGCCC GCCAGGCCAA GAAGGAAATG GTCGAGGCCA ACCTGCGTCT GGTGATCTCG ATTGCCAAGA AATACACCAA CCGCGGCCTG CAGTTCCTCG ATCTCATTCA GGAAGGCAAT ATCGGCCTGA TGAAGGCGGT CGACAAATTC GAATATCGCC GCGGCTACAA ATTCTCCACC TATGCGACGT GGTGGATCCG GCAGGCGATC ACGCGTTCGA TCGCCGACCA GGCCCGCACC ATCCGCATTC CGGTGCACAT GATCGAGACG ATCAACAAGA TCGTGCGCAC CTCGCGGCAG ATGCTCAACG AGATCGGCCG CGAGCCGACC CCGGAAGAGC TCGCCGAGAA GCTCGGCATG CCGCTGGAGA AGGTCCGCAA GGTCCTCAAG 
ATCGCCAAGG AGCCGCTGTC GCTCGAAACC CCGGTGGGTG ACGAAGAGGA CAGCCATCTC GGCGACTTCA TCGAGGACAA GAACGCGGTG CTGCCGATCG ACGCCGCGAT CCAGTCGAAC CTGCGCGAAA CCACGACGCG CGTGCTCGCC TCGCTCACCC CGCGCGAAGA ACGCGTGCTG CGCATGCGCT TCGGCATCGG CATGAACACC GACCACACGC TGGAAGAAGT CGGCCAGCAG TTCTCGGTCA CCCGCGAACG CATCCGCCAG ATCGAAGCCA AGGCGCTGCG CAAGTTGAAG CATCCGAGCC GGTCAAGGAA GCTGCGGAGC TTCTTGGACA ACTGA
|
Protein sequence | MASKAKTVQL KDKEKDDKAD APEKDSADAP SPLLDLSDAA VKKMIKQAKK RGFVTFDQLN EVLPSDTTSP EQIEDIMSML SDMGINVSEA EESDSEDEDA KDEAEEEPDN DLVEVTQKAV TETKKSEPGE RTDDPVRMYL REMGTVELLS REGEIAIAKR IEAGREAMIA GLCESPLTFQ AIIIWRDELN EGKIFLRDII DLEATYAGPD AKANMNPAMA EGAGEEASAE GEADAGAPAH VAPPAAPPTA TPFRPAQQRS APAAAPAAGE GGGEGAAEGD MDDDEFENQM SLAAIEAELK PKVVETFDKI ADNYKKLRKL QEQDIANQLE SASQGPSLSP SQERKYKKLK DEIIVEVKSL RLNQARIDSL VEQLYDINKK LVSFEGRLLR LGDSHGVARE DFLRNYQGSE LDPRWLNRVS KLSAKGWKNF VHFEKDRIRE LRQEIQSMAA LTGLEIGEFR KIVHSVQKGE REARQAKKEM VEANLRLVIS IAKKYTNRGL QFLDLIQEGN IGLMKAVDKF EYRRGYKFST YATWWIRQAI TRSIADQART IRIPVHMIET INKIVRTSRQ MLNEIGREPT PEELAEKLGM PLEKVRKVLK IAKEPLSLET PVGDEEDSHL GDFIEDKNAV LPIDAAIQSN LRETTTRVLA SLTPREERVL RMRFGIGMNT DHTLEEVGQQ FSVTRERIRQ IEAKALRKLK HPSRSRKLRS FLDN
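
The sequences in this record are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text might be cleaned, translated, and checked for GC content. It uses only the first 30 bases of the gene sequence as an excerpt, and it assumes that the codon-to-amino-acid assignments of translation table 11 match the standard code (table 11 differs in which codons may initiate translation, which does not matter here because the gene starts with ATG). All function and variable names are illustrative.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
excerpt = "ATGGCCAGCA AGGCGAAGAC GGTTCAGTTG"
print(translate(excerpt))  # MASKAKTVQL
print(gc_fraction(excerpt))
```

The translated excerpt matches the first ten residues of the protein sequence above. The GC fraction of such a short excerpt is not expected to equal the 64% reported for the whole gene.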
|
| |