Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0971 |
Symbol | |
ID | 3969695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1066325 |
End bp | 1068433 |
Gene Length | 2109 bp |
Protein Length | 702 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637924087 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_530860 |
Protein GI | 90422490 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0743249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00275822 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCACCA AGGCAAAGAC GCTGCAGACC AAGGACACCA AGGACGACAA GGTCGTCGAC GCGCCGGAAA AGGATGCTCC GGACGCGCCG TCGCCGTTGC TCGACCTCTC CGACGCAGCC GTCAAGAAGA TGATCAAGCA GGCCAAGAAG CGCGGCTTCG TCACCTTCGA TCAGCTCAAT GAAGTGCTGC CGTCCGACAC CACCTCGCCG GAGCAGATCG AGGACATCAT GTCGATGCTC TCCGACATGG GCATCAACGT GTCGGAAGCC GACGATGCCG ACAGCGACGA AGAGGACGCC AAGGAAGAGG CCGAGGAAGA GCCCGACAAC GAACTGGTCG AGGTCACCGC CAAGGCCGTC ACCGAGACCA AGAAGTCCGA GCCCGGCGAG CGCACCGACG ATCCGGTGCG GATGTACTTG CGCGAAATGG GCACCGTCGA GCTGCTGTCG CGCGAGGGCG AAATCGCCAT CGCCAAGCGG ATCGAGGCCG GCCGCGAGGC GATGATCGCA GGGCTCTGCG AAAGCCCGCT GACGTTCCAG GCCATCATCA TCTGGCGCGA TGAATTGAAC GAGGGAAAGA TCTTCCTGCG CGACATCATC GATCTGGAAG CCACCTATGC GGGTCCGGAC GCCAAGAACA ACATGAACCC GGCGCTGATC GCGCCCCCCG CCGCCGCCGA TGGCGAGGCC GCCGACGGCG CCGAAGCCGT TGCAGCGCCT CCGGCCGCGC CGCCCTCGGC GACCCCGTTC CGCGCCGCGC CGCCGCCGCG CCCGGCCGAC GAGCCCAAGG AATCCTCGGA GTCCGCCGAT GGCGATGCCG ACGACGACGA GTTCGAAAAC CAGATGTCGC TCGCCGCGAT CGAGGCCGAG CTGAAGCCGA AGGTGGTCGA GACCTTCGAC AAGATCGCCG AGAACTACAA GAAGCTGCGC CGCCTGCAGG AGCAGGACAT CTCCAACCAG CTGCAGAACG AGACGCTGTC GCCGGCGCAG GAGCGCAAAT ACAAGAAGCT CAAGGACGAG ATCATCGTCG AGGTGAAGTC GCTGCGGCTG AACCAGGCGC GTATCGATTC GCTCGTGGAG CAGCTCTACG ACATCAACAA GAAGCTGGTT TCGTTCGAAG GCCGCCTGCT GCGGCTCGGC GACAGCCACG GCGTCGCCCG CGAAGATTTT CTGCGCAACT ACCAGGGCTC CGAGCTCGAT CCGCGCTGGC TCAACCGGGT GTCGAAACTT TCCGCCAAGG GCTGGAAGAA TTTCGTCGCC CACGAAAAGG ACCGCATCAA GGAGCTGCGC GGCGAGATCC AGTCGCTGGC CGCTCTCACC GGGCTCGAGA TCGGCGAATT CCGCAAGATC GTGCACGGCG TGCAGAAGGG CGAGCGCGAG GCGCGCCAGG CCAAGAAGGA AATGGTCGAG GCGAATTTGC GTCTCGTCAT CTCGATCGCG AAAAAATACA CCAACCGCGG CCTGCAATTC CTCGACCTCA TTCAAGAGGG CAATATCGGC CTGATGAAGG CGGTCGATAA GTTCGAATAT CGCCGCGGCT ACAAGTTCTC GACCTACGCC ACCTGGTGGA TCCGGCAGGC GATCACCCGC TCGATCGCCG ACCAGGCCCG CACGATCCGC ATTCCCGTGC ACATGATCGA GACCATCAAC AAGATCGTGC GCACCTCGCG GCAGATGCTG AACGAGATCG GCCGCGAACC CACTCCCGAG GAATTGGCCG AAAAGCTCGG CATGCCGCTG GAGAAGGTCC GCAAGGTCCT GAAGATCGCC AAGGAGCCGT TGTCGCTGGA GACCCCGGTC GGCGACGAGG AGGATTCGCA TCTCGGCGAT TTCATCGAGG ACAAGAACGC GATCCTGCCG ATCGACGCCG CGATCCAAAG CAACCTGCGC GAAACCACCA CGCGGGTGCT GGCGTCGCTG ACACCGCGCG AAGAGCGCGT GCTGCGCATG CGCTTCGGCA TCGGCATGAA CACCGACCAC ACGCTGGAAG AAGTCGGCCA GCAGTTCTCG GTGACCCGCG AACGCATCCG GCAGATCGAA GCCAAGGCGC TGCGCAAGCT GAAGCATCCG AGCCGGTCGA GGAAGCTGCG GAGCTTCTTG GATAACTAA
|
Protein sequence | MATKAKTLQT KDTKDDKVVD APEKDAPDAP SPLLDLSDAA VKKMIKQAKK RGFVTFDQLN EVLPSDTTSP EQIEDIMSML SDMGINVSEA DDADSDEEDA KEEAEEEPDN ELVEVTAKAV TETKKSEPGE RTDDPVRMYL REMGTVELLS REGEIAIAKR IEAGREAMIA GLCESPLTFQ AIIIWRDELN EGKIFLRDII DLEATYAGPD AKNNMNPALI APPAAADGEA ADGAEAVAAP PAAPPSATPF RAAPPPRPAD EPKESSESAD GDADDDEFEN QMSLAAIEAE LKPKVVETFD KIAENYKKLR RLQEQDISNQ LQNETLSPAQ ERKYKKLKDE IIVEVKSLRL NQARIDSLVE QLYDINKKLV SFEGRLLRLG DSHGVAREDF LRNYQGSELD PRWLNRVSKL SAKGWKNFVA HEKDRIKELR GEIQSLAALT GLEIGEFRKI VHGVQKGERE ARQAKKEMVE ANLRLVISIA KKYTNRGLQF LDLIQEGNIG LMKAVDKFEY RRGYKFSTYA TWWIRQAITR SIADQARTIR IPVHMIETIN KIVRTSRQML NEIGREPTPE ELAEKLGMPL EKVRKVLKIA KEPLSLETPV GDEEDSHLGD FIEDKNAILP IDAAIQSNLR ETTTRVLASL TPREERVLRM RFGIGMNTDH TLEEVGQQFS VTRERIRQIE AKALRKLKHP SRSRKLRSFL DN
|
| |