Gene Information |
Locus tag | RPD_3972 |
Symbol | |
ID | 4024489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4419952 |
End bp | 4422087 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637964175 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_571092 |
Protein GI | 91978433 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
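
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4419952
end_bp = 4422087

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2136 711
```

Under these assumptions the computed values agree with the Gene Length (2136 bp) and Protein Length (711 aa) fields.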
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.379149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCA AGGCGAAGAC GGTTCAGTTG AAAGACAAGG AAAAGGACGA CAAGGCCGAC GCGCCGGAGA AGGACTCCGC CGACGCTCCC TCGCCGTTGC TCGACCTGTC GGACGCGGCC GTCAAGAAGA TGATCAAGCA GGCCAAGAAG CGCGGCTTCG TGACCTTCGA TCAGCTCAAC GAAGTGCTGC CCTCCGACAC CACGTCGCCG GAGCAGATCG AGGACATCAT GTCGATGCTG TCCGATATGG GCATCAACGT TTCCGAGGCC GAGGAAAGCG ACAGCGAGGA CGAAGAGTCC AAGGACGAGG CCGAGGAAGA GCCCGATAAC GACCTCGTCG AGGTCACCCA AAAGGCCGTC ACCGAGACCA AGAAGTCCGA GCCCGGCGAG CGCACCGACG ATCCCGTCCG GATGTATCTG CGCGAGATGG GCACCGTCGA GCTGTTGTCG CGCGAGGGCG AAATCGCCAT CGCGAAACGG ATCGAGGCCG GGCGCGAGGC GATGATCGCC GGGCTGTGCG AAAGCCCGCT GACCTTCCAG GCGATCATCA TCTGGCGCGA CGAGCTCAAC GAAGGAAAGA TCTTCCTTCG CGACATCATC GATCTCGAAG CGACCTATGC GGGTCCCGAC GCCAAGAACA ACATGAACCC GGCGATGGCC GGCGAGACCG GCGAAGAAGC CTCGGCCGAA GGCGAGGGCG GAGCGCCCGC GCATCTCGCG CCGCCGGCCG CGCCGCCGTC GGCGACGCCG TTCCGCCCCG CGCAGCAGCG CGCCGCGCCG TCTCAGGCCC CCGCCGGAGA AGGCGGAGGT GAAGGCGCCG CCGAAGGCGA CATGGACGAC GACGAGTTCG AAAACCAGAT GTCGCTCGCC GCGATCGAGG CCGAACTCAA GCCGAAGGTG GTCGAGACCT TCGACAAGAT CGCCGACAAC TACAAGAAGC TCCGCAAGCT GCAGGAGCAG GACATCGCCA ACCAGCTCGA GAGCGCGTCG CAGGGACCAT CGCTGTCGCC GTCGCAGGAG CGCAAGTACA AGAAGCTCAA GGACGAAATC ATCGTCGAGG TGAAGTCGCT GCGGCTCAAT CAGGCCCGTA TCGATTCGCT GGTCGAGCAG CTCTACGACA TCAACAAAAA GCTGGTGTCG TTCGAAGGCC GCCTGCTGCG GCTCGGCGAC AGCCACGGCG TCGCCCGCGA AGACTTTCTG CGCAACTATC AGGGCTCCGA GCTCGATCCG CGCTGGCTCA ACCGCGTCTC GAAACTGAGC GCTAAAGGCT GGAAGAACTT CGTCCACCAC GAGAAGGACC GGATCAAGGA ATTGCGCCAG GAGATCCAGT CGATGGCCGC ATTGACCGGC CTCGAGATCG GCGAATTCCG CAAGATCGTG CACTCGGTGC AGAAGGGCGA GCGCGAAGCC CGCCAGGCCA AGAAGGAAAT GGTCGAGGCC AATCTGCGTC TCGTGATCTC GATCGCCAAG AAATACACCA ATCGCGGCCT GCAGTTCCTC GATCTCATTC AAGAGGGCAA TATCGGCCTG ATGAAGGCGG TCGACAAATT CGAATATCGC CGCGGCTACA AATTCTCGAC CTACGCGACG TGGTGGATCC GGCAGGCGAT CACGCGCTCG ATCGCCGACC AGGCCCGCAC GATTCGCATC CCGGTGCACA TGATCGAGAC GATCAACAAG ATCGTGCGCA CCTCGCGGCA GATGCTCAAC GAGATCGGCC GCGAACCGAC CCCGGAGGAG CTTGCCGAAA AGCTCGGCAT GCCGCTGGAG AAGGTGCGCA AGGTCCTAAA GATCGCCAAG 
GAGCCGCTGT CGCTCGAAAC CCCGGTGGGT GACGAAGAGG ACAGCCATCT CGGCGATTTC ATCGAGGACA AGAACGCGGT GCTGCCGATC GATGCCGCGA TCCAGTCGAA CCTGCGCGAG ACCACCACGC GCGTGCTCGC CTCCCTGACG CCGCGCGAAG AACGCGTACT CCGGATGCGC TTCGGCATCG GCATGAACAC CGACCACACG CTGGAAGAAG TCGGCCAGCA GTTTTCGGTG ACCCGCGAAC GTATCCGCCA GATCGAAGCC AAGGCGCTGC GCAAGCTGAA GCATCCGTCA CGGTCGCGGA AGCTGCGGAG CTTCTTGGAT AACTGA
|
Protein sequence | MASKAKTVQL KDKEKDDKAD APEKDSADAP SPLLDLSDAA VKKMIKQAKK RGFVTFDQLN EVLPSDTTSP EQIEDIMSML SDMGINVSEA EESDSEDEES KDEAEEEPDN DLVEVTQKAV TETKKSEPGE RTDDPVRMYL REMGTVELLS REGEIAIAKR IEAGREAMIA GLCESPLTFQ AIIIWRDELN EGKIFLRDII DLEATYAGPD AKNNMNPAMA GETGEEASAE GEGGAPAHLA PPAAPPSATP FRPAQQRAAP SQAPAGEGGG EGAAEGDMDD DEFENQMSLA AIEAELKPKV VETFDKIADN YKKLRKLQEQ DIANQLESAS QGPSLSPSQE RKYKKLKDEI IVEVKSLRLN QARIDSLVEQ LYDINKKLVS FEGRLLRLGD SHGVAREDFL RNYQGSELDP RWLNRVSKLS AKGWKNFVHH EKDRIKELRQ EIQSMAALTG LEIGEFRKIV HSVQKGEREA RQAKKEMVEA NLRLVISIAK KYTNRGLQFL DLIQEGNIGL MKAVDKFEYR RGYKFSTYAT WWIRQAITRS IADQARTIRI PVHMIETINK IVRTSRQMLN EIGREPTPEE LAEKLGMPLE KVRKVLKIAK EPLSLETPVG DEEDSHLGDF IEDKNAVLPI DAAIQSNLRE TTTRVLASLT PREERVLRMR FGIGMNTDHT LEEVGQQFSV TRERIRQIEA KALRKLKHPS RSRKLRSFLD N
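
As a spot check on the two sequence fields, the first three 10-base blocks of the gene sequence can be translated and compared with the first block of the protein sequence. This is a minimal sketch, not the database's own method. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares for internal codons. The helper names `clean`, `translate` and `gc_fraction` are hypothetical and do not come from the record.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the Gene sequence field and first 10 residues of the
# Protein sequence field, copied from the record.
gene_prefix = "ATGGCCAGCA AGGCGAAGAC GGTTCAGTTG"
protein_prefix = "MASKAKTVQL"

print(translate(gene_prefix) == protein_prefix)  # True
```

Passing the full Gene sequence field to `gc_fraction` would give a value to compare against the GC content field. The result is not stated here because it has not been computed for the full sequence.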