Gene Information |
Locus tag | RPB_0655 |
Symbol | |
ID | 3908580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 739032 |
End bp | 740672 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637882545 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_484277 |
Protein GI | 86747781 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
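The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in one stop codon that is not counted in the protein length. The variable names are chosen for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 739032
end_bp = 740672
gene_length_bp = 1641
protein_length_aa = 546

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop
# codon that does not contribute a residue to the protein.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print("span (bp):", span_bp)
print("codons:", codon_count)
print("expected protein length (aa):", expected_protein_length)
print("matches record:", span_bp == gene_length_bp
      and expected_protein_length == protein_length_aa)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.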
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.220841 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTAA CTCAACGCCT CGAGTTCCGA CAGTCACAGT CGCTGGTGAT GACGCCGCAG CTGATGCAGG CGATCAAGCT GCTGCAACTG TCGAATCTGG ACCTCGCGAC CTTCGTCGAG GACGAACTCG AGAAAAACCC CCTGCTGGAC CGGGCCAGTG ACAACGCCGA ACCGCCGGTC GCCGGCGAGG CAGTGATGGA GCGCGCCGAG GCCAGCGGCG ACGATTTCGG CGGGAGCGAG AGCAGCGGAG ACGGCTCTGA CTTCTCCGAC GGCGTCGGCA GCGACTCGTT CGAGCCGGGC GCCGAGGATT GGATGCACCG CGACCTCGGC AGCCGCAGCG AGATCGAGCA GACGCTCGAT ACCGGCATGG AAAACGTGTT TCCGGAAGAG CCGGCCGAGG CCGCCGCCCG CGCCGCGCAG GACGCGGCTC CAGCCTCCTA CACCGAATGG GGCGGCGGCG CCTCCAGCGA CGAGGGCTAC AATCTCGAGG CCTTCGTCGC CGCCGAAACC TCCCTGGCCG ATCGCCTCGC CGAGCAGCTT GCGGTGGCGC TCACCGCGCC GTCGCAGCGC ATGATCGGCC AATATCTGAT CGACCTCGTG GATGACGCCG GATATCTGCC GCCCGACCTC GGTGACGCCG CGGAGCGTCT CGGCACCACC CAGGCCGAAG TCGAAGCCGT CGTCGCCGTT CTGCAGACCT TCGATCCGCC GGGGATCTGC GCGCGCTCGC TGGCCGAATG CCTGGCGATC CAGTTGCGCG AACTCGACCG GTTCGACCCG GCGATGCAGG CTTTGGTCGA GAATCTGGAT CTCCTCGCCA AGCGCGACAT CGCCAGCCTC CGCAAGCTCT GCGGCGTCGA CGACGAGGAT CTCGCCGATA TGATCGGCGA AATCCGCCAT CTCGACCCGA AGCCGGGTCT GAAATTCGCA TCGTCGCGGG TGCAGACCGT GGTGCCGGAC GTGTTCGTCC GCCCCGGCCC GGACGGCGGC TGGCTGGTCG AACTCAACAG CGACACGCTG CCGAAGGTGC TGGTCAACCA GTCCTATTAC TCCGAACTGT CGAAGACGAT CCGCAAGGAC GGCGACAAGT CGTACTTCTC CGACTGCCTG CAGAACGCCA CCTGGCTGGT GCGCGCGCTC GACCAGCGTG CCCGCACCAT CCTGAAAGTG GCGACCGAGA TCGTGCGCCA GCAGGACGGC TTCTTCACCC ACGGCGTCGC GCATCTGCGG CCGCTGAATC TGAAGGCGGT GGCCGACGCG ATCCAGATGC ACGAATCCAC GGTATCGCGC GTGACCGCCA ACAAATACAT GGCGACCAAT CGGGGCACGT TCGAACTCAA GTATTTCTTT ACCGCTTCGA TCGCTTCCGC CGACGGCGGC GAGGCGCATT CGGCCGAAGC CGTGCGCCAT CACATCCGGC AGTTGATCGA CGGCGAAGAG CCGACCGCCA TCCTGTCGGA CGACACCATC GTCGAACGGC TGCGCGAAGC CGGCATCGAG ATTGCGCGCC GCACCGTCGC GAAGTATCGC GAGGCGATGC GGATCCCGTC GTCGGTGCAG CGGCGGCGCG ACAAGCAGAG CATGCTCGGC ACGGCCCTGG CGGCGCCCGC CGATCGGTCC CGCGACACCG CTCCGGCTTG A
|
Protein sequence | MALTQRLEFR QSQSLVMTPQ LMQAIKLLQL SNLDLATFVE DELEKNPLLD RASDNAEPPV AGEAVMERAE ASGDDFGGSE SSGDGSDFSD GVGSDSFEPG AEDWMHRDLG SRSEIEQTLD TGMENVFPEE PAEAAARAAQ DAAPASYTEW GGGASSDEGY NLEAFVAAET SLADRLAEQL AVALTAPSQR MIGQYLIDLV DDAGYLPPDL GDAAERLGTT QAEVEAVVAV LQTFDPPGIC ARSLAECLAI QLRELDRFDP AMQALVENLD LLAKRDIASL RKLCGVDDED LADMIGEIRH LDPKPGLKFA SSRVQTVVPD VFVRPGPDGG WLVELNSDTL PKVLVNQSYY SELSKTIRKD GDKSYFSDCL QNATWLVRAL DQRARTILKV ATEIVRQQDG FFTHGVAHLR PLNLKAVADA IQMHESTVSR VTANKYMATN RGTFELKYFF TASIASADGG EAHSAEAVRH HIRQLIDGEE PTAILSDDTI VERLREAGIE IARRTVAKYR EAMRIPSSVQ RRRDKQSMLG TALAAPADRS RDTAPA
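The two sequence fields can be related to each other, and to the GC content field, with a short script. This is a minimal sketch and not the database's own method. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons), and on the observation that the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand, so no reverse complement is applied here.

```python
# Codon assignments in the conventional TCAG ordering. These assignments are
# shared by the standard code and by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean_sequence(text)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first three blocks of the gene sequence from this record.
first_blocks = "ATGGCACTAA CTCAACGCCT CGAGTTCCGA"
print(translate(first_blocks))
print(round(100 * gc_fraction(first_blocks)), "% GC in these 30 bases")
```

The translated fragment corresponds to the first block of the protein sequence, MALTQRLEFR. Passing the full gene sequence to the same functions would allow the complete protein sequence and the reported GC content to be checked in the same way.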