Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0052 |
Symbol | |
ID | 6407694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 55551 |
End bp | 57197 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642709960 |
Product | RNA polymerase factor sigma-54 |
Protein accession | YP_001989090 |
Protein GI | 192288485 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.15874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTGT CGCAACGTCT CGAATTCCGC CAGACCCAGT CTCTGGTGAT GACCCCGCAA CTGATGCAGG CGATCAAGCT GCTTCAGCTG TCGAATCTCG ACCTCGCGGC TTTCGTCGAG GACGAGATCG AGAAGAACCC GTTGCTCGAT CGCGCCGGCG ACAATGCGGA ACCGCCGGTG GCCGGTGAGG CATCGGCGGA AGGCGCAGAG GGCGGCGGTG AATTCGGCGG GAGCGGCGGC GAGGATCTCG GCGGCGAGGG CACGTCGGAT TTCGTCGATC CCGCAGGCGC CAGCGCATTC GAGCCCGGCA CCGAAGAATG GATGCATCGC GATCTCGGCA GCCGCAGCGA TATCGAGCAG ACCCTCGATA CCGGTATGGA AAACGTTTTC CCTGAAGAGC CAGCCGAAGC CGCGGCCCGC GCCGCGCAGG ACGCCGCACC GGCCTCCTAT ACCGAATGGG GCGGCGGGGC TTCATCGGAT GAGGGCTACA ATCTCGAAGC CTTCGTGGCC GCTGAATCGT CGCTGGCCGA TCATCTCGCC GAACAGCTCG CGGTCGCGGT AACGACGCCA TCGCAGCGGC TGATCGGACA ATACCTGATA GATCTGGTCG ATGATGCCGG CTATCTGCCG GCTGATCTCG GCGATGCCGC CGAACGGTTG GGCGCGAGCC AGGCCGAAGT GGAAGCGCTG GTGCAGGTGC TGCAGACCTT CGATCCGCCC GGTATCTGCG CCCGCAACCT GAGCGAATGC CTCGCCATCC AACTGCGCGA GCGCGACCGC TACGATCCGG CGATGCAGGC GTTGGTCGAG CACCTCGACT TGCTGGCCAA ACGCGACGTC GCGTCATTAC GCAAGATCTG CGGCGTCGAC GACGAAGACC TCGTCGACAT GATCGGCGAG ATTCGTCATC TCGATCCGAA GCCCGGCCTG AAGTTCGGCT CGTCGCGGGT GCAGACAGTT GTGCCCGACG TCTTCGTCCG TCCCGGCCCC GACGGCGGTT GGCTGGTCGA GCTCAACAGC GACACACTGC CGAAGGTGCT GGTCAACCAG TCATATTATT CCGAGCTGTC GAAGACGATC CGCAAGGACG GCGACAAGTC GTATTTCTCC GACTGCCTGC AGAACGCTAC CTGGCTGGTG CGCGCGCTCG ATCAGCGCGC CCGCACCATC CTGAAGGTGG CGACCGAGAT CGTGCGCCAG CAGGACGGCT TCTTCACCCA CGGCGTCAAG CATCTGCGGC CGCTGAATCT CAAGGCCGTG GCTGACGCGA TTCAGATGCA CGAATCGACG GTGTCGCGCG TCACCGCCAA CAAATACATG GCGACCAATC GCGGCACTTT CGAACTTAAG TATTTCTTCA CCGCGTCGAT CGCGTCCGCC GACGGCGGCG AGGCGCACTC GGCCGAAGCA GTACGGCACC AGATCCGCCA ACTGATCGAC AGCGAAGATC CGTCAGCGAT CCTGTCGGAT GATACGATCG TCGAACGGCT TCGCGAGGCC GGCATCGACA TCGCGCGCCG CACCGTCGCG AAATATCGCG AAGCGATGCG CATTCCCTCT TCAGTGCAGC GCCGTCGCGA CAAGCAGAAC ATGCTGGGCA CACAGGCCGG GAGCGCAAGC CGCTCCCGCG ACACAGCCCC AGCTTGA
|
Protein sequence | MALSQRLEFR QTQSLVMTPQ LMQAIKLLQL SNLDLAAFVE DEIEKNPLLD RAGDNAEPPV AGEASAEGAE GGGEFGGSGG EDLGGEGTSD FVDPAGASAF EPGTEEWMHR DLGSRSDIEQ TLDTGMENVF PEEPAEAAAR AAQDAAPASY TEWGGGASSD EGYNLEAFVA AESSLADHLA EQLAVAVTTP SQRLIGQYLI DLVDDAGYLP ADLGDAAERL GASQAEVEAL VQVLQTFDPP GICARNLSEC LAIQLRERDR YDPAMQALVE HLDLLAKRDV ASLRKICGVD DEDLVDMIGE IRHLDPKPGL KFGSSRVQTV VPDVFVRPGP DGGWLVELNS DTLPKVLVNQ SYYSELSKTI RKDGDKSYFS DCLQNATWLV RALDQRARTI LKVATEIVRQ QDGFFTHGVK HLRPLNLKAV ADAIQMHEST VSRVTANKYM ATNRGTFELK YFFTASIASA DGGEAHSAEA VRHQIRQLID SEDPSAILSD DTIVERLREA GIDIARRTVA KYREAMRIPS SVQRRRDKQN MLGTQAGSAS RSRDTAPA
|
| |