Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1808 |
Symbol | |
ID | 3908889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2067910 |
End bp | 2068815 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637883702 |
Product | RNA polymerase sigma factor SigJ |
Protein accession | YP_485427 |
Protein GI | 86748931 |
COG category | [K] Transcription |
COG ID | [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02957] RNA polymerase sigma-70 factor, TIGR02957 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.505584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.519453 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAGC CTCCTGAGCC CACGATCAGC GACGACGCCG CAGCGAGCTT CGCAACGCTG CGCGCGCGGC TGGTCCGCGT CGCCTATCGC ATGCTCGGCT CGGTCGCCGA GGCCGAAGAC GTGGTGCAGG ACGCTTACTT GCGCTGGCAC CGGACCGATC GCGCCGAGGT GCGCGATCCC GCGGGGTTTC TGACCCGGAC CGTGACCCGG CTGTGCCTCG ACGTGCTGAA ATCGGCGCGG CTCAGGCGCG AGACCTATAT CGGACCATGG CTGCCCGAGC CGCTGATCGC CGATCCGTGC GAGGACGAGG GCGACGACAT CACGCTCACC TTGATGCTGG CGCTGGAGCG GCTGTCGCCG CTCGAGCGCG CGGCGTTCCT GCTGCACGAC GTGTTCGGCC TCGGCTTCGA CGAGATCGCC CGCACCCTCG ACCGCGACGC CGCCGCCTGC CGCCAGCTCG CCGCGCGCGC CCGCGGCCAT GTCCGGGCCG AGCGGCCGCG CTTTCCGGTG TCGGAGGAGC GCGGCCAGGC GATCGCCGGG GCGTTCTTCG AGGCGTCGCG CAGCGGCGAT CTGAAGGCGC TGACCGCGCT GCTCGCCGAC GACGTGGTGT TCTACGGCGA CGGCGGCGGC AAGCGCCCGG CGACGCTGAA CCCGATCTTC GGCCTCGCCA AAGTGGCCCG GCTGTTCGAA GGCCTCGCGC GCAAGCATGC CCCCGGTGCG TCGGTCGTGG TCTCGACCGG GCGCATCGAC GGGCTGCCCG GCTTCGTCAC CACCGAGCCC GACGGGCTGA TCCAGACCAC CGCGCTCGCC ATCGAACACG ACCGCATCGT CGCGATCTAT GTGGTGCGCA ATCCGGACAA GCTGCGCCAT CTGCTGGCGC TCTCGGCCGT CCCCGCGCGG CCCTGA
|
Protein sequence | MPKPPEPTIS DDAAASFATL RARLVRVAYR MLGSVAEAED VVQDAYLRWH RTDRAEVRDP AGFLTRTVTR LCLDVLKSAR LRRETYIGPW LPEPLIADPC EDEGDDITLT LMLALERLSP LERAAFLLHD VFGLGFDEIA RTLDRDAAAC RQLAARARGH VRAERPRFPV SEERGQAIAG AFFEASRSGD LKALTALLAD DVVFYGDGGG KRPATLNPIF GLAKVARLFE GLARKHAPGA SVVVSTGRID GLPGFVTTEP DGLIQTTALA IEHDRIVAIY VVRNPDKLRH LLALSAVPAR P
|
| |