Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3642 |
Symbol | |
ID | 6411318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3902350 |
End bp | 3903369 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642713522 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001992617 |
Protein GI | 192292012 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.236787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGATCC AGAAAAATTG GCAGGAATTG ATTCGGCCGA ACAAGCTCCA GGTCACCCCC GGGAGCGATG CCACCCGGTT CGCGACGCTG GTCGCCGAGC CGCTCGAGCG CGGCTTCGGT CAGACGCTCG GCAACGCGCT GCGTCGCGTG CTGTTGTCGT CGCTGCAGGG CGCGGCGGTG CAGTCGGTCC ACATCGACGG CGTTCTGCAC GAGTTCTCCT CGATCGCCGG CGTGCGCGAG GACGTCACCG ACATCGTGCT GAACATCAAG GACATCTCGC TGAAGATGCA GGGCGAAGGC CCGAAGCGGA TGGTCGTGAA GAAGCAGGGT CCGGGTGCCG TCACCGCTGG TGACATCCAG ACCGTCGGCG ACATCGTCGT GCTGAACCCG GATCTGCAGC TCTGCACGCT GGACGACGGC GCCGAGATCC GCATGGAGTT CACCGTCAAC ACCGGCAAGG GCTACGTCGC CGCCGAGCGC AACCGTCCCG AGGACGCGCC GATCGGCCTG ATCCCGGTCG ACAGCCTGTA CTCGCCGGTC CGCAAGGTGT CGTACAAGGT CGAGAACACC CGCGAGGGCC AGATCCTCGA CTACGACAAG CTCACCATGA CGGTCGAGAC CAACGGCGCG ATCTCGCCGG AAGACGCGGT GGCGTTCGCC GCCCGCATCC TCCAGGATCA GCTCAACGTC TTCGTCAACT TCGAAGAGCC GCGCAAGGAA GTCACCCAGG AGATCATCCC GGATCTGGCC TTCAACCCGG CCTTCCTCAA GAAGGTGGAC GAGCTCGAGC TGTCGGTGCG TTCGGCGAAC TGCCTGAAGA ACGACAACAT CGTCTACATT GGCGACCTGG TGCAGAAGTC GGAAGCGGAA ATGCTCCGCA CCCCGAACTT CGGGCGCAAG TCGCTGAACG AGATCAAGGA AGTGCTGGCG CAGATGGGCC TGCATCTCGG CATGGAAGTG CCGGGCTGGC CGCCGGAGAA CATCGACGAG CTCGCCAAGC GCTTCGAAGA TCACTACTAA
|
Protein sequence | MTIQKNWQEL IRPNKLQVTP GSDATRFATL VAEPLERGFG QTLGNALRRV LLSSLQGAAV QSVHIDGVLH EFSSIAGVRE DVTDIVLNIK DISLKMQGEG PKRMVVKKQG PGAVTAGDIQ TVGDIVVLNP DLQLCTLDDG AEIRMEFTVN TGKGYVAAER NRPEDAPIGL IPVDSLYSPV RKVSYKVENT REGQILDYDK LTMTVETNGA ISPEDAVAFA ARILQDQLNV FVNFEEPRKE VTQEIIPDLA FNPAFLKKVD ELELSVRSAN CLKNDNIVYI GDLVQKSEAE MLRTPNFGRK SLNEIKEVLA QMGLHLGMEV PGWPPENIDE LAKRFEDHY
|
| |