Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4259 |
Symbol | |
ID | 3912072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4844586 |
End bp | 4845851 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637886164 |
Product | putative sigma factor |
Protein accession | YP_487858 |
Protein GI | 86751362 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.294953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC TCGCCTGGAT CGACACCGCG ATCACGGCGT CCCGGCCCCA GGCGATCGGC GCGCTGCTGC GCTATTTCCG CGATCTCGAC ACCGCCGAGG AGGCGTTCCA GAACGCCTGC CTGCGGGCGC TGAAAGCCTG GCCGCAGAAC GGCCCGCCGC GCGATCCGGC CGCATGGCTG ATCCTGGTCG GCCGCAATGT CGCGATCGAC GACCTCCGCC GCGGCAAGAA GCAACAGCCG CTGCCCGACG ACGAAGCGAT CTCCGATCTC GACGACGCCG AGAGCGCGCT CGCCGAGCGG CTCGACGGCT CGCATTATCG CGACGACATC CTGCGGCTGC TGTTCATCTG CTGTCATCCC GAATTGCCGC CCACCCAGCA GATCGCGCTG GCGCTGCGCA TCGTCTCGGG GCTGACCGTG CCGCAGATCG CGCGGGCGTT TCTGGTGTCG GACGCGGCGA TGGAGCAGCG CATCACCCGC GCCAAGGCCA AAGTCGCCCG CGCCCGCGTG CCGTTCGAAA CCCCGGGCGC GCCGGAGCGC AGCGAACGGC TGGGCGCGGT GGCGGCGATG ATCTACCTGG TCTTCAACGA GGGCTATTCG GCGTCGGGCG ACACTGCGGG AATCCGCGCG CCCTTGTGCG AGGAGGCGAT CCGGCTGGCG CGGCTGCTGC TGCGGCTGTT TCCGTCCGAG CCCGAGATCA TGGGGCTCAC TGCTTTGATG CTGCTGCAGC ATGCGCGCGC GCCGGCGCGC TTCGATGCCC ATGGCGAGAT CGTGCTGCTC GACGAGCAGG ACCGCGGCCT GTGGGACACA AAGCTGATCG CCGAGGGCCT GGCGCTGATC GACAAGGCGA TGCGTCATCG CCGCACCGGC GCGTATCAGA TCCAGGCCGC GATCGCCGCC CTGCACGCCC GTGCGACCCG GCCCGAGGAT ACCGACTGGG CGCAGATCGA TCTGCTGTAC GGCTCGCTGG AGATCCTGCA GCCGTCGCCG GTGATCACGC TCAACCGCGC GGTCGCGGTG TCCAAAGTGC GCGGCGCCGA GGCGGCGCTG GCGATGATCG CACCGCTGGA AGAGAGGTTG TCGAACTACT TCCATTATTT CGGCACCAGG GGCGCGCTGC TGCTGCAGCT GGGTCGCCGT GACGAGGCGC GGACCGCCTT CGACCGCGCC ATCGCGCTGG CCCGGACCAC CGCCGAGGCC AACCACATCC GCATGCATCT CGACCGCTCG AAGCGCGACG ACGCGGCCGA GCGGATCAAT CCGTAG
|
Protein sequence | MTDLAWIDTA ITASRPQAIG ALLRYFRDLD TAEEAFQNAC LRALKAWPQN GPPRDPAAWL ILVGRNVAID DLRRGKKQQP LPDDEAISDL DDAESALAER LDGSHYRDDI LRLLFICCHP ELPPTQQIAL ALRIVSGLTV PQIARAFLVS DAAMEQRITR AKAKVARARV PFETPGAPER SERLGAVAAM IYLVFNEGYS ASGDTAGIRA PLCEEAIRLA RLLLRLFPSE PEIMGLTALM LLQHARAPAR FDAHGEIVLL DEQDRGLWDT KLIAEGLALI DKAMRHRRTG AYQIQAAIAA LHARATRPED TDWAQIDLLY GSLEILQPSP VITLNRAVAV SKVRGAEAAL AMIAPLEERL SNYFHYFGTR GALLLQLGRR DEARTAFDRA IALARTTAEA NHIRMHLDRS KRDDAAERIN P
|
| |