Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_4160 |
Symbol | |
ID | 4024682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4627937 |
End bp | 4629196 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637964368 |
Product | sigma-70 region 2 |
Protein accession | YP_571280 |
Protein GI | 91978621 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TCGCCTGGAT CGAAACCGCG ATCACCGCGG CGCGGCCCCA GGCGATCGGC GCGTTGCTGC GCTATTTCCG CAACCTCGAC ACCGCCGAGG AGGCGTTTCA GAACGCCTGC CTGCGCGCGC TGAAGGCGTG GCCGCAGAAC GGCCCGCCGC GCGATCCGGC GGCGTGGCTG ATCATGGTCG GCCGCAACGT CGCGATCGAC GACATCCGAC GCGGCAGGAA GCTGCAGCCG CTGCCCGACG ACGACGCGAT CTCCGATCTC GACGACGCCG AGGACGCGCT CGCCGAGCGG CTCGACGGCT CGCATTATCG CGACGACATC CTGCGGCTGC TGTTCATCTG CTGCCATCCC GAATTGCCGC CGACCCAGCA GATCGCGCTG GCCCTGCGCA TCGTCAGCGG CCTGACCGTG GCGCAGATCG CGCGCGCGTT TCTCGTCTCG GATGCTGCGA TGGAGCAGCG CATCACCCGC GCCAAAGCCA GGGTCGCCCG CGCGAGCGTG CCGTTCGAGA CGCCGGGCGC GCCGGAGCGC AGTGAACGGC TCGGCGCGGT GGCGGCGATG ATCTACCTGG TGTTCAACGA GGGCTATTCC GCCTCCGGCG ACACCGCCGG CCTCCGCGCG CCGCTGTGCG AGGAGGCGAT CCGCCTGGCG CGGCTGCTGC TGCGGCTGTT TCCGTCCGAG CCCGAGATCA TGGGCCTGAC CGCGCTGATG CTGCTGCAGC ACGCCCGCGC GCCGGCGCGG TTCGATCCGG GCGGCGAGAT CGTGCTGCTC GACGATCAGG ACCGCAGCCT CTGGAACGCA AAATTCATCG CCGAGGGTCT GGCGCTGATC GACAAGGCGA TGCGCCATCG CCGCACCGGG GCGTATCAGA TCCAGGCCGC GATCGCCGCG CTACATGCGC GGGCCGAAAA GCCCGAAGAT ACCGACTGGG CGCAGATCGA TCTGTTGTAC GGTTCGCTGG AAATCCTGCA GCCGTCGCCG GTGGTGACGC TCAACCGCGC GGTCGCGGTG TCGAAAGTGC GCGGCGCGGC GGCGGCGCTG TCGATGATCG CGCCGCTGGA GCAGCGGCTG TCGAACTACT TCCATTATTT CGGCACCAAG GGCGCTCTGC TGCTGCAACA GGGTTGCCGC GACGAGGCCC GCATCGCGTT CGATCGCGCC ATCGCGCTCG CCCGCACCAC CGCCGAGGCT TCGCACATCC GGATGCATCT CGATCGGCTG AAGCGCGACA GCGAAGCGAT CGGAACGTGA
|
Protein sequence | MTDLAWIETA ITAARPQAIG ALLRYFRNLD TAEEAFQNAC LRALKAWPQN GPPRDPAAWL IMVGRNVAID DIRRGRKLQP LPDDDAISDL DDAEDALAER LDGSHYRDDI LRLLFICCHP ELPPTQQIAL ALRIVSGLTV AQIARAFLVS DAAMEQRITR AKARVARASV PFETPGAPER SERLGAVAAM IYLVFNEGYS ASGDTAGLRA PLCEEAIRLA RLLLRLFPSE PEIMGLTALM LLQHARAPAR FDPGGEIVLL DDQDRSLWNA KFIAEGLALI DKAMRHRRTG AYQIQAAIAA LHARAEKPED TDWAQIDLLY GSLEILQPSP VVTLNRAVAV SKVRGAAAAL SMIAPLEQRL SNYFHYFGTK GALLLQQGCR DEARIAFDRA IALARTTAEA SHIRMHLDRL KRDSEAIGT
|
| |