Gene Information |
Locus tag | RPB_3551 |
Symbol | |
ID | 3911353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4064406 |
End bp | 4065521 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885453 |
Product | hypothetical protein |
Protein accession | YP_487157 |
Protein GI | 86750661 |
COG category | [S] Function unknown |
COG ID | [COG5330] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
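The length fields in the record above can be cross-checked from the coordinates. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4064406
end_bp = 4065521

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1116 bp) and protein length (371 aa).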
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.335217 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCCGA AATCGGCCCT TTCCGCCGCC ACCCTCCTCG ACGAGCTGCA ATCCACGCTG GCGCATGGCA CAGTGGCGCG ACGCGTCGAG ACCTTGCGCC GCGTCACCGA CCTGTATCTC GACAGCGGCG TGGACTACAG CGACGATCAG ATCGCGGTGT TCGACGACGT CTTCAACTGT CTGGTGGATA GTATCGAGAC CCATGCCAAG GTGCTGCTGG CCGAGCGCCT GGCGCCCGTC AATGCCCCGC CGCGGATCAT TCATCATCTC GCCTTCGAGG ACCTGATCGA GATCGCGGCG CCGGTGCTGT CGCAGTCCGA TCAGCTCGAC GACGCCATGC TGATCGCCAA TGCCCGAAGC AAAGGGCAGA GCCACATGAT GGCGATCTCG ACCCGCAGAT CGCTCAGCGG CGCGGTGACC GACGTCCTGG TCGAGCTCGG CAATCAGCAG GTGGTGCAGA GCACCGTCAA GAATCCCGGC GCGGAATTCT CGGACAATGG CTATTCGGTA CTGGTCAAGC GCGCCGAACT GGATGACGAC CTCGCCACCG AGCTGGGCCG GCGGGCGATC CCTCGCGCGC AATATCTCAA GCTGATCGCG ATCGCATCGG CCTCGGTGCG GGCGAAACTC AAGGCCGCGA ACCCGAACGC GGCCTCCGAG GTCGCGACCG CGGTGAAGCA GGCGTCGCGT CTGGCGCGCT CGGCGCCGGC GGCGATCAGC CGCCAGACCA GCATCGCCCA TGGTCTGGTC CGGTCGCTGT ACGAAGACGG CCGCATCACC GAAGAGCAGG TCAACACCTT CGCAAACGAA CGCAAATTCG ACGAGATCAA TCAGGCGCTC GCATGCCTCG CCGGCACCTC GGTCGAGACC GCCGAGGCGA TGATGATCGA ATCCCGCGAC GAGGGTCTGC TGATCCTCGC CAAGGTCTGC AAATTGTCGT GGCCGACGGT CAAGGCGATC ATCAGGATGC GCGACGAGGC GACCGGCACG ATGTCCACCG ATCTCGACGA ATGTCGCTTC ACCTACGAGC GGTTGCGCAT CGCGACCGCG CAGCAGGTGC TCCGCTTTCA CCGCATGCAG CAATCCAGCG CCGCAACCAA GGCGCCGGCC GCCTGA
Protein sequence | MSPKSALSAA TLLDELQSTL AHGTVARRVE TLRRVTDLYL DSGVDYSDDQ IAVFDDVFNC LVDSIETHAK VLLAERLAPV NAPPRIIHHL AFEDLIEIAA PVLSQSDQLD DAMLIANARS KGQSHMMAIS TRRSLSGAVT DVLVELGNQQ VVQSTVKNPG AEFSDNGYSV LVKRAELDDD LATELGRRAI PRAQYLKLIA IASASVRAKL KAANPNAASE VATAVKQASR LARSAPAAIS RQTSIAHGLV RSLYEDGRIT EEQVNTFANE RKFDEINQAL ACLAGTSVET AEAMMIESRD EGLLILAKVC KLSWPTVKAI IRMRDEATGT MSTDLDECRF TYERLRIATA QQVLRFHRMQ QSSAATKAPA A
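The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a hedged illustration rather than the database's own method. It assumes the space-separated 10-base blocks are display formatting only, and it relies on the fact that translation table 11 uses the standard codon-to-amino-acid assignments for elongation. Alternative start codons are not handled, which does not matter for a gene that begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence above; the trailing partial codon is ignored.
prefix = "ATGAGCCCGA AATCGGCCCT"
print(translate(prefix))  # MSPKSA
```

Passing the full gene sequence to `translate` should yield the listed protein sequence once its spaces are removed, and `gc_percent` on the same input can be compared with the GC content field.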