Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2034 |
Symbol | |
ID | 3909849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2313277 |
End bp | 2314794 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637883927 |
Product | hypothetical protein |
Protein accession | YP_485652 |
Protein GI | 86749156 |
COG category | [S] Function unknown |
COG ID | [COG3333] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.32397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.23521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGATC TGTTCTCCAA TCTCGCGCTC GGCTTCCAGG TCGCGGCCTC GCCGATGAAT CTCGGGCTCT GTCTCGTCGG CGCCCTGGTC GGCACGCTGA TCGGCGTGCT GCCCGGCATC GGCACCATCG CCACCGTGGC GATGCTGCTT CCGATCACCT TCGGCCTGCC GCCGATCGGC GCGCTGATCA TGCTCGCCGG TATCTATTAC GGCGCGCAAT ATGGCGGCTC GACCACATCG ATCCTGGTCA ACATTCCGGG CGAGGCGACC TCGGTGGTCA CGACCCTCGA CGGCTTCCAG ATGGCCAAGC AGGGCAGGGC CGGTCCGGCG CTGGCGATCG CTGCGATCGG CTCCTTCGCC GCCGGCTGTT TCGCCACCGT GCTGATCGCG GTGCTGGGCG CGCCGCTGAC CAAGCTGGCG CTGGAGTTCG GGCCGGCAGA ATATTTTTCC CTGATGGTGC TCGGTTTGAT CTTCGCGGTG GTGCTGGCGA AGGGGTCGGT GCTCAAGGCG GTCGCGATGA TCGCGCTCGG CCTGCTGCTG TCGATGATCG GCTCCGACAT CGAAACCGGC GCGTCCCGCA TGACCTTCGG CATTCCCGAA CTCGCCGACG GTCTGGGCTT CGCCACCGTA GCGATGGGGG TGTTCGGCTT CGCCGAGATC ATTCGCAACC TCGACGGCGG CACCGAGGCC GATCGGCAAT TGGTGCAGCA GAAGATCACC GGCCTGATGC CGACCCGGAA GGATCTGCGC GACGCCGCGC CCGCGATCGC CCGCGGGACC GTGCTCGGCT CGATTCTCGG CATCCTGCCG GGCGGCGGCG CCGTGATCGC GTCGTTCGCG GCCTATACGC TGGAAAAGAA GATCTCCCGG ACGCCCTACC GGTTCGGCCG GGGCGCGATC GAAGGCGTGG CGGGGCCGGA AAGCGCCAAC AATGCCGCTG CGCAAACGTC TTTCATCCCG CTGCTGACGC TCGGCATTCC GCCGAACGCG GTGATGGCGC TGATGGTCGG CGCAATGACC ATTCACGGCA TCGTGCCGGG ACCGCAGGTG ATGCAGAATC AGCCGGAACT GGTGTGGGGC ATGATCGCCT CGATGTGGAT CGGCAATCTG ATGCTGCTGA TCATCAACCT GCCGCTGGTC GGAGTCTGGG TACGGTTGCT GCGCGTGCCG TACCGGCTGA TGTTTCCGGC GATCGTGGTA TTCTGCGCCA TCGGCATCTA TTCGGTGAAC AATGCGCCGA TCGACGTCGT GATGGCGGGT ATTTTCGGAC TGATCGGCTA TTGGCTGGTC AAGCACGATT TCGAACCGGC TCCGCTGCTG CTCGGAATGG TGCTCGGACC GCTGATGGAG GAGAATCTGC GCCGGGCGCT GCTGATTTCG CGCGGCGACG CGACGATCTT CGTGACCCAG CCGCTGTCGG CGACGTTGCT CGCGGTAGCC GCAGGACTTC TGGTGCTCGC GGTGCTTCCG TCGCTGCGCA GCAAGCGCGA CGAGGTTTTC GTCGAGTCCG AGAACTGA
|
Protein sequence | MLDLFSNLAL GFQVAASPMN LGLCLVGALV GTLIGVLPGI GTIATVAMLL PITFGLPPIG ALIMLAGIYY GAQYGGSTTS ILVNIPGEAT SVVTTLDGFQ MAKQGRAGPA LAIAAIGSFA AGCFATVLIA VLGAPLTKLA LEFGPAEYFS LMVLGLIFAV VLAKGSVLKA VAMIALGLLL SMIGSDIETG ASRMTFGIPE LADGLGFATV AMGVFGFAEI IRNLDGGTEA DRQLVQQKIT GLMPTRKDLR DAAPAIARGT VLGSILGILP GGGAVIASFA AYTLEKKISR TPYRFGRGAI EGVAGPESAN NAAAQTSFIP LLTLGIPPNA VMALMVGAMT IHGIVPGPQV MQNQPELVWG MIASMWIGNL MLLIINLPLV GVWVRLLRVP YRLMFPAIVV FCAIGIYSVN NAPIDVVMAG IFGLIGYWLV KHDFEPAPLL LGMVLGPLME ENLRRALLIS RGDATIFVTQ PLSATLLAVA AGLLVLAVLP SLRSKRDEVF VESEN
|
| |