Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1978 |
Symbol | |
ID | 3909483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2247164 |
End bp | 2248372 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883872 |
Product | extensin-like protein |
Protein accession | YP_485597 |
Protein GI | 86749101 |
COG category | [S] Function unknown |
COG ID | [COG3921] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.259223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGCG GAGTTCGTTT GTATCTCGTC GGCTCCTTCG TCCTCGTGTC GCTTGCCGGT TGCGGACGCG GTTTGTTTCA AACCGCCGAG CGTGAACCGT GGCGGACCGA GGCCGAGGTC GCTTGTCTGA AATCGGGCGC CGTCAAGGAA GGCCCCGAAC TGGTCCGGGT CGATCCGATC TCGGGCCCCG GCGTCTGCGG CGCCGAGTTT CCCCTCAAAG TGGCTGCGCT CGGCGAGAGT GGCGCGATCG GCTTTGCCGA CGATTTGCGT CCGCCAGCGG CGATCGGAGG TCGCGCCAGC CAGCCGCGCT GGCCGGGGGC GCAGCCGTCT TATGCGGCGC CGGCGCGCGG CTATCCGCAA CAGCAAACCG GCTACGGCGC TTCGAATCCT CCCTACGGCA GCAACAATGC TCCGGTGTCG CTGACCGCGC CCGGCGTCGG CCCCGCGGGC CGGGACATCG ATCTGCCGGA CGAGGGCGCG CTGCCGCCTG CGGATCGTCC GCCGGCCGAG CACGTCACCG GCTATTCGCG CGATCCGAGC TACGCACCGG CGCCCGCCGG TCGTGCGCCG GACGACGCGC GGCGCCCATT GCCGCGGCTC GGTCCGGCGC AGCAGGGCAA CATCACCGGC TCGGTCGGGC CGGTCGCGAT CAAGCCGGTG GCGACGCTGG CGTGTCCGAT CGTCTCCGCG CTGGACCGCT GGCTGGTGGA ATCGGTGCAG CCGTCGGCGA TGCGCTGGTT CGGTGTCCCC GTCGTCGAAA TCAAGCAGAT CTCGGCCTAT TCGTGCCGCG GCATGAACGG CAATCCGAAC GCGCACATCT CCGAACACGC CTTCGGCAAC GCGCTCGACA TCTCCGCCTT CGTGCTGGCC GACGGCCGGC GCGTGACGGT GAAGGGCGGC TGGAAGGGAT TGCCGGAAGA GCAGGCGTTC CTGCACGACG TGCAGAACTC GGCGTGCCAA ATGTTCAACA CCGTGCTGGC GCCGGGCTCG AACATCTATC ACTACGATCA CATCCACGTC GACCTGATGC GCCGCAAGAG CCAGCGCAGC ATCTGCAAGC CCGCCGCGGT GCCGGGCGAA GTGATCGCGC AGCGGCTGCA GGGGCGCAAT CCCTATGCGT CGGGCAATTG GAACGGCGTC ACCGGCTCGA TCGGCAAAGT CCCGGCGCGT GCGAAGGCGG TGGATCGCGA CGAAGCCGAA GACGATTAG
|
Protein sequence | MTRGVRLYLV GSFVLVSLAG CGRGLFQTAE REPWRTEAEV ACLKSGAVKE GPELVRVDPI SGPGVCGAEF PLKVAALGES GAIGFADDLR PPAAIGGRAS QPRWPGAQPS YAAPARGYPQ QQTGYGASNP PYGSNNAPVS LTAPGVGPAG RDIDLPDEGA LPPADRPPAE HVTGYSRDPS YAPAPAGRAP DDARRPLPRL GPAQQGNITG SVGPVAIKPV ATLACPIVSA LDRWLVESVQ PSAMRWFGVP VVEIKQISAY SCRGMNGNPN AHISEHAFGN ALDISAFVLA DGRRVTVKGG WKGLPEEQAF LHDVQNSACQ MFNTVLAPGS NIYHYDHIHV DLMRRKSQRS ICKPAAVPGE VIAQRLQGRN PYASGNWNGV TGSIGKVPAR AKAVDRDEAE DD
|
| |