Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1166 |
Symbol | |
ID | 3910101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1336848 |
End bp | 1337879 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637883060 |
Product | extensin-like protein |
Protein accession | YP_484787 |
Protein GI | 86748291 |
COG category | [S] Function unknown |
COG ID | [COG3921] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.27994 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTCCGCG GCAAAGCGTC GACGCGCGCG GTGCTGATCG CGCTGGTTGT GCTCGGCGCG TCGTCCGCGT GGGCGCAGGG TGAGATACCG TTGCCGAAGC CGCGCCCGGC CGAGGCGCCG CAACTGCAGG GCGAACGCGC GGCGGACCGG CCCGAGGCCG ATGCGCCGCA GGCGGAAGCG GCGCCGCCGC CGAAGCCGGA GCCGAAGCCG GAGCCGAAGC CGCCCTCGGC ATGCCGGCTG GCGCTGACCG ACGCGATCGC GATCGCGCCG AGCCTTGCCG ACATTGCCGG CCCCGGCAGT TGTGGCGGAA CCGATCTGGT GAAGCTCGAA GCGGTGGTGT TGCCGGACGG CAGCCGCGTG CCGCTGACGC CGGCGGCGAC GTTGCGCTGC CCGATGGCCA GCGCGCTCGT CGACTGGGTT CGCAGCGACC TCGCGCCGCT CGCCGCGTCG TTGGCCACGC GTCTCGCCGC GCTCGACAAT TACGACTCCT ACGATTGCCG CGGTCGCAAC CGGGTGCGCG GGGCCAAGCT GTCGGAGCAC GGCCGCGCCA ATGCGATCGA TCTGCGCGGC TTCAAGCTGG CCGATGGCCG CATGCTGTCG CTGACCGACC GCGCCGCGCC GCGCGCGGTG CGCGAGAGCG TGCGGCAGTC GGTGTGCGAC CGCTTCGCCA CCGTGCTCGG CCCGGGCTCG GACGGCTATC ACGAGGAGCA CGTCCACCTC GATCTCGCCG AGCGTCGCGG CGGCTACAAG ATGTGTCAAT GGGAGGTGTG GGAGCCGCTG CCCGTCATCG CGCCGCTGCT GCCGGCGGAG CGCCCGGCCG AAGCGCCGCC GCGCGAGGTG GCGGCCGGCG AGCCACAGGA CCGCGACCCC AACCGCTCGC CGCAGCAGGC TCAGCCTGAG AACGCGCCGC CGCAGCAGGT TGAGCCGGAG CAGGGCGAGC AAGCCGCCAA GCCAGAGCCG CCGCCGGCGA GCAGCAAGGC GAAGAGAAAG CCGAAGAAGT CGCGGGGTGA GCGGCAGTCG GAACTGCGGT AG
|
Protein sequence | MFRGKASTRA VLIALVVLGA SSAWAQGEIP LPKPRPAEAP QLQGERAADR PEADAPQAEA APPPKPEPKP EPKPPSACRL ALTDAIAIAP SLADIAGPGS CGGTDLVKLE AVVLPDGSRV PLTPAATLRC PMASALVDWV RSDLAPLAAS LATRLAALDN YDSYDCRGRN RVRGAKLSEH GRANAIDLRG FKLADGRMLS LTDRAAPRAV RESVRQSVCD RFATVLGPGS DGYHEEHVHL DLAERRGGYK MCQWEVWEPL PVIAPLLPAE RPAEAPPREV AAGEPQDRDP NRSPQQAQPE NAPPQQVEPE QGEQAAKPEP PPASSKAKRK PKKSRGERQS ELR
|
| |