Gene Information |
Locus tag | RPB_3029 |
Symbol | |
ID | 3910828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3452086 |
End bp | 3453441 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637884935 |
Product | agarase |
Protein accession | YP_486642 |
Protein GI | 86750146 |
COG category | |
COG ID | |
TIGRFAM ID | |
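The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates (the record does not state the convention, but the listed gene length matches it) and an unspliced CDS whose final codon is a stop. The function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 3452086
END_BP = 3453441


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1356, matches "Gene Length"
print(protein_length(length_bp))  # 451, matches "Protein Length"
```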
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.933436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA TTCCGAGAAC CCGCTGGGGC GGCCTCGCCG AAGACAAGGC TGCGGCCACC GGCTTTTTCC GCGTCGCGCA GATCGACGGC GTGTGGTGGT TCATCGATCC GGACGGCGGC CGCTTCCTGT CGAAGGGCGT CACCGCGGTG AATTTCGACC ACGACAGTAT CAAAGGCACC GAGCGTCACC CCTATCGCGA GGCGAGCCTG CACAAATACG GCAGCCGCAA CGCCTGGCGC AGCGCCGTCG CCGATCGCCT GCACCGCTGG GGCTTCAACA CGATCGGGGC GTGGTCGGAG CCGGAGGTGG CATCGGCCGG CTGCGCCCCG CTGGCCTCGG CCGCCGGCGT GGTCTATCTC GCCACCGCCT ACAGTGACGG CCGCGGCTGG CCGCAATCCG ATCCGTTCGC TCCGGCCTTC GAGACCTTCG CGCAGCAACG CGCCCGCGAG ATCTGCGCGC CGCGGCGTGA CGATCCGAGC GTGCTCGGCT GGTTCATCGA CAACGAGTTG CAATGGGGCC CGGACTGGCG CGGCGAGAAC GAACTGCTGC CGGTGATCCT GCGCGACAAC GCGGCGCCGC ATTCACGCCA GGTAGCGGTC GACCTGCTGC GCCGCCGCTA CGCCAGCGTC GCAGAGTTCA ACACAGCGTG GCGATGCTCT GCATCATCGT GGGACGCGCT GGCGACCGTG CCGATCGCGG CCCCGCCCTT CACCCGCAAT TTCTTCACAC ATGATCACGC GCAGGAACGC GATCCGTTGC GCGGGCGTTA CTTCGCCGAT TGCGATGCCT TCGCCGGGCT GCTCGCCGAG CGCTACATGG CGGTGAGCGC CGCGGCGATC CGCGCGGCCG CGCCGCATCA TCTCGTGCTC GGCAGCCGCT TCGCCTACGC GCCGCAGCCG CAGGTCATCG CTGCCGCCGG CCGGCATTGC GACGTCATCA GCATCAATTG CTACGACGCT TTGCCCGACG CAGTGATCGA CGCCTATGCC GAATGCGGCC GGCCCTGCCT GATCGGCGAG TTCTCGTTCC GCGGCGACGA CGCCGGGTTG CCGAACACGC AAGGCGCCGG GCCGCGCGTC GAGACGCAGG CGGATCGCGC CGCAGGCTTT GCGCGCTATG TCGGTGCCGG CCTGCGCCAC CCGAACCTGA TCGGCTATCA CTGGTTCCTG CACGCCGATC AGCCGGCGGA AGGCCGCTGG GACGGCGAGA ATTCCAACTA CGGCGTCGTC ACCATCGACG ATGAGGTTTA TGTCGAACTG ACCGAGGCGA TGACGGTGGT CAATGACGAC GCCGAATGGC TGCACGCCGG CGCAGCGCAG GTCCGGCGAC ACATCGCAAC GCCGTCGGCG GCCTGA
|
Protein sequence | MTDIPRTRWG GLAEDKAAAT GFFRVAQIDG VWWFIDPDGG RFLSKGVTAV NFDHDSIKGT ERHPYREASL HKYGSRNAWR SAVADRLHRW GFNTIGAWSE PEVASAGCAP LASAAGVVYL ATAYSDGRGW PQSDPFAPAF ETFAQQRARE ICAPRRDDPS VLGWFIDNEL QWGPDWRGEN ELLPVILRDN AAPHSRQVAV DLLRRRYASV AEFNTAWRCS ASSWDALATV PIAAPPFTRN FFTHDHAQER DPLRGRYFAD CDAFAGLLAE RYMAVSAAAI RAAAPHHLVL GSRFAYAPQP QVIAAAGRHC DVISINCYDA LPDAVIDAYA ECGRPCLIGE FSFRGDDAGL PNTQGAGPRV ETQADRAAGF ARYVGAGLRH PNLIGYHWFL HADQPAEGRW DGENSNYGVV TIDDEVYVEL TEAMTVVNDD AEWLHAGAAQ VRRHIATPSA A
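The relationship between the two sequence fields can be illustrated with a minimal sketch that translates the gene sequence using the codon assignments of translation table 11 and computes a GC fraction. It is an illustration only, not the database's own pipeline. The helper names are hypothetical, the 10-character spacing is stripped before processing, and the special handling of alternative start codons in table 11 is deliberately left out (this gene starts with ATG, so it is unaffected).

```python
# Codon assignments for translation table 11 (identical to the standard
# code for elongation), in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used for 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on codons containing characters other than T, C, A, G.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = "ATGACCGACA TTCCGAGAAC CCGCTGGGGC"
print(translate(prefix))  # MTDIPRTRWG, the first block of the protein sequence
```

Passing the full "Gene sequence" value to `translate` and `gc_fraction` allows the result to be compared with the "Protein sequence" and "GC content" fields.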