Gene Information |
Locus tag | RPB_1029 |
Symbol | |
ID | 3909153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1181161 |
End bp | 1182048 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882922 |
Product | phenylacetic acid degradation-related protein |
Protein accession | YP_484650 |
Protein GI | 86748154 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2050] Uncharacterized protein, possibly involved in aromatic compounds catabolism |
TIGRFAM ID | [TIGR00369] uncharacterized domain 1 |
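
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that includes its stop codon; both assumptions match the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus RPB_1029.
length = gene_length_bp(1181161, 1182048)
print(length)                     # 888
print(protein_length_aa(length))  # 295
```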
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.800657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTCG AGGACAAGGC GCGCATGATC CAGCTCCGCC GCTCGATCCA TGGCGCCATC ATCGGCCTGC AGCTCGATCG CTTCGCGCCG GGCGAGGCCT GGAGCAGCCT GCCCTATCAT CCGGTGTTCG TCGGCGACAT CAGAACCGGC GTGATTCATG GCGGCGTCGT CACCGCGATG CTCGACGAGA GCTGCGGCAT GGCGGTGCAG CTGGCGCTGC CGGGCACGGC GGCGATCGCC ACGCTGGATC TGCGCATCGA CTATCTGCGG CCGGCGACGC CCGGCCAGGC GATCCGGGCG CACGCGCATT GCTATCACCT CACCCGCTCG ATCGCCTTCG TGCGCGCCAC CGCCTATCAG GAGTCCGAGG CCGACCCGAT CGCCAGCGCC ACCGCGATGT TCATGATCGG CGCCAACCGC ACCGACATGC TGCGGCAGGA GCCGAAGGTG CGCTTCGACA CGCCGGCGCC GCTGCAGGCC CCGGACGATC CCGGCGGCGC CGACGGCCTT CTCGCCATCA GTCCGTATCC GCGCTTTCTC GGCATCCGCG TCGACGCCGA CGCCGAGCCG GTGATGCCTT ACGATCCGAA ACTGGTCGGC AATCCGATCC TGCCGGCGCT GCATGGCGGG GTGATCGGCG CATTCCTCGA AACCGCGGCG ATCGTCGGCG TCCACCGCGA GATCGGCCTC GCCACCGCGC CGAAGCCGAT CGGCCTGACG GTGAACTATC TGCGCTCCGG CCGCCCGCTC GATACTTACG CCCGGGTCTC GATCGTCAAG CAGGGCCGCC GCGTCGTCGC GTTCGAGGCG CAGGCCTATC AGGCCGATCC GTCCAAGCCG ATCGCGTCCT GCTACGGCCA TTTCAAGCTG CGGGCGGAGC CCGGCTGA
|
Protein sequence | MSFEDKARMI QLRRSIHGAI IGLQLDRFAP GEAWSSLPYH PVFVGDIRTG VIHGGVVTAM LDESCGMAVQ LALPGTAAIA TLDLRIDYLR PATPGQAIRA HAHCYHLTRS IAFVRATAYQ ESEADPIASA TAMFMIGANR TDMLRQEPKV RFDTPAPLQA PDDPGGADGL LAISPYPRFL GIRVDADAEP VMPYDPKLVG NPILPALHGG VIGAFLETAA IVGVHREIGL ATAPKPIGLT VNYLRSGRPL DTYARVSIVK QGRRVVAFEA QAYQADPSKP IASCYGHFKL RAEPG
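
The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the listed gene sequence is already the coding strand (it starts with ATG and ends with TGA although the gene lies on the minus strand), that the spaces are only display grouping, and that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 shares these
# with the standard code and differs only in its permitted start codons.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCCTTCG AGGACAAGGC GCGCATGATC"))  # MSFEDKARMI
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be compared against the record.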