Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2719 |
Symbol | |
ID | 3836158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3144087 |
End bp | 3145748 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637826829 |
Product | phenylpyruvate decarboxylase |
Protein accession | YP_427803 |
Protein GI | 83594051 |
COG category | [G] Carbohydrate transport and metabolism [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes |
TIGRFAM ID | [TIGR03394] indolepyruvate/phenylpyruvate decarboxylase, Azospirillum family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.572163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACCA TCGCCACGGC CTTGCTTGAC GCGTTGAAGG CCCATGGGGC GACCCGGATC TTCGGCATTC CCGGCGATTT CGCCCTGCCG TTCTTCCGGG TGGCCGAACA AAGCGCCCTC TTGCCGCTGT ACACCCTGAG CCACGAACCC GGAGTCGGCT TCGCCGCCGA CGCCTGCGCC CGCATGGGGC GCGGCCTCGG GGTGGCGGCG GTCACTTACG GCGCCGGGGC CTTGAACATG GTCAATCCGG TGGCCGGGGC TTGGTCGGAG AAATCACCCC TGGTGGTGAT TTCCGGCGCC CCCGGCGTCG CCGAATCCGC CGGCGGCCTG CTGCTCCACC ATCAGGCCAA AACCCTCGAC AGCCAGTGGC GGATCTTTGA AGAGATCACC TGCGCCCGCA CCCGCCTTGA TGACCCGCTG ACCGCCCCCG GGGAAATCGC CCGGGTGTTG CGGGCCTGCC TTGAACACTC CCGCCCCGTC TATATCGAAA TCCCCCGCGA CATCGTCGAT GCGCCCTGCG CCGCCGTGGA CCGCCTGCCG CCCACCCCGG TCGATGGCGA AGCGGTCGAA GCCGCCGCCG GCGAGATCAT GGCCCGCTTG GCTGCGGCCA GCGCCCCGGC GCTGCTGCTG GGGGTCGAGG TCCGTCGCCA CGGCATCGAA GCCGATGTCG CCGAACTGGC CCGCCGCCTG GGCCTGCCCA TCGCCACCAC CTTCATGGGC CGGGGTCTGT TATCCGAGGA GGGCGCGGCC GGCGGCGCGC CCGACAGCCT GATGGGCACC TATCTGGGGC TGGCCGGCCG CCCCGAGGTG CGCGCCGTCA TCGAAGACTC CGATGGCCTG CTGATGCTCG GCGTCATCTT GTCCGACACC AATTTCGGCG TTTCGGGCAA GCGCATCGAC CTGCGCCGCG CCATGCTCGC CGCCGACCGT CAGGTCGCCC TGGGCTTTCA TACCTATAAC GATATCCCGC TGGCCGATCT GGTCGCCGCC CTGCTGCGTC AGGCCGAGGG CTTCGCCCGC CAGGACGCCA AGGCCCTACC CAAGCCGACC GCCCTGCCCC GGGACATGAT CGCCGATGGG GCGCCGATCG GCCCGATGGA TATCGCCGCG GCCATCAACG ATCTATTCTC GGCCCATGGG GTGATGCCGA TCGCCTCGGA TATGGGCGAT TGCCTGTTCA CCGCCCTTGA TACCACCCAT GCGCCGCTGG TCGCCCCGGG CTATTACGCC ACCATGGGCT TTGGCGTGCC GGCGGGATTG GGCGTTCAGG CCAGCTGTGG CCGCCGGCCG CTGATCCTGG TCGGCGACGG CGCCTTTCAG ATGACCGGTT GGGAGTTGGG CAATTGCGCC CGCTACGGCT GGGACCCGAT CGTCATCGTC TTCAACAACG CCAGTTGGGA GATGCTGCGC ACCTTCCAAC CCGACACCGC CTATAACGAT CTGGCCGATT GGCGATTCGC CGATCTGGCC GCCGGCCTGG GCGGCGTTGG TCACCGTTGC CAAACCCGCG CCGATCTGGC CCGGGCCCTG GATCGGGCGG CCCGCGAACC GGGGCGCTTT CACCTGATCG AGGCGGTTCT GGCGCGCGGG GCGATCTCGG ACACCCTCCA GCGCTTCGTC ACCACGATGA AAGGCCGCCA CGCCGCGGCG GCCGATGCCT GA
|
Protein sequence | MPTIATALLD ALKAHGATRI FGIPGDFALP FFRVAEQSAL LPLYTLSHEP GVGFAADACA RMGRGLGVAA VTYGAGALNM VNPVAGAWSE KSPLVVISGA PGVAESAGGL LLHHQAKTLD SQWRIFEEIT CARTRLDDPL TAPGEIARVL RACLEHSRPV YIEIPRDIVD APCAAVDRLP PTPVDGEAVE AAAGEIMARL AAASAPALLL GVEVRRHGIE ADVAELARRL GLPIATTFMG RGLLSEEGAA GGAPDSLMGT YLGLAGRPEV RAVIEDSDGL LMLGVILSDT NFGVSGKRID LRRAMLAADR QVALGFHTYN DIPLADLVAA LLRQAEGFAR QDAKALPKPT ALPRDMIADG APIGPMDIAA AINDLFSAHG VMPIASDMGD CLFTALDTTH APLVAPGYYA TMGFGVPAGL GVQASCGRRP LILVGDGAFQ MTGWELGNCA RYGWDPIVIV FNNASWEMLR TFQPDTAYND LADWRFADLA AGLGGVGHRC QTRADLARAL DRAAREPGRF HLIEAVLARG AISDTLQRFV TTMKGRHAAA ADA
|
| |