Gene Information |
Locus tag | RPC_2494 |
Symbol | |
ID | 3971251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2704327 |
End bp | 2706051 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637925602 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_532364 |
Protein GI | 90423994 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
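The coordinate and length fields above are consistent with one another, and a short sketch can check this. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (RPC_2494).
length = gene_length_bp(2704327, 2706051)
print(length, protein_length_aa(length))
```

With the values in this record, the sketch gives 1725 bp and 574 aa, matching the Gene Length and Protein Length fields.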
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000806623 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.595086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGCCA ACTCCAACAT CAAGGCGAGG TTGCCCAGCC GTCACGTGAC GGAAGGCCCG GAGCGTGCGC CGCACCGCTC CTACCTCTAT GCGATGGGGT TGACCACGCA GCAGATCCAC CAGCCGTTCG TCGGCGTGGC ATCGTGCTGG AATGAAGCCG CGCCCTGCAA CATCTCGCTG ATGCGCCAGG CCCAGGCGGT GAAGAAGGGC GTCGCCGCCG CGGGTGGTAC GCCACGCGAA TTCTGCACCA TCACCGTCAC CGACGGCATC GCCATGGGCC ATGACGGCAT GCGCTCGTCG CTGCCGTCGC GCGAATGCAT CGCCGACTCG GTCGAGCTGA CCATCCGCGG CCACTCCTAC GACGCGCTGG TCGGGCTTGC CGGCTGCGAC AAGTCGCTGC CCGGAATGAT GATGGCGATG GTCCGGCTCA ACGTGCCGTC GATCTTCATC TATGGCGGCT CGATCCTGCC CGGCACCTTC CGCGGCCAGC AGGTCACCGT GCAGGACATG TTCGAGGCGG TCGGCAAGCA TTCGGTCGGC GCGATGTCGG ACGCCGACCT CGACGAAATC GAACGGGTGG CGTGCCCCTC GGCCGGCGCC TGCGGCGCCC AGTTCACCGC CAACACCATG GCGACGGTGT CGGAGGCGAT CGGCCTGGCG CTGCCTTATT CGGCCGGAGC ACCTGCGCCC TATGAGATCC GCGACGCCTT CTGCATGACC GCCGGCGAGC AGATCATGAC GCTGATCGCC AAGAATATCC GGCCGCGCGA CATCGTCACC TTGAAGGCGC TGCAGAACGC CGCGGCGGTG GTGGCGGCCT CCGGCGGCTC GACCAATGCG GCGCTGCACC TGCCGGCGAT CGCGCATGAA TGCGGCATCA AATTCGACCT GTTCGACGTC GCCGAAATCT TCAAAAAGAC ACCCTATGTC GCGGATTTGA AACCCGGCGG CCGTTATGTC GCCAAAGACA TGTACGAAGT TGGTGGCATA CCGCTTCTGA TGAAAACATT GCTCGATCAT GGCTACCTGC ACGGCGACTG CCTGACGGTC ACCGGCCGGA CGATTGCGGA AAATTTGGCA ACCGTGAAAT GGAATCCCGA CCAGGACGTG GTGCGCGCAG CGGATAACCC GATCACCGTG ACCGGTGGGG TGGTCGGGCT GCAAGGCAAC CTCGCCCCCG AGGGGGCGAT CGTGAAGGTC GCCGGGATGT CCAACTTGAA ATTCTCCGGC CCGGCGCGCT GCTTCGATCG CGAAGAGGAC GCCTTCGAGG CGGTGCAGCA CAAGACCTAT CGCGAAGGCG AAGTGATCGT GATCCGCTAC GAAGGGCCGC GCGGCGGCCC CGGCATGCGC GAGATGCTGT CGACCACCGC GGCGCTGACC GGGCAGGGCA TGGGCGGCAA GATCGCGCTG ATCACCGACG GCCGGTTCTC CGGCGCCACC CGCGGCTTCT GCATCGGCCA TGTCGGACCG GAAGCCGCGC TCGGCGGCCC GATCGCGCTG TTGCAGGACG GCGACATCAT CGAGATCGAC GCGGTGGCCG GCACGCTTAA CGTAAAATTG ACCGAAGCCG AACTCTCCGC GCGCAAGACC AATTGGCAGC CGCGTGAGAC CAACCATTCG TCAGGCGCGT TGTGGAAGTA TGCCCAACAG GTCGGCCCCG CGCTCGGTGG CGCGGTGACC CATCCGGGTG GTTCGCACGA GAAACAGTGT TATGCGGATG TTTAA
|
Protein sequence | MDANSNIKAR LPSRHVTEGP ERAPHRSYLY AMGLTTQQIH QPFVGVASCW NEAAPCNISL MRQAQAVKKG VAAAGGTPRE FCTITVTDGI AMGHDGMRSS LPSRECIADS VELTIRGHSY DALVGLAGCD KSLPGMMMAM VRLNVPSIFI YGGSILPGTF RGQQVTVQDM FEAVGKHSVG AMSDADLDEI ERVACPSAGA CGAQFTANTM ATVSEAIGLA LPYSAGAPAP YEIRDAFCMT AGEQIMTLIA KNIRPRDIVT LKALQNAAAV VAASGGSTNA ALHLPAIAHE CGIKFDLFDV AEIFKKTPYV ADLKPGGRYV AKDMYEVGGI PLLMKTLLDH GYLHGDCLTV TGRTIAENLA TVKWNPDQDV VRAADNPITV TGGVVGLQGN LAPEGAIVKV AGMSNLKFSG PARCFDREED AFEAVQHKTY REGEVIVIRY EGPRGGPGMR EMLSTTAALT GQGMGGKIAL ITDGRFSGAT RGFCIGHVGP EAALGGPIAL LQDGDIIEID AVAGTLNVKL TEAELSARKT NWQPRETNHS SGALWKYAQQ VGPALGGAVT HPGGSHEKQC YADV
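The protein sequence begins with M even though the gene sequence begins with GTG, which normally encodes valine. This is expected under translation table 11, where GTG is an accepted initiation codon. The following is a minimal sketch of such a translation, not the database's own pipeline. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the sketch assumes the input contains only A, C, G and T, plus the spaces used for grouping above.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiation codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGACGCCA ACTCCAACAT CAAGGCGAGG"))  # MDANSNIKAR
```

Passing the full gene sequence to `translate_cds` should give a string that can be compared with the protein sequence once its grouping spaces are removed.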