Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2806 |
Symbol | |
ID | 4023304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3126518 |
End bp | 3128242 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963004 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_569935 |
Protein GI | 91977276 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.852293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCGA AGGCGACGCT CAAGTCGAAG CTCCCCAGCC GGCACGTGAC CGAAGGGCCG GCGCGCGCGC CCCATCGCTC TTACCTCTAC GCCATGGGCC TCACCACCGA GCAGATCCAC CAGCCGTTCG TCGGGGTGGC GTCGTGCTGG AACGAGGCCG CGCCCTGCAA CATTTCGCTG ATGCGGCAGG CTCAGGCGGT CAAGAAGGGC GTCGCCTCCG CCGGCGGCAC CCCGCGCGAA TTCTGCACCA TCACCGTGAC TGACGGCATC GCCATGGGCC ACGAGGGCAT GCGCTCGTCG CTGCCGTCGC GCGAGGTGAT CGCCGACTCC GTCGAGCTGA CAATCCGCGG CCACTCCTAT GACGCGCTGG TCGGGCTGGC CGGCTGCGAC AAGTCTCTGC CCGGGATGAT GATGGCGATG GTCCGGCTCA ACGTGCCGTC GATCTTCATC TATGGCGGCT CGATCCTGCC GGGCACCTTC CGGGGCCAGC AGGTCACCGT TCAGGACATG TTCGAGGCGG TCGGCAAGCA CTCGGTCGGC GAGATGTCGG ACGACGACCT CGACGAAATC GAGCGGGTCG CCTGTCCGTC GGCCGGCGCC TGCGGCGCGC AGTTCACCGC CAACACCATG GCGACCGTGT CCGAGGCGAT CGGCCTAGCG CTGCCGTATT CGGCCGGCGC ACCTGCTCCT TACGAAATCC GCGATGCGTT CTGCACGGCG GCCGGCGAGA AGGTGATGGA GCTGATCGCC GCCAACATCC GGCCGCGCGA CATCGTCACC CGCAAGGCGC TGGAGAATGC GGCCGCGGTG GTAGCAGCGT CCGGCGGCTC GACCAATGCT GCGCTGCACC TACCAGCGAT CGCGCATGAA TGTGGTATCA AGTTCGATCT GTTTGATGTC GCCGAAATCT TCAAAAAGAC ACCATATATC GCGGATTTGA AGCCAGGCGG CCGTTATGTC GCCAAAGACA TGTATGAAGT TGGCGGCATA CCGCTCCTGA TGAAGACCCT GCTCGATCAT GGCTTCCTCC ACGGCGACTG CCTGACCGTC ACGGGACGGA CGATCGCCGA GAATCTGAAA GCCGTGAAGT GGAATCCGCA TCAGGACGTG GTGCGGCAGG CGAACCATCC GATCACCGTG ACTGGGGGCG TCGTCGGGCT GAAGGGAAAC CTCGCACCAG AAGGTGCGAT CGTGAAGGTC GCGGGAATGT CGAACCTGAA GTTTTCCGGG CCTGCCCGCT GCTTCGATCG CGAGGAAGAC GCGTTCGAGG CGGTGCAGAA GCGGACCTAC AAGGAAGGCG AGGTCCTCGT GATCCGCTAC GAGGGGCCGC GGGGCGGCCC CGGAATGCGG GAAATGCTCG CCACCACTGC GGCGCTGACC GGCCAGGGCA TGGGCGGCAA GATCGCGCTG ATCACCGACG GCCGGTTCTC CGGCGCCACC CGCGGCTTCT GCATCGGCCA TGTCGGCCCG GAAGCGGCGC TGGGTGGTCC GATCGCGCTG CTGCGCGACG GTGACATCAT CGTCATCGAC GCCGAGGCCG GAACGCTTGA CGTAAATTTG ACCGACGACG AACTGGCCGC GCGCAAGTCC GAATGGGCGC ATCGCGCGAC AAACCACACG TCGGGTGCGC TTTGGAAATA TGCCCAGCAG GTCGGGCCCG CAGTCAGCGG CGCTGTGACT CATCCGGGCG GGGCGGCCGA GAAGCAGTGC TATGCGGATG TTTGA
|
Protein sequence | MDAKATLKSK LPSRHVTEGP ARAPHRSYLY AMGLTTEQIH QPFVGVASCW NEAAPCNISL MRQAQAVKKG VASAGGTPRE FCTITVTDGI AMGHEGMRSS LPSREVIADS VELTIRGHSY DALVGLAGCD KSLPGMMMAM VRLNVPSIFI YGGSILPGTF RGQQVTVQDM FEAVGKHSVG EMSDDDLDEI ERVACPSAGA CGAQFTANTM ATVSEAIGLA LPYSAGAPAP YEIRDAFCTA AGEKVMELIA ANIRPRDIVT RKALENAAAV VAASGGSTNA ALHLPAIAHE CGIKFDLFDV AEIFKKTPYI ADLKPGGRYV AKDMYEVGGI PLLMKTLLDH GFLHGDCLTV TGRTIAENLK AVKWNPHQDV VRQANHPITV TGGVVGLKGN LAPEGAIVKV AGMSNLKFSG PARCFDREED AFEAVQKRTY KEGEVLVIRY EGPRGGPGMR EMLATTAALT GQGMGGKIAL ITDGRFSGAT RGFCIGHVGP EAALGGPIAL LRDGDIIVID AEAGTLDVNL TDDELAARKS EWAHRATNHT SGALWKYAQQ VGPAVSGAVT HPGGAAEKQC YADV
|
| |