Gene Information |
Locus tag | Rsph17029_1810 |
Symbol | |
ID | 4896032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 1908063 |
End bp | 1909844 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640112404 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001043689 |
Protein GI | 126462575 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
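The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is an illustration only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon (the gene sequence listed in this record ends in TAG). The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1908063
end_bp = 1909844

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1782 593
```

Both results agree with the Gene Length (1782 bp) and Protein Length (593 aa) fields.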
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.360755 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA CCCCCACCGG CAGGCGCTTC CGCTCACAGG CGTGGTTCGA CAATCCCGAC AATCCCGGGA TGACCGCGCT CTATGTCGAG CGCTACCAGA ACCAGGGCTT CACGCGGCGC GAGCTGCAGG GCGACCGGCC CATCATCGGC ATCGCGCAGT CGGGTTCGGA TCTCGCGCCC TGCAACAAGA TCCACCTCTT CCTCGCCGAG CGGGTCAAGG CGGGCATCCG CGACGCGGGC GGCGTGCCGA TGGAATTTCC CGTCCATCCG ATCCAGGAGA CCGGGCGCAG GCCCACCGCC GCGCTCGACC GCAACCTCGC CTATCTCGGC CTCGTCGAGG TGCTGCACGG CTATCCGATC GACGGGGTGG TGCTGACCAC CGGATGCGAC AAGACCACGC CCGCGCAGCT GATGGCAGCG GCGACGGTGG ATCTTCCTTC CATCGTGCTC TCGGGCTGGC CGATGCTCGA CGGCTGGTGG GAAGGCAAGC TCGCAGGCTC GGGCACGATC ATCTGGGAGA GCCGGCGGCT CTTGGCCGAG GGCGAGATCG ACTATCCGGA GTTCATGGAG CGTGCCTGCG CTTCGGCCCC CTCGCTCGGC CATTGCAACA CGATGGGCAC CGCCTCGACC CTGAACGCGC TGGCCGAGGC GCTGGGCATG TCGCTGCCCG GATGCTCGGC CATTCCCGCG CCGTTCCGCG AGCGGATGAA CATGGCCTAT GCCACGGGCC GGCGCATCGT CGAGATGGTG CTGGCCGACC TGAAGCCCTC GGACATCCTC ACGCGGCAGG CTTTCGAGAA TGCGATCCGC GTCAATTCGG CCATCGGCGG CTCGACCAAC GCGCCGCCGC ATCTGCAGGC CATCGCGCGC CATGCGGGTG TCGAGCTTGC GGTGGAGGAC TGGCAGACGG TGGGCTTCGA CCTGCCGCTG CTGGTGAACA TGCAGCCCGC CGGAGAATAT CTGGGTGAGA GCTTCTTCCG GGCGGGCGGC GTGCCTGCCG TCATGGGCGA GCTGCTCGCG GCGGGGCTTC TCCATGCGGA GGCGCTGACC GTCACGGGAG AGAGCATCGG CCACAATCTC GCGGGCGAGC GCAGCCGCGA CCGGCGGGTG ATCCGGTCGG TCGAGGATCC CCTGCGCGAG AAGGCGGGGT TCCTCGTGCT GCGGGGCAAT CTCTTCGACT CGGCGCTGAT GAAGACCTCG GTCATTTCGG CCGAGTTCCG GCACCGCTTC CTCGCCCAGC CGGGGCGGGA GGGCATCCAC GAGGCCCGCG CCGTGGTCTT CGAGGGACCG GAAGATTATC ACGCCCGCAT CAACGACCCC GATCTCGGGA TCGACGAGAC GACGATCCTC TTCATCCGCG GCGTGGGCTG CGTGGGCTAT CCGGGCTCAG CCGAGGTGGT GAACATGCAG CCGCCCGACG GGCTTCTGCG CGAGGGAGTG ACGCATCTGC CGACGGTGGG CGATGGGCGG CAGTCGGGCA CTTCCGAGAG CCCGTCGATC CTCAACGCCT CGCCCGAGGC GGCGGTGGGC GGCGGCCTTG CGCTCCTGCG GACCGGCGAC CGGGTGCGGC TCGATCTGAA TGCCTGCCGG CTCGACGCGC TGGTGGACGA GGCCGAGTGG GAGGCGCGCC GCGCCGCCTG GACGCCGCCC GTCCTGCACC ACCAGACCCC CTGGCAGGAG ATCTATCGCC GCCTCGTGGG GCAGCTCGCC GATGGCGGCT GCCTCGAGCT TGCCACCGCC TATCACCGGG TGGCGCGCGA TCTGCCACGG GACAATCATT AG
|
Protein sequence | MSDTPTGRRF RSQAWFDNPD NPGMTALYVE RYQNQGFTRR ELQGDRPIIG IAQSGSDLAP CNKIHLFLAE RVKAGIRDAG GVPMEFPVHP IQETGRRPTA ALDRNLAYLG LVEVLHGYPI DGVVLTTGCD KTTPAQLMAA ATVDLPSIVL SGWPMLDGWW EGKLAGSGTI IWESRRLLAE GEIDYPEFME RACASAPSLG HCNTMGTAST LNALAEALGM SLPGCSAIPA PFRERMNMAY ATGRRIVEMV LADLKPSDIL TRQAFENAIR VNSAIGGSTN APPHLQAIAR HAGVELAVED WQTVGFDLPL LVNMQPAGEY LGESFFRAGG VPAVMGELLA AGLLHAEALT VTGESIGHNL AGERSRDRRV IRSVEDPLRE KAGFLVLRGN LFDSALMKTS VISAEFRHRF LAQPGREGIH EARAVVFEGP EDYHARINDP DLGIDETTIL FIRGVGCVGY PGSAEVVNMQ PPDGLLREGV THLPTVGDGR QSGTSESPSI LNASPEAAVG GGLALLRTGD RVRLDLNACR LDALVDEAEW EARRAAWTPP VLHHQTPWQE IYRRLVGQLA DGGCLELATA YHRVARDLPR DNH