Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1467 |
Symbol | |
ID | 5083928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1497983 |
End bp | 1499764 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640483023 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001167666 |
Protein GI | 146277507 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.454096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.61506 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA CCCGGGACAG AAGGCGGTTC CGCTCGCAGG AGTGGTTCGA CAATCCCGAC AACCCCGGCA TGACCGCGCT CTATGTCGAG CGATACCAGA ACCAGGGATT CACGCGGCGC GAATTGCAGG GCGATCGGCC GATCATCGGC ATCGCGCAGT CGGGATCGGA CCTCGCGCCC TGCAACAAGA TCCACCTGTT CCTGGCCGAC CGGATCAAGG CGGGGATCCG CGACGCGGGC GGGGTGCCGA TGGAGTTTCC CGTCCATCCG ATCCAGGAGA CCGGGCGGCG CCCGACGGCG GCGCTCGACC GCAACCTGGC CTATCTGGGT CTGGTCGAGG TGCTGCACGG CTATCCGATC GACGGCGTGG TGCTGACCAC GGGCTGCGAC AAGACCACTC CGGCGCAGCT GATGGCGGCG GCGACGGTGG ACCTTCCCGC GATCGTCCTC TCGGGCGGCC CGATGCTGGA CGGCTGGTGG GAGGGCAAGC TCGCCGGGTC GGGCACGATC ATCTGGGAGA GCCGCCGGCT TCTGGCCGAG GGCGAGATCG ACTATGCCGA GTTCATGGAG CGCGCCTGCG CCTCGGCGCC GTCACTCGGC CATTGCAACA CGATGGGCAC CGCCTCGACG CTGAACGCGC TGGCCGAGGC GCTCGGCATG TCGCTGCCGG GCTGTTCGGC CATTCCCGCG CCGTTCCGCG AGCGGATGAA CATGGCCTAT GCCACGGGCC GGCGGATCGT CGAGATGGTG CTCGAGGATC TCAAGCCGTC GGACATCCTC ACCCGCAAGG CCTTCGAGAA CGCAATCCGG GTCAATTCGG CGATTGGCGG CTCGACCAAT GCGCCGCCGC ATCTGCAGGC CATCGCGCGC CATGTCGGGG TCGAGCTTGC GGTGCAGGAC TGGCAGGAGG TGGGGTTCGA CGTGCCGCTG CTGGTGAACA TGCAGCCCGC GGGCGAGTAT CTGGGCGAGA GCTTCTTTCG CGCGGGCGGG GTTCCGGCCG TGATGGGCGA ACTGATCGCT GCGGGGCTTC TGCATGAGGA GGCGCTGACG GTCACCGGGC AGAGCGTGGG CCACAACCTT CAGGGCGAGC GCAGCCGCGA CCGGCGCGTG ATCCGCCCGG TTGATGAGCC GCTGCGCGAA AAGGCTGGAT TTCTCGTGCT TTCGGGCAAT CTCTTCGACT CGGCTCTGAT GAAGACCTCG GTGATCTCGG CCGAGTTTCG GCACCGCTTC CTCGCGCGGC CGGGACATGA GGGAGTCCAC GAGGCCCGCG CCGTGGTCTT CGAGGGGCCC GAGGATTATC ACGCCCGCAT CAACGACCCC GCTCTCGGCA TCGACGAGAC GACGATCCTC TTCATCCGCG GCGTGGGCTG CATCGGCTAC CCCGGGTCGG CCGAGGTGGT GAACATGCAG CCGCCCGACG CGCTGCTGCG CGAGGGCGTC ACGCATCTGC CGACGGTGGG CGACGGGCGG CAGTCGGGCA CGTCGGAAAG CCCCTCGATC CTCAACGCCT CGCCCGAGGC GGTGGCCGGC GGGGGTCTTG CGCTTCTGCA GACCGGGGAC CGGGTGCGGC TCGACCTCAA CCGCTGCCGG CTCGATGCGC TGGTGGACGA GGCCGAATGG GAGGCGCGGC GGGCCGCATG GCAGCCGCCC GAGCTTCACA ACCAGACCCC GTGGCAGGAA ATCTATCGCC GCCTCTCGGG CCAGCTGGCC GAGGGGGGGT GCATCGAGCT GGCCACCACC TACCGCCGTG TGGCGCGCGA TCTTCCGCGG GACAACCATT GA
|
Protein sequence | MTDTRDRRRF RSQEWFDNPD NPGMTALYVE RYQNQGFTRR ELQGDRPIIG IAQSGSDLAP CNKIHLFLAD RIKAGIRDAG GVPMEFPVHP IQETGRRPTA ALDRNLAYLG LVEVLHGYPI DGVVLTTGCD KTTPAQLMAA ATVDLPAIVL SGGPMLDGWW EGKLAGSGTI IWESRRLLAE GEIDYAEFME RACASAPSLG HCNTMGTAST LNALAEALGM SLPGCSAIPA PFRERMNMAY ATGRRIVEMV LEDLKPSDIL TRKAFENAIR VNSAIGGSTN APPHLQAIAR HVGVELAVQD WQEVGFDVPL LVNMQPAGEY LGESFFRAGG VPAVMGELIA AGLLHEEALT VTGQSVGHNL QGERSRDRRV IRPVDEPLRE KAGFLVLSGN LFDSALMKTS VISAEFRHRF LARPGHEGVH EARAVVFEGP EDYHARINDP ALGIDETTIL FIRGVGCIGY PGSAEVVNMQ PPDALLREGV THLPTVGDGR QSGTSESPSI LNASPEAVAG GGLALLQTGD RVRLDLNRCR LDALVDEAEW EARRAAWQPP ELHNQTPWQE IYRRLSGQLA EGGCIELATT YRRVARDLPR DNH
|
| |