Gene Rsph17029_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1810 
Symbol 
ID4896032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1908063 
End bp1909844 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content69% 
IMG OID640112404 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001043689 
Protein GI126462575 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.360755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA CCCCCACCGG CAGGCGCTTC CGCTCACAGG CGTGGTTCGA CAATCCCGAC 
AATCCCGGGA TGACCGCGCT CTATGTCGAG CGCTACCAGA ACCAGGGCTT CACGCGGCGC
GAGCTGCAGG GCGACCGGCC CATCATCGGC ATCGCGCAGT CGGGTTCGGA TCTCGCGCCC
TGCAACAAGA TCCACCTCTT CCTCGCCGAG CGGGTCAAGG CGGGCATCCG CGACGCGGGC
GGCGTGCCGA TGGAATTTCC CGTCCATCCG ATCCAGGAGA CCGGGCGCAG GCCCACCGCC
GCGCTCGACC GCAACCTCGC CTATCTCGGC CTCGTCGAGG TGCTGCACGG CTATCCGATC
GACGGGGTGG TGCTGACCAC CGGATGCGAC AAGACCACGC CCGCGCAGCT GATGGCAGCG
GCGACGGTGG ATCTTCCTTC CATCGTGCTC TCGGGCTGGC CGATGCTCGA CGGCTGGTGG
GAAGGCAAGC TCGCAGGCTC GGGCACGATC ATCTGGGAGA GCCGGCGGCT CTTGGCCGAG
GGCGAGATCG ACTATCCGGA GTTCATGGAG CGTGCCTGCG CTTCGGCCCC CTCGCTCGGC
CATTGCAACA CGATGGGCAC CGCCTCGACC CTGAACGCGC TGGCCGAGGC GCTGGGCATG
TCGCTGCCCG GATGCTCGGC CATTCCCGCG CCGTTCCGCG AGCGGATGAA CATGGCCTAT
GCCACGGGCC GGCGCATCGT CGAGATGGTG CTGGCCGACC TGAAGCCCTC GGACATCCTC
ACGCGGCAGG CTTTCGAGAA TGCGATCCGC GTCAATTCGG CCATCGGCGG CTCGACCAAC
GCGCCGCCGC ATCTGCAGGC CATCGCGCGC CATGCGGGTG TCGAGCTTGC GGTGGAGGAC
TGGCAGACGG TGGGCTTCGA CCTGCCGCTG CTGGTGAACA TGCAGCCCGC CGGAGAATAT
CTGGGTGAGA GCTTCTTCCG GGCGGGCGGC GTGCCTGCCG TCATGGGCGA GCTGCTCGCG
GCGGGGCTTC TCCATGCGGA GGCGCTGACC GTCACGGGAG AGAGCATCGG CCACAATCTC
GCGGGCGAGC GCAGCCGCGA CCGGCGGGTG ATCCGGTCGG TCGAGGATCC CCTGCGCGAG
AAGGCGGGGT TCCTCGTGCT GCGGGGCAAT CTCTTCGACT CGGCGCTGAT GAAGACCTCG
GTCATTTCGG CCGAGTTCCG GCACCGCTTC CTCGCCCAGC CGGGGCGGGA GGGCATCCAC
GAGGCCCGCG CCGTGGTCTT CGAGGGACCG GAAGATTATC ACGCCCGCAT CAACGACCCC
GATCTCGGGA TCGACGAGAC GACGATCCTC TTCATCCGCG GCGTGGGCTG CGTGGGCTAT
CCGGGCTCAG CCGAGGTGGT GAACATGCAG CCGCCCGACG GGCTTCTGCG CGAGGGAGTG
ACGCATCTGC CGACGGTGGG CGATGGGCGG CAGTCGGGCA CTTCCGAGAG CCCGTCGATC
CTCAACGCCT CGCCCGAGGC GGCGGTGGGC GGCGGCCTTG CGCTCCTGCG GACCGGCGAC
CGGGTGCGGC TCGATCTGAA TGCCTGCCGG CTCGACGCGC TGGTGGACGA GGCCGAGTGG
GAGGCGCGCC GCGCCGCCTG GACGCCGCCC GTCCTGCACC ACCAGACCCC CTGGCAGGAG
ATCTATCGCC GCCTCGTGGG GCAGCTCGCC GATGGCGGCT GCCTCGAGCT TGCCACCGCC
TATCACCGGG TGGCGCGCGA TCTGCCACGG GACAATCATT AG
 
Protein sequence
MSDTPTGRRF RSQAWFDNPD NPGMTALYVE RYQNQGFTRR ELQGDRPIIG IAQSGSDLAP 
CNKIHLFLAE RVKAGIRDAG GVPMEFPVHP IQETGRRPTA ALDRNLAYLG LVEVLHGYPI
DGVVLTTGCD KTTPAQLMAA ATVDLPSIVL SGWPMLDGWW EGKLAGSGTI IWESRRLLAE
GEIDYPEFME RACASAPSLG HCNTMGTAST LNALAEALGM SLPGCSAIPA PFRERMNMAY
ATGRRIVEMV LADLKPSDIL TRQAFENAIR VNSAIGGSTN APPHLQAIAR HAGVELAVED
WQTVGFDLPL LVNMQPAGEY LGESFFRAGG VPAVMGELLA AGLLHAEALT VTGESIGHNL
AGERSRDRRV IRSVEDPLRE KAGFLVLRGN LFDSALMKTS VISAEFRHRF LAQPGREGIH
EARAVVFEGP EDYHARINDP DLGIDETTIL FIRGVGCVGY PGSAEVVNMQ PPDGLLREGV
THLPTVGDGR QSGTSESPSI LNASPEAAVG GGLALLRTGD RVRLDLNACR LDALVDEAEW
EARRAAWTPP VLHHQTPWQE IYRRLVGQLA DGGCLELATA YHRVARDLPR DNH