Gene Rsph17025_1467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1467 
Symbol 
ID5083928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1497983 
End bp1499764 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content69% 
IMG OID640483023 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001167666 
Protein GI146277507 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.454096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.61506 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA CCCGGGACAG AAGGCGGTTC CGCTCGCAGG AGTGGTTCGA CAATCCCGAC 
AACCCCGGCA TGACCGCGCT CTATGTCGAG CGATACCAGA ACCAGGGATT CACGCGGCGC
GAATTGCAGG GCGATCGGCC GATCATCGGC ATCGCGCAGT CGGGATCGGA CCTCGCGCCC
TGCAACAAGA TCCACCTGTT CCTGGCCGAC CGGATCAAGG CGGGGATCCG CGACGCGGGC
GGGGTGCCGA TGGAGTTTCC CGTCCATCCG ATCCAGGAGA CCGGGCGGCG CCCGACGGCG
GCGCTCGACC GCAACCTGGC CTATCTGGGT CTGGTCGAGG TGCTGCACGG CTATCCGATC
GACGGCGTGG TGCTGACCAC GGGCTGCGAC AAGACCACTC CGGCGCAGCT GATGGCGGCG
GCGACGGTGG ACCTTCCCGC GATCGTCCTC TCGGGCGGCC CGATGCTGGA CGGCTGGTGG
GAGGGCAAGC TCGCCGGGTC GGGCACGATC ATCTGGGAGA GCCGCCGGCT TCTGGCCGAG
GGCGAGATCG ACTATGCCGA GTTCATGGAG CGCGCCTGCG CCTCGGCGCC GTCACTCGGC
CATTGCAACA CGATGGGCAC CGCCTCGACG CTGAACGCGC TGGCCGAGGC GCTCGGCATG
TCGCTGCCGG GCTGTTCGGC CATTCCCGCG CCGTTCCGCG AGCGGATGAA CATGGCCTAT
GCCACGGGCC GGCGGATCGT CGAGATGGTG CTCGAGGATC TCAAGCCGTC GGACATCCTC
ACCCGCAAGG CCTTCGAGAA CGCAATCCGG GTCAATTCGG CGATTGGCGG CTCGACCAAT
GCGCCGCCGC ATCTGCAGGC CATCGCGCGC CATGTCGGGG TCGAGCTTGC GGTGCAGGAC
TGGCAGGAGG TGGGGTTCGA CGTGCCGCTG CTGGTGAACA TGCAGCCCGC GGGCGAGTAT
CTGGGCGAGA GCTTCTTTCG CGCGGGCGGG GTTCCGGCCG TGATGGGCGA ACTGATCGCT
GCGGGGCTTC TGCATGAGGA GGCGCTGACG GTCACCGGGC AGAGCGTGGG CCACAACCTT
CAGGGCGAGC GCAGCCGCGA CCGGCGCGTG ATCCGCCCGG TTGATGAGCC GCTGCGCGAA
AAGGCTGGAT TTCTCGTGCT TTCGGGCAAT CTCTTCGACT CGGCTCTGAT GAAGACCTCG
GTGATCTCGG CCGAGTTTCG GCACCGCTTC CTCGCGCGGC CGGGACATGA GGGAGTCCAC
GAGGCCCGCG CCGTGGTCTT CGAGGGGCCC GAGGATTATC ACGCCCGCAT CAACGACCCC
GCTCTCGGCA TCGACGAGAC GACGATCCTC TTCATCCGCG GCGTGGGCTG CATCGGCTAC
CCCGGGTCGG CCGAGGTGGT GAACATGCAG CCGCCCGACG CGCTGCTGCG CGAGGGCGTC
ACGCATCTGC CGACGGTGGG CGACGGGCGG CAGTCGGGCA CGTCGGAAAG CCCCTCGATC
CTCAACGCCT CGCCCGAGGC GGTGGCCGGC GGGGGTCTTG CGCTTCTGCA GACCGGGGAC
CGGGTGCGGC TCGACCTCAA CCGCTGCCGG CTCGATGCGC TGGTGGACGA GGCCGAATGG
GAGGCGCGGC GGGCCGCATG GCAGCCGCCC GAGCTTCACA ACCAGACCCC GTGGCAGGAA
ATCTATCGCC GCCTCTCGGG CCAGCTGGCC GAGGGGGGGT GCATCGAGCT GGCCACCACC
TACCGCCGTG TGGCGCGCGA TCTTCCGCGG GACAACCATT GA
 
Protein sequence
MTDTRDRRRF RSQEWFDNPD NPGMTALYVE RYQNQGFTRR ELQGDRPIIG IAQSGSDLAP 
CNKIHLFLAD RIKAGIRDAG GVPMEFPVHP IQETGRRPTA ALDRNLAYLG LVEVLHGYPI
DGVVLTTGCD KTTPAQLMAA ATVDLPAIVL SGGPMLDGWW EGKLAGSGTI IWESRRLLAE
GEIDYAEFME RACASAPSLG HCNTMGTAST LNALAEALGM SLPGCSAIPA PFRERMNMAY
ATGRRIVEMV LEDLKPSDIL TRKAFENAIR VNSAIGGSTN APPHLQAIAR HVGVELAVQD
WQEVGFDVPL LVNMQPAGEY LGESFFRAGG VPAVMGELIA AGLLHEEALT VTGQSVGHNL
QGERSRDRRV IRPVDEPLRE KAGFLVLSGN LFDSALMKTS VISAEFRHRF LARPGHEGVH
EARAVVFEGP EDYHARINDP ALGIDETTIL FIRGVGCIGY PGSAEVVNMQ PPDALLREGV
THLPTVGDGR QSGTSESPSI LNASPEAVAG GGLALLQTGD RVRLDLNRCR LDALVDEAEW
EARRAAWQPP ELHNQTPWQE IYRRLSGQLA EGGCIELATT YRRVARDLPR DNH