Gene Rsph17029_3617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3617 
Symbol 
ID4898189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp708458 
End bp710257 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content70% 
IMG OID640114225 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001045479 
Protein GI126464366 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.886843 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAAC GCCCCCGGAT CACCCCCGAC CAGCTGCGCT CGCGGCTCTG GTTCAACAAT 
CCCGACGATC CGGAGATGAC GGCGCTCTAT CTCGAGCGCT ATCTGAACTA CGGCCTGACC
CGGGCCGAGC TTCAGGGCGG CAAGCCGATC ATCGGCATCG CCCAGACCGG CAGCGACCTC
TCGCCCTGCA ACCGTCACCA CATCGAGCTG GCCAAGCGGG TGCGCGACGG CATCACCGCG
GCGGGCGGCA TCCCGATGGA GATCCCGGTC CATCCGATCC AGGAGACGGG CAAGCGCCCG
ACGGCGAGCC TCGACCGCAA CCTCGCCTAT CTGGGCCTCG TCGAGGCGCT GCACGGCTAT
CCGATCGACG GGGTGGTGCT GACCATCGGC TGCGACAAGA CCACACCCGC GCTGCTGATG
GCGGCGGCGA CGGTGAACAT CCCCGCCATC GCCTATTCGG TGGGCCCGAT GCTGAACGGC
TGGCACCGGG GCGAGCGCGC GGGCTCGGGC ACCGCGGTCT GGCGGGCGCG CGAGCTTCTG
GCAAGGGGCG AGATCGACGA GGAGGGTTTC TACGAGCTCG TGGCCTCCTC GGCCCCCTCG
GTCGGCTATT GCAACACGAT GGGCACGGCC TCGACCATGA ACAGTCTGGC CGAGGTGCTC
GGCATGCAGC TTCCCGGCTC GGCCGCTATC CCCGCCCCCT ACCGCGAGCG CGGGCAGATG
GGGCACGCGA CCGGCCGGCG GATCGTCGAG ATGGTGTGGG AGGATCTGCG CCCCTCCGAC
ATCCTGACGC GGGAGGCCTT CGAGAATGCG ATCGTGGCCT GCTCGGCCCT CGGCGGTTCG
ACCAACGCGC CCATTCACCT CAATGCCGTG GCGCGGCACG CCGGCGTGGC GCTCGACAAT
GACGACTGGC AGAGGCTCGG CCATGCGGTG CCGCTCCTGG TCAATCTCCA GCCCGCGGGC
ACCTATCTCG GCGAGGATTT CTACCGCGCG GGCGGCGTGC CCGCCGTGCT GGGCGAACTC
CTGGCCGCGG ATCTCCTGCC CCATCCCGAG GCGCCCACCG TCTTCGGCAC GCCACTCTCC
GCCGGCGCCA TGCGCAGCCT CGAGACCGAC GTGATCCGCC CGGTGGTCGA ACCGCTGAAG
GGCGAGGCGG GCTTCATCAA TCTGTCGGGC AATCTCTTCG ACAGCGCGAT CATGAAGACG
AGCGTGATCT CGCCCGATTT CCGCGCCCGC TACCTCTCCG ATCCTGCCGA CCCCGAGGCG
TTCGAGGGCA CGGTCTTCGT CTTCGACGGC CCCGAGCATT TCCATGCGGT GATCGACGAC
CCGGCGCTGG GGATGGGCGA GGATGCGGTG CTCGTGATGC GCGGCGCGGG GCCCCTGGGC
TATCCGGGCG CGGCCGAGGT GGTGAACATG CGCCCGCCCG CCTACCTCCT CAAGCGCGGC
ATCCCGGCGG TGCCCTGCAT CGGCGACGGC CGCCAGTCGG GGACCTCGGG CTCGCCCTCG
ATCCTCAATG CCTCGCCCGA GGCCGCGGCG GGCGGAGGCC TCGCGCTTCT GCGCAACGGC
GACCGGATCC GCGTCGACCT GCGTCGCGGA CGGGTCGATG TGCTGCTGCC GGACGAGGAG
CTGGAGGCCC GCAGGACTGC CCTGGCCGAG GCCGGCGGCT ACGCCATGCC GCCCAGCCAG
ACCCCCTGGC AGGCGATCTT CCGCGACCTG ACGGGACAGC TTGCAGAGGG CATGGTGCTT
GACGGCGCGG ACGGGTTCCA CGATCTCGCC CGCAAGGCGC TTGCGCGCAA CAACCATTGA
 
Protein sequence
MTQRPRITPD QLRSRLWFNN PDDPEMTALY LERYLNYGLT RAELQGGKPI IGIAQTGSDL 
SPCNRHHIEL AKRVRDGITA AGGIPMEIPV HPIQETGKRP TASLDRNLAY LGLVEALHGY
PIDGVVLTIG CDKTTPALLM AAATVNIPAI AYSVGPMLNG WHRGERAGSG TAVWRARELL
ARGEIDEEGF YELVASSAPS VGYCNTMGTA STMNSLAEVL GMQLPGSAAI PAPYRERGQM
GHATGRRIVE MVWEDLRPSD ILTREAFENA IVACSALGGS TNAPIHLNAV ARHAGVALDN
DDWQRLGHAV PLLVNLQPAG TYLGEDFYRA GGVPAVLGEL LAADLLPHPE APTVFGTPLS
AGAMRSLETD VIRPVVEPLK GEAGFINLSG NLFDSAIMKT SVISPDFRAR YLSDPADPEA
FEGTVFVFDG PEHFHAVIDD PALGMGEDAV LVMRGAGPLG YPGAAEVVNM RPPAYLLKRG
IPAVPCIGDG RQSGTSGSPS ILNASPEAAA GGGLALLRNG DRIRVDLRRG RVDVLLPDEE
LEARRTALAE AGGYAMPPSQ TPWQAIFRDL TGQLAEGMVL DGADGFHDLA RKALARNNH