Gene RPD_2806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2806 
Symbol 
ID4023304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3126518 
End bp3128242 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content66% 
IMG OID637963004 
Productdihydroxy-acid dehydratase 
Protein accessionYP_569935 
Protein GI91977276 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.852293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCGA AGGCGACGCT CAAGTCGAAG CTCCCCAGCC GGCACGTGAC CGAAGGGCCG 
GCGCGCGCGC CCCATCGCTC TTACCTCTAC GCCATGGGCC TCACCACCGA GCAGATCCAC
CAGCCGTTCG TCGGGGTGGC GTCGTGCTGG AACGAGGCCG CGCCCTGCAA CATTTCGCTG
ATGCGGCAGG CTCAGGCGGT CAAGAAGGGC GTCGCCTCCG CCGGCGGCAC CCCGCGCGAA
TTCTGCACCA TCACCGTGAC TGACGGCATC GCCATGGGCC ACGAGGGCAT GCGCTCGTCG
CTGCCGTCGC GCGAGGTGAT CGCCGACTCC GTCGAGCTGA CAATCCGCGG CCACTCCTAT
GACGCGCTGG TCGGGCTGGC CGGCTGCGAC AAGTCTCTGC CCGGGATGAT GATGGCGATG
GTCCGGCTCA ACGTGCCGTC GATCTTCATC TATGGCGGCT CGATCCTGCC GGGCACCTTC
CGGGGCCAGC AGGTCACCGT TCAGGACATG TTCGAGGCGG TCGGCAAGCA CTCGGTCGGC
GAGATGTCGG ACGACGACCT CGACGAAATC GAGCGGGTCG CCTGTCCGTC GGCCGGCGCC
TGCGGCGCGC AGTTCACCGC CAACACCATG GCGACCGTGT CCGAGGCGAT CGGCCTAGCG
CTGCCGTATT CGGCCGGCGC ACCTGCTCCT TACGAAATCC GCGATGCGTT CTGCACGGCG
GCCGGCGAGA AGGTGATGGA GCTGATCGCC GCCAACATCC GGCCGCGCGA CATCGTCACC
CGCAAGGCGC TGGAGAATGC GGCCGCGGTG GTAGCAGCGT CCGGCGGCTC GACCAATGCT
GCGCTGCACC TACCAGCGAT CGCGCATGAA TGTGGTATCA AGTTCGATCT GTTTGATGTC
GCCGAAATCT TCAAAAAGAC ACCATATATC GCGGATTTGA AGCCAGGCGG CCGTTATGTC
GCCAAAGACA TGTATGAAGT TGGCGGCATA CCGCTCCTGA TGAAGACCCT GCTCGATCAT
GGCTTCCTCC ACGGCGACTG CCTGACCGTC ACGGGACGGA CGATCGCCGA GAATCTGAAA
GCCGTGAAGT GGAATCCGCA TCAGGACGTG GTGCGGCAGG CGAACCATCC GATCACCGTG
ACTGGGGGCG TCGTCGGGCT GAAGGGAAAC CTCGCACCAG AAGGTGCGAT CGTGAAGGTC
GCGGGAATGT CGAACCTGAA GTTTTCCGGG CCTGCCCGCT GCTTCGATCG CGAGGAAGAC
GCGTTCGAGG CGGTGCAGAA GCGGACCTAC AAGGAAGGCG AGGTCCTCGT GATCCGCTAC
GAGGGGCCGC GGGGCGGCCC CGGAATGCGG GAAATGCTCG CCACCACTGC GGCGCTGACC
GGCCAGGGCA TGGGCGGCAA GATCGCGCTG ATCACCGACG GCCGGTTCTC CGGCGCCACC
CGCGGCTTCT GCATCGGCCA TGTCGGCCCG GAAGCGGCGC TGGGTGGTCC GATCGCGCTG
CTGCGCGACG GTGACATCAT CGTCATCGAC GCCGAGGCCG GAACGCTTGA CGTAAATTTG
ACCGACGACG AACTGGCCGC GCGCAAGTCC GAATGGGCGC ATCGCGCGAC AAACCACACG
TCGGGTGCGC TTTGGAAATA TGCCCAGCAG GTCGGGCCCG CAGTCAGCGG CGCTGTGACT
CATCCGGGCG GGGCGGCCGA GAAGCAGTGC TATGCGGATG TTTGA
 
Protein sequence
MDAKATLKSK LPSRHVTEGP ARAPHRSYLY AMGLTTEQIH QPFVGVASCW NEAAPCNISL 
MRQAQAVKKG VASAGGTPRE FCTITVTDGI AMGHEGMRSS LPSREVIADS VELTIRGHSY
DALVGLAGCD KSLPGMMMAM VRLNVPSIFI YGGSILPGTF RGQQVTVQDM FEAVGKHSVG
EMSDDDLDEI ERVACPSAGA CGAQFTANTM ATVSEAIGLA LPYSAGAPAP YEIRDAFCTA
AGEKVMELIA ANIRPRDIVT RKALENAAAV VAASGGSTNA ALHLPAIAHE CGIKFDLFDV
AEIFKKTPYI ADLKPGGRYV AKDMYEVGGI PLLMKTLLDH GFLHGDCLTV TGRTIAENLK
AVKWNPHQDV VRQANHPITV TGGVVGLKGN LAPEGAIVKV AGMSNLKFSG PARCFDREED
AFEAVQKRTY KEGEVLVIRY EGPRGGPGMR EMLATTAALT GQGMGGKIAL ITDGRFSGAT
RGFCIGHVGP EAALGGPIAL LRDGDIIVID AEAGTLDVNL TDDELAARKS EWAHRATNHT
SGALWKYAQQ VGPAVSGAVT HPGGAAEKQC YADV