Gene RPC_2494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2494 
Symbol 
ID3971251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2704327 
End bp2706051 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content65% 
IMG OID637925602 
Productdihydroxy-acid dehydratase 
Protein accessionYP_532364 
Protein GI90423994 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000806623 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.595086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGCCA ACTCCAACAT CAAGGCGAGG TTGCCCAGCC GTCACGTGAC GGAAGGCCCG 
GAGCGTGCGC CGCACCGCTC CTACCTCTAT GCGATGGGGT TGACCACGCA GCAGATCCAC
CAGCCGTTCG TCGGCGTGGC ATCGTGCTGG AATGAAGCCG CGCCCTGCAA CATCTCGCTG
ATGCGCCAGG CCCAGGCGGT GAAGAAGGGC GTCGCCGCCG CGGGTGGTAC GCCACGCGAA
TTCTGCACCA TCACCGTCAC CGACGGCATC GCCATGGGCC ATGACGGCAT GCGCTCGTCG
CTGCCGTCGC GCGAATGCAT CGCCGACTCG GTCGAGCTGA CCATCCGCGG CCACTCCTAC
GACGCGCTGG TCGGGCTTGC CGGCTGCGAC AAGTCGCTGC CCGGAATGAT GATGGCGATG
GTCCGGCTCA ACGTGCCGTC GATCTTCATC TATGGCGGCT CGATCCTGCC CGGCACCTTC
CGCGGCCAGC AGGTCACCGT GCAGGACATG TTCGAGGCGG TCGGCAAGCA TTCGGTCGGC
GCGATGTCGG ACGCCGACCT CGACGAAATC GAACGGGTGG CGTGCCCCTC GGCCGGCGCC
TGCGGCGCCC AGTTCACCGC CAACACCATG GCGACGGTGT CGGAGGCGAT CGGCCTGGCG
CTGCCTTATT CGGCCGGAGC ACCTGCGCCC TATGAGATCC GCGACGCCTT CTGCATGACC
GCCGGCGAGC AGATCATGAC GCTGATCGCC AAGAATATCC GGCCGCGCGA CATCGTCACC
TTGAAGGCGC TGCAGAACGC CGCGGCGGTG GTGGCGGCCT CCGGCGGCTC GACCAATGCG
GCGCTGCACC TGCCGGCGAT CGCGCATGAA TGCGGCATCA AATTCGACCT GTTCGACGTC
GCCGAAATCT TCAAAAAGAC ACCCTATGTC GCGGATTTGA AACCCGGCGG CCGTTATGTC
GCCAAAGACA TGTACGAAGT TGGTGGCATA CCGCTTCTGA TGAAAACATT GCTCGATCAT
GGCTACCTGC ACGGCGACTG CCTGACGGTC ACCGGCCGGA CGATTGCGGA AAATTTGGCA
ACCGTGAAAT GGAATCCCGA CCAGGACGTG GTGCGCGCAG CGGATAACCC GATCACCGTG
ACCGGTGGGG TGGTCGGGCT GCAAGGCAAC CTCGCCCCCG AGGGGGCGAT CGTGAAGGTC
GCCGGGATGT CCAACTTGAA ATTCTCCGGC CCGGCGCGCT GCTTCGATCG CGAAGAGGAC
GCCTTCGAGG CGGTGCAGCA CAAGACCTAT CGCGAAGGCG AAGTGATCGT GATCCGCTAC
GAAGGGCCGC GCGGCGGCCC CGGCATGCGC GAGATGCTGT CGACCACCGC GGCGCTGACC
GGGCAGGGCA TGGGCGGCAA GATCGCGCTG ATCACCGACG GCCGGTTCTC CGGCGCCACC
CGCGGCTTCT GCATCGGCCA TGTCGGACCG GAAGCCGCGC TCGGCGGCCC GATCGCGCTG
TTGCAGGACG GCGACATCAT CGAGATCGAC GCGGTGGCCG GCACGCTTAA CGTAAAATTG
ACCGAAGCCG AACTCTCCGC GCGCAAGACC AATTGGCAGC CGCGTGAGAC CAACCATTCG
TCAGGCGCGT TGTGGAAGTA TGCCCAACAG GTCGGCCCCG CGCTCGGTGG CGCGGTGACC
CATCCGGGTG GTTCGCACGA GAAACAGTGT TATGCGGATG TTTAA
 
Protein sequence
MDANSNIKAR LPSRHVTEGP ERAPHRSYLY AMGLTTQQIH QPFVGVASCW NEAAPCNISL 
MRQAQAVKKG VAAAGGTPRE FCTITVTDGI AMGHDGMRSS LPSRECIADS VELTIRGHSY
DALVGLAGCD KSLPGMMMAM VRLNVPSIFI YGGSILPGTF RGQQVTVQDM FEAVGKHSVG
AMSDADLDEI ERVACPSAGA CGAQFTANTM ATVSEAIGLA LPYSAGAPAP YEIRDAFCMT
AGEQIMTLIA KNIRPRDIVT LKALQNAAAV VAASGGSTNA ALHLPAIAHE CGIKFDLFDV
AEIFKKTPYV ADLKPGGRYV AKDMYEVGGI PLLMKTLLDH GYLHGDCLTV TGRTIAENLA
TVKWNPDQDV VRAADNPITV TGGVVGLQGN LAPEGAIVKV AGMSNLKFSG PARCFDREED
AFEAVQHKTY REGEVIVIRY EGPRGGPGMR EMLSTTAALT GQGMGGKIAL ITDGRFSGAT
RGFCIGHVGP EAALGGPIAL LQDGDIIEID AVAGTLNVKL TEAELSARKT NWQPRETNHS
SGALWKYAQQ VGPALGGAVT HPGGSHEKQC YADV