Gene RPC_4321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4321 
Symbol 
ID3971509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4819587 
End bp4821425 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content67% 
IMG OID637927430 
Productdihydroxy-acid dehydratase 
Protein accessionYP_534163 
Protein GI90425793 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.103538 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.169432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCT ATCGCTCCAG GACCACAACC CATGGCCGCA ATATGGCGGG TGCTCGCGGC 
TTGTGGCGTG CCACCGGCAT GAAGAACGAG GATTTCGGCA AGCCGATCAT CGCGGTGGTG
AATTCCTTCA CCCAGTTCGT GCCCGGCCAC GTGCATCTGA AGGACCTCGG CCAATTGGTC
GCCCGCGAGA TCGAGAACGC CGGCGGCGTC GCCAAGGAAT TCAACACCAT CGCGATCGAT
GACGGCATCG CGATGGGCCA TGACGGCATG CTGTATTCGC TGCCGTCGCG CGAATTGATC
GCCGACAGCG TCGAATACAT GGTCAACGGC CATTGCGCCG ACGCCATGGT GTGCATCTCG
AATTGCGACA AGATCACCCC CGGCATGTTG ATGGCCTCGC TGCGGCTCAA CATTCCGACC
ATCTTCGTTT CCGGCGGCCC GATGGAAGCC GGCAAGGTCA CGGTCGGCGG CAAGAAGCGC
GCGGTCGACC TGATCGACGC CATGGTGGCG GCGGCCGATG ACCGGGTCAG CGACGCCGAC
GTCGAGGCGA TCGAGCGCTC CGCCTGTCCG ACCTGCGGCT CCTGCTCCGG CATGTTCACC
GCCAATTCGA TGAACTGCCT GACCGAGGCG CTCGGGCTGG CGCTGCCCGG CAACGGCTCG
GTGCTCGCCA CCCACGCCGA CCGCAAGGCG CTGTTCGTCG AGGCCGGCCA CCTGATCGTC
GATCTGGCCC GGCGTTACTA CGAGCAGGAC GACGAGACGG CGCTGCCGCG CAACATCGCC
AGCTTCAAGG CGTTCGAGAA CGCCATGACG CTCGACATCG CGATGGGCGG CTCGACCAAC
ACCGTGCTGC ATCTGTTGGC CGCGGCCTAT GAGGGCGAGA TCCCCTTCAC CATGCAGGAC
ATCGACCGGC TGTCGCGCCG GGTGCCGGTG CTGTGCAAGG TGGCGCCGGC GGTGGCCGAC
GTTCACGTCG AGGACGTGCA TCGCGCCGGC GGCGTGATGG GCATTCTCGG CGAACTCGAC
CGCGCCCGGC TGATCAACGC CGAACTGCCG ACCGTGCACT CGACCTCGCT CGGCGAAGCG
CTGAACCGCT GGGACGTGAT GCGCACCCAG AGCGACAGCG TGCGCAAGTT CTACAAGGCG
GCGCCCGGCG GCGTGCCGAC GCAGGTCGCG TTCAGCCAGG AGCAGCGCTA TGACGACGTC
GATACCGACC GCGCCAAAGG CTGCATCCGC GACGCCGAGC ACGCCTTCTC CAAGGACGGC
GGCCTGGCGG TGCTGTCCGG CAACCTCGCG ATCGACGGCT GCATCGTCAA GACCGCCGGC
GTCGACGCCA GCATCCTGAC CTTCCAGGGC CCGGCGCGGG TGTTCGAGAG CCAGGACGCC
GCGGTCGAAG GCATTCTCGG CGGCAAGATC ACAGCCGGCG ATATCGTGGT GATCCGCTAT
GAGGGGCCGC GCGGCGGCCC CGGCATGCAG GAAATGCTGT ATCCGACCAG CTACCTGAAA
TCGAAAGGCT TGGGCAAAGC CTGCGCGCTG ATCACCGACG GCCGGTTCTC CGGCGGCTCC
TCGGGGCTGT CGATCGGCCA CGTCTCGCCG GAAGCCGCCG AGGGTGGGCT GATCGGCCTC
GTCGAAGAGG GCGACCGCAT CGAGATCGAC ATTCCGCAGC GCTCGATTCG GCTCGCGGTC
GACGACGCCG TGCTGGCGGA ACGCCGCGTC GCCATGCTGG CGCGCAAGGA TGCTTGGAAG
CCCGGCAAGC GCAGCCGCAA GGTCACCTCG GCGCTGAAGG CCTACGCCGC GATGACCACC
AGCGCCGCCC GCGGCGCGGT GCGGGTGGTG AAGGATTAA
 
Protein sequence
MPAYRSRTTT HGRNMAGARG LWRATGMKNE DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV 
AREIENAGGV AKEFNTIAID DGIAMGHDGM LYSLPSRELI ADSVEYMVNG HCADAMVCIS
NCDKITPGML MASLRLNIPT IFVSGGPMEA GKVTVGGKKR AVDLIDAMVA AADDRVSDAD
VEAIERSACP TCGSCSGMFT ANSMNCLTEA LGLALPGNGS VLATHADRKA LFVEAGHLIV
DLARRYYEQD DETALPRNIA SFKAFENAMT LDIAMGGSTN TVLHLLAAAY EGEIPFTMQD
IDRLSRRVPV LCKVAPAVAD VHVEDVHRAG GVMGILGELD RARLINAELP TVHSTSLGEA
LNRWDVMRTQ SDSVRKFYKA APGGVPTQVA FSQEQRYDDV DTDRAKGCIR DAEHAFSKDG
GLAVLSGNLA IDGCIVKTAG VDASILTFQG PARVFESQDA AVEGILGGKI TAGDIVVIRY
EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGS SGLSIGHVSP EAAEGGLIGL
VEEGDRIEID IPQRSIRLAV DDAVLAERRV AMLARKDAWK PGKRSRKVTS ALKAYAAMTT
SAARGAVRVV KD