Gene RPC_3191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3191 
Symbol 
ID3972202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3531089 
End bp3532789 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content67% 
IMG OID637926301 
Productdihydroxy-acid dehydratase 
Protein accessionYP_533052 
Protein GI90424682 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.488817 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATG GACTGCGCAA GGGCCTGACC TCTTACGGCG ACGCCGGGTT CTCGCTGTTC 
CTGCGCAAGG CCTTCATCAA GGCGATGGGC TATTCCGACG ACGCGCTGGA TCGGCCGATC
GTCGGCATCA CCAACACCCA CAGCGACTAC AATCCCTGCC ACGGCAACGT GCCGCAGATC
ATCGAGGCGG TGAAGCGCGG CGTGATGCTG GCGGGCGCGA TGCCGATGGT GTTTCCAACT
ATCTCGATCG CCGAGAGTTT CGCGCATCCG ACCTCGATGT ATCTGCGCAA CCTGATGGCG
ATGGACACCG AGGAGATGAT TCGCGCCCAG CCGATGGACG CGGTGGTGGT GATCGGCGGC
TGCGACAAGA CGCTGCCGGC GCAGATCATG GCCGCGGTGT CGGCGGATCT GCCGACCGTG
GTGATTCCGG TCGGGCCGAT GGTGGTCGGC CATCACAAGG GCGAGGTGCT GGGCGCTTGC
ACCGACTGCC GCAGGTTGTG GGGCAAGTTT CGCGCCGGCG AGATGGATGA GGCCGAGATC
GAGGCGGTCA ACGGCCGGCT GGCGCCCTCG GTCGGCACCT GCATGGTGAT GGGCACCGCC
TCGACCATGG CCTGCATCAC CGAAGCGCTC GGGCTGTCGC TGCCGATGAG CGCGACGATT
CCGGCGCCGC ACGCCGAGCG ATTTCGTTCC GCCGAACAAA GCGGCAAGCT CGCCGCGGCA
ATGGCGGTGG CGAAGGGGCC GAAGCCCAGC GAGCTGTTGA CGCCGGCGGC ATTGCGCAAT
GCGCAAGTGG TGCTGCAGGC GATCGGCGGC TCCACCAACG GGCTCATTCA TCTCACCGCG
ATCGCCGGGC GAACGACGTA TCGCCTCGAT CTGGCGGCGT TCGATCGGCT GTCGCGCGAG
GTGCCGGTGC TGGTCGATCT GAAGCCGTCG GGCGATCACT ACATGGAGCA CTTCCATCAC
GCCGGCGGCG TGCCGAAACT GTTGGCGCAA CTCGGCGAGC TGATCGATCT CGACGCCAAA
ACGATTTACG GCAGCTTGCG CGATGCGGTG GCCGCGGCCG AGGACGTGCC GGGGCAGGAC
GTCATTCGCG CGCGCAACGA TCCGATCCGC AGCGAAGGCG CGATGGCGGT GCTATCCGGC
AATCTGGCGC CGCGCGGCGC GGTGATCAAG CACTCGGCGG CGTCGCCAAA GCTCCTTCAG
CACAGCGGCC GCGCCGTGGT GTTCGACAGT CTCGAGGACA TGGCGGCGCG GATCGACGAT
CCGGGTCTCG ACGTTGCGGC CGATGACGTG CTGGTGCTGC GCAATGCCGG GCCGCAGGGC
GCGCCGGGGA TGCCGGAGGC CGGCTATCTG CCGATTCCGC TGAAGCTGGC GCGCGCCGGC
GTCAAGGACA TGGTGCGGAT TTCCGACGCC CGGATGAGCG GCACCGCGTT CGGCACCATC
GTGCTGCACA TCACGCCGGA GAGCGCGGCC GGCGGGCCTT TGGCTTTGGT GCAAAACGGC
GACGTGATCC GGCTCGACGT CGAAGCGCGC CGCATCGATC TGATGGTCGA GGACGATGAG
TTAGCACGCC GCCGCCAAGC GCTGCCGGCG TCGCGGCAGC CCGCGCCGCT GCGCGGCTAT
GCGCGGTTGT TCCACCAGAC GATCCTGCAG GCCGATCAAG GCTGCGATTT CGATTTTCTG
ACCGGGCAGG GCGGGGATTA A
 
Protein sequence
MSDGLRKGLT SYGDAGFSLF LRKAFIKAMG YSDDALDRPI VGITNTHSDY NPCHGNVPQI 
IEAVKRGVML AGAMPMVFPT ISIAESFAHP TSMYLRNLMA MDTEEMIRAQ PMDAVVVIGG
CDKTLPAQIM AAVSADLPTV VIPVGPMVVG HHKGEVLGAC TDCRRLWGKF RAGEMDEAEI
EAVNGRLAPS VGTCMVMGTA STMACITEAL GLSLPMSATI PAPHAERFRS AEQSGKLAAA
MAVAKGPKPS ELLTPAALRN AQVVLQAIGG STNGLIHLTA IAGRTTYRLD LAAFDRLSRE
VPVLVDLKPS GDHYMEHFHH AGGVPKLLAQ LGELIDLDAK TIYGSLRDAV AAAEDVPGQD
VIRARNDPIR SEGAMAVLSG NLAPRGAVIK HSAASPKLLQ HSGRAVVFDS LEDMAARIDD
PGLDVAADDV LVLRNAGPQG APGMPEAGYL PIPLKLARAG VKDMVRISDA RMSGTAFGTI
VLHITPESAA GGPLALVQNG DVIRLDVEAR RIDLMVEDDE LARRRQALPA SRQPAPLRGY
ARLFHQTILQ ADQGCDFDFL TGQGGD