Gene RPC_3648 details


Gene Information

Locus tag: RPC_3648
Symbol:
ID: 3972019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4057774
End bp: 4059573
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 66%
IMG OID: 637926757
Product: dihydroxy-acid dehydratase
Protein accession: YP_533502
Protein GI: 90425132
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
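
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon. The record does not state either convention, although its numbers are consistent with both. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4057774, 4059573)
print(length, protein_length_aa(length))  # prints: 1800 599
```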


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.458648
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAGC CGGTGACCGG GCGCAAGCTG CGCTCCAGCG AATGGTTCAA TGACCCTCAC 
AATCCGGCGA TGACCGCGCT CTATCTGGAG CGCTATCTGA ACTACGGACT GACACGCAAA
GAGCTGCAGG CCGGCAAGCC GATCATCGGC ATCGCGCAGA CCGGCAACGA TTTGTCGCCG
TGCAACAGGC ATCACCTGGA ACTGGCGCAG CGGGTGCGCG AGGGCATCCG CGAGGCCGGC
GGCATCGCGA TGGAATTCCC GATGCACCCG ATCCAGGAAA CCGGCAAGCG GCCGACCGCG
GCGCTCGACC GCAACCTGGC CTATCTCGGG CTGGTCGAAA TCCTGTTCAG CTACCCGCTC
GACGGCGTGG TGCTGACCAC CGGCTGCGAC AAGACCACCC CGGCCTGCCT GATGGCGGCG
GCGACCGTCA ACCTGCCGGC GATCGTACTG TCCGGCGGGC CGATGCTGAA CGGCTGGCAC
GAGGGCGAGC GCACCGGCTC CGGCACGGTG ATCTGGAAAT CCCGCGAGCG GATGGCCGCG
GGCGAGATCG ACTACGAGGA ATTCATGGAC ATCGTCGCCT CCTCGGCGCC CTCGGTCGGC
CATTGCAACA CCATGGGCAC GGCGTCGACG ATGAATGCGC TGGCGGAAGC GCTGGGGATG
TCGCTGCCCG GCTGTGCCGC GATCCCGGCG CCCTATCGCG AACGCGGCCA GATCGCTTAT
CAGACCGGTT TGCGCGCGGT GCAAATGGTC TGGGAAGATC TCAAGCCCTC CGACATTCTC
ACCAGGCAAG CCTTCGAGAA TGCCATCGTG GTGAACTCAG CGATTGGCGG CTCCACCAAC
GCGCCGATCC ATCTCAACGC GCTGGCCCGC CATATCGGCG TGGAGCTCTC GATCGACGAC
TGGCAGAGCG TCGGCCACAA GATCCCGCTG CTGGTCAACA TGCAGCCGGC GGGCTTCTAT
CTCGGCGAGG AATTCCATCG CGCCGGCGGC GTGCCGGCCG TGGTGCGCGA ACTCATGAAG
CACGGCAAGA TCCACAAGGA CGCGCTGACG GTGAATGGCC GCGGCATCGG CGTGAACTGC
GCCAATGCGC CGTTGCCCGA CGGCGAGGTG ATCAAGACTT ACGACGGCCC GCTGGTGCAG
GACGCCGGCT TCCTGGTGTT GCGCGGCAAC CTGTTCGATT CGGCGATCAT GAAGACCAGC
GTGATCTCGC TGGAATTCCG CGAGCGCTAT CTGGCGACGC CGGGCGATCT CAACGCCTTC
GAGGGCCGCG CCATCGTGTT CGAAGGCCCG GAGGACTATC ATGCCCGGAT CGACGACGAA
GCGCTCGAGG TCGACGAGCA CTGCATCCTG TTCGTACGCG GCACCGGGCC GATCGGCTAT
CCGGGCGGCG CCGAGGTGGT CAACATGCAG CCGCCGGCGG CCTTGATCAA ACGCGGCATC
CACTCGCTGC CCTGCATCGG CGACGGACGG CAATCCGGCA CGTCCGGCTC GCCGTCGATC
CTCAACGCTA CGCCGGAAGC CGCCGCCGAT GGCGGCCTCG CCATCCTGCG CACCGGCGAC
AAGGTGCGCA TCGACCTCAA CCTCGGCAGC GCCAATATCC TGATCTCGGA TGAGGAGCTG
GCGCAACGCC GCGCCGAGCT GAAAGCCCAT GGCGGATTCA AATATCCGGC GCACCAGACG
CCGTGGCAGG AATTGTATCG CGCAACGGTC GGCCAACAGG CCACCGGCGC CTGCCTTGAG
CTTGCGACGC GCTATCACGA CATCGCAGGC AAAGTCGGCG TCGCGAGACA TAATCATTAG
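
The GC content field in the gene information is the percentage of G and C bases in the gene sequence. The following is a minimal sketch of how that figure could be recomputed from the listing, assuming the sequence is pasted as text with whitespace between the 10-base groups. `clean_sequence` and `gc_percent` are illustrative helpers, not part of any tool named in this record. Only the first listed line is used here, so the printed percentage is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste all lines to check the full gene.
first_line = "ATGGACAAGC CGGTGACCGG GCGCAAGCTG CGCTCCAGCG AATGGTTCAA TGACCCTCAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```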
 
Protein sequence
MDKPVTGRKL RSSEWFNDPH NPAMTALYLE RYLNYGLTRK ELQAGKPIIG IAQTGNDLSP 
CNRHHLELAQ RVREGIREAG GIAMEFPMHP IQETGKRPTA ALDRNLAYLG LVEILFSYPL
DGVVLTTGCD KTTPACLMAA ATVNLPAIVL SGGPMLNGWH EGERTGSGTV IWKSRERMAA
GEIDYEEFMD IVASSAPSVG HCNTMGTAST MNALAEALGM SLPGCAAIPA PYRERGQIAY
QTGLRAVQMV WEDLKPSDIL TRQAFENAIV VNSAIGGSTN APIHLNALAR HIGVELSIDD
WQSVGHKIPL LVNMQPAGFY LGEEFHRAGG VPAVVRELMK HGKIHKDALT VNGRGIGVNC
ANAPLPDGEV IKTYDGPLVQ DAGFLVLRGN LFDSAIMKTS VISLEFRERY LATPGDLNAF
EGRAIVFEGP EDYHARIDDE ALEVDEHCIL FVRGTGPIGY PGGAEVVNMQ PPAALIKRGI
HSLPCIGDGR QSGTSGSPSI LNATPEAAAD GGLAILRTGD KVRIDLNLGS ANILISDEEL
AQRRAELKAH GGFKYPAHQT PWQELYRATV GQQATGACLE LATRYHDIAG KVGVARHNH
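
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, which assigns the same amino acids to codons as the standard genetic code. The following is a minimal sketch of that mapping, assuming the NCBI layout of the code (bases ordered T, C, A, G) and ignoring the alternative start codons that table 11 permits. `translate` is an illustrative helper, and only the first 30 bases of the gene are translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGGACAAGC CGGTGACCGG GCGCAAGCTG"))  # prints: MDKPVTGRKL
```

The result matches the first ten residues of the protein sequence listed above.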