Gene RPD_3324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3324 
Symbol 
ID4023834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3683658 
End bp3685466 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content66% 
IMG OID637963528 
Productdihydroxy-acid dehydratase 
Protein accessionYP_570449 
Protein GI91977790 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGA TCACCCCGGG GACTGCCCGG CGCAAGCTCC GCTCCAGCGA ATGGTTCAAC 
GACCCGCACA ACCCCGCGAT GACCGCGCTG TATCTCGAGC GCTATCTGAA CTACGGGCTG
ACCCGCGGCG AGCTGCAATC CGGCAAGCCG ATCATCGGCA TCGCGCAAAC CGGAAACGAT
TTGTCGCCGT GCAACCGCCA TCATCTGGAA TTGGCGCAGC GTGTCCGCGA AGGCATCCGC
GCCGCCGGCG GCATCGCGAT GGAATTCCCG GTGCATCCGA TCCAGGAAAC CGGCAAGCGG
CCGACTGCGG CGCTCGATCG CAATCTCGCT TATCTCGGCC TGGTCGAGAT CTTGTTCAGC
TATCCGCTCG ACGGCGTGGT GCTAACCACA GGTTGCGACA AGACCACGCC AGCCTGCCTG
ATGGCGGCGG CGACCGTCAA CATCCCGGCG ATCGTGCTGT CCGGCGGGCC GATGCTGAAC
GGCTGGCACA ATGGCGAACG CTCCGGATCG GGCACAGTGG TCTGGAAATC CCGCGAGCGC
CTCGCCGCCG GCGAGATCGA CTACGAAGAG TTCATGGAGA TCGTCGCATC GTCGGCGCCG
TCGGTCGGCC ATTGCAACAC CATGGGCACC GCCTCGACGA TGAACTCGCT GGCGGAAGCG
CTCGGCATGT CGCTGCCGGG CTGCGCCGCG ATTCCTGCGC CCTATCGCGA ACGCGGCCAG
ATCGCCTACG CCACCGGCCT GCGGGCGGTG GAGATGGTGT GGGAGGATCT GAAGCCGTCC
GACATCCTGA CCCGCAAAGC TTTCGAGAAC GCCATCGTCG TCAATTCGGC GATCGGCGGC
TCGACCAACG CGCCGATCCA TCTCAATGCA CTCGCCCGCC ACATCGGCGT CGAGCTTTCG
ATCGACGACT GGCAGAGCGT CGGCCACGCC ATTCCGCTGC TGGTCAACAT GCAGCCGGCC
GGCTTCTATC TCGGCGAGGA GTATCACCGC GCCGGCGGCG TGCCGGCGGT GGTCCGCGAA
CTGATGAGGC ACGGCAAAAT TCATACGGAC GCGATCACCG TCAACGGCCG TACCATGGGC
GACAATTGCG CGTCGGCCCC CGCCCCCGAT GGCGAGGTGA TCAAGTCCTA CGACGGGCCG
CTGGTGCAGG ACGCCGGATT CCTGGTGCTG CGCGGCAATC TGTTCGACTC GGCGATCATG
AAGACCAGCG TGATCTCGCT GGAATTCCGC GAGCGCTATC TCGCCAATCC GAACGATCCG
AACGCGTTCG AGGGCCGCGC CATCGTGTTC GAAGGGCCGG AAGACTATCA CGACAGGATC
GACGATCCGG CGCTCGACAT CGATGAGCAT TGCATCCTGT TCGTGCGCGG CACCGGGCCG
ATCGGCTATC CCGGTGGCGC CGAGGTGGTG AACATGCAGC CGCCGGCGGC GCTGATCAAA
CGCGGCATCC ACTCGCTGCC CTGCATCGGC GACGGCCGCC AGTCCGGCAC CTCGGGCTCG
CCCTCGATCC TGAACGCGAC ACCAGAAGCC GCCGCCAATG GCGGGCTCGC GATCCTCAAG
ACCGGCGACC GCGTCCGCGT CGACCTGAAC AAAGGGAGCG CCAACATTCT GATATCGGAC
GATGAACTGC GGCAGCGCCG CGCCGACCTC GAGGCGCATG GCGGCTTCGC CTATCCGAAG
CATCAGACGC CGTGGCAGGA GCTGTATCGC GCCACGGTCG GCCAGCAGGC CACCGGCGCC
TGCCTCGAAC TCGCCACCCG CTATCGCGAC ATCGCCGGCA CCGTCGGCGT CGCGCGGCAC
AATCATTGA
 
Protein sequence
MNKITPGTAR RKLRSSEWFN DPHNPAMTAL YLERYLNYGL TRGELQSGKP IIGIAQTGND 
LSPCNRHHLE LAQRVREGIR AAGGIAMEFP VHPIQETGKR PTAALDRNLA YLGLVEILFS
YPLDGVVLTT GCDKTTPACL MAAATVNIPA IVLSGGPMLN GWHNGERSGS GTVVWKSRER
LAAGEIDYEE FMEIVASSAP SVGHCNTMGT ASTMNSLAEA LGMSLPGCAA IPAPYRERGQ
IAYATGLRAV EMVWEDLKPS DILTRKAFEN AIVVNSAIGG STNAPIHLNA LARHIGVELS
IDDWQSVGHA IPLLVNMQPA GFYLGEEYHR AGGVPAVVRE LMRHGKIHTD AITVNGRTMG
DNCASAPAPD GEVIKSYDGP LVQDAGFLVL RGNLFDSAIM KTSVISLEFR ERYLANPNDP
NAFEGRAIVF EGPEDYHDRI DDPALDIDEH CILFVRGTGP IGYPGGAEVV NMQPPAALIK
RGIHSLPCIG DGRQSGTSGS PSILNATPEA AANGGLAILK TGDRVRVDLN KGSANILISD
DELRQRRADL EAHGGFAYPK HQTPWQELYR ATVGQQATGA CLELATRYRD IAGTVGVARH
NH