Gene RPB_3247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3247 
Symbol 
ID3911048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3709231 
End bp3710940 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content68% 
IMG OID637885149 
Productdihydroxy-acid dehydratase 
Protein accessionYP_486854 
Protein GI86750358 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.652611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACG GATTGCGCAA GGGGCTGACC AGCTACGGCG ACGCCGGTTT TTCGCTGTTC 
CTGCGCAAGG CGTTCATCAA GGCGATGGGC TATTCCGACG ACGCGCTGGA GCGGCCGATC
GTCGGCATCA CCAATACCCA CAGCGATTAC AATCCGTGCC ACGGCAACGT GCCGCAGATC
ATCGAGGCGG TGAAGCGCGG CGTGATGCTG GCGGGCGCGA TGCCGATGGT GTTTCCGACC
ATCTCGATCG CCGAGAGCTT CGCGCATCCG ACCTCGATGT ATCTGCGCAA TCTGATGGCG
ATGGACACCG AGGAGATGAT CCGCGCCCAG CCGATGGATG CGGTGGTAGT GATCGGCGGC
TGCGACAAGA CGCTGCCGGC GCAGATCATG GCGGCGGTGT CGGCGGATCT GCCGACGGTG
GTGATCCCGG TCGGGCCGAT GGTGGTCGGC CATCACAAGG GCGAAGTGCT GGGTGCCTGC
ACCGACTGCC GGCGGTTGTG GGCGAAGCAT CGCGCCGGTG AGATCGACGA GGCGGAGATC
GAGGCCGTCA ACGGCCGGCT GGCGCCGTCG GTCGGCACCT GCATGGTGAT GGGCACCGCC
TCGACGATGG CGTGTCTCAC CGAGGCGATG GGCCTGTCGC TGCCGATGAG CGCGACGATC
CCGGCGCCGC ATGCCGAGCG GTTTCGCTCG GCGGAAGAAA GCGGCAGGGT CGCGGCTGCG
ATGGCCAAGG CGAAAGGCCC GAAGCCGAGC GATCTGCTGA CCCCCGCCGC GTTCCGCAAC
GCGCAAGTCG TGCTGCAGGC GATCGGCGGC TCGACCAACG GACTGATTCA TCTCACCGCG
ATCGCCGGCC GCGTGCCGCA TAAGATCGAC CTCGACGGTT TCGACCGGAT CGGCCGCGAC
GTGCCGGTGC TGGTCGATCT GAAGCCGTCG GGCGATCACT ACATGGAGCA TTTTCATCAC
GCCGGCGGCG TGCCGAAGCT GATGGCGCAG CTCGGCGAAC TGATCGATCT CGACGCGCGG
ACGATCACCG GCGCGCCGCT GCGCGACATC GTCGCCAGGG CCGAACACGT GCAGGGCCAG
GACGTGATCC GCTCGCGCGA CAATCCGATC CGGCGCGAGG GCGGGCTCGC GATGCTCACC
GGCAATCTGG CGCCGCGCGG CGCGGTGATC AAACACGCCG CCGCGTCGCC GCAACTGATG
CAGCACACCG GCCGCGCCGT GGTGTTCGAC TCGGTCGAGG ACATGACGCT GCGGATCGAC
GATCCCGATC TCGACGTTGC GGCCGACGAC GTGCTGGTGC TGCGCAATGC CGGGCCGCGC
GGCGCGCCGG GGATGCCGGA GGCGGGCTAT CTGCCGATCC CGATGAAGCT GGCGCGGGCG
GGCATCAAAG ACATGGTGCG CATTTCGGAC GCGCGGATGA GCGGCACCGC GTTCGGTACC
ATCGTGCTGC ACATCACGCC AGAGAGCGCG GATGGTGGGC CGCTGGCGCT GGTCGAAACC
GGCGACCGGA TCGCGCTGGA TGTCGCGGCG CGGCGGATCG ATCTGTTGGT TGACGAAAGC
GAACTCGCGC GCCGCCGTGC CGCATTGTCG TCGTCAGCCG CGGCGCGGCC GACGCGCGGC
TATGCGCAAC TGTTTCACGA CACCATCCTG CAGGCCGACG AGGGCTGCGA TTTCGATTTT
CTCACCGCAG CCGGGCGCAG CGAGCGTTGA
 
Protein sequence
MADGLRKGLT SYGDAGFSLF LRKAFIKAMG YSDDALERPI VGITNTHSDY NPCHGNVPQI 
IEAVKRGVML AGAMPMVFPT ISIAESFAHP TSMYLRNLMA MDTEEMIRAQ PMDAVVVIGG
CDKTLPAQIM AAVSADLPTV VIPVGPMVVG HHKGEVLGAC TDCRRLWAKH RAGEIDEAEI
EAVNGRLAPS VGTCMVMGTA STMACLTEAM GLSLPMSATI PAPHAERFRS AEESGRVAAA
MAKAKGPKPS DLLTPAAFRN AQVVLQAIGG STNGLIHLTA IAGRVPHKID LDGFDRIGRD
VPVLVDLKPS GDHYMEHFHH AGGVPKLMAQ LGELIDLDAR TITGAPLRDI VARAEHVQGQ
DVIRSRDNPI RREGGLAMLT GNLAPRGAVI KHAAASPQLM QHTGRAVVFD SVEDMTLRID
DPDLDVAADD VLVLRNAGPR GAPGMPEAGY LPIPMKLARA GIKDMVRISD ARMSGTAFGT
IVLHITPESA DGGPLALVET GDRIALDVAA RRIDLLVDES ELARRRAALS SSAAARPTRG
YAQLFHDTIL QADEGCDFDF LTAAGRSER