Gene Rru_A1786 details

Gene Information

Locus tag: Rru_A1786
Symbol:
ID: 3835208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 2076676
End bp: 2078553
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 66%
IMG OID: 637825883
Product: dihydroxyacid dehydratase
Protein accession: YP_426873
Protein GI: 83593121
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
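
The start, end, gene length, and protein length fields in this record are arithmetically linked. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues: codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2076676, 2078553)
print(length)                     # 1878
print(protein_length_aa(length))  # 625
```

Both values agree with the Gene Length and Protein Length fields listed in this record.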


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCAAGA CAGGCTCAAG GCAAATGACC CCATACCGTT CCCGCACATC AACCCACGGG 
CGCACGATGG CCGGTGCCCG TGGCCTTTGG CGCGCCACTG GCATGAAGGA CGAGGATTTC
GGCAAGCCGA TCATCGCCAT CGCCAATTCC TTCACCCAAT TCGTGCCCGG CCATGTCCAC
CTGAAGGACA TGGGCCAATT GGTGGCCGCC GAGATTGCCG CCGCCGGTGG CGTGGCCAAG
GAATTCAACA CCATCGCCGT CGATGACGGC ATCGCCATGG GCCATGACGG CATGCTCTAT
AGCCTGCCGT CGCGCGAACT GATCGCCGAT GCGGTCGAAT ACATGGTCAA CGCCCATTGC
GCCGACGCCC TGGTGTGCAT TTCCAATTGC GACAAGATCA CCCCGGGCAT GCTGATGGCG
GCGATGCGCC TGAACATCCC GACCATCTTC GTCTCCGGCG GGCCGATGGA AGCGGGCAAG
GTGGTGTTGG GCGGCACGGA ACGCAGCGTC GATCTGATCG ACGCCATGGT GGTCGCCGGC
GATGCCAAGG TTTCGGATGC CGATGTCGAG ACCATCGAGC GCTCGGCCTG TCCGACCTGC
GGCTCGTGTT CGGGAATGTT CACCGCCAAT TCGATGAACT GCCTGACCGA GGCCCTGGGG
CTGTCGCTGC CGGGCAATGG CAGCGCCCTG GCCACCCATG TGGCGCGGCG CGGGCTGTTC
GAGGAGGCCG GACGGCGCAT CGTCGATCTG GCCAAGCGAC GCTATGAGCA CGACGACGAA
AGCACCCTGC CGCGCGCCAT CGCCTCGTTC AAGGCCTTTG AAAACGCCAT GAGCGTCGAT
ATCGCCATGG GCGGGTCGAC CAATACGGTG CTCCACCTGC TCGCCGCCGC CCAGGAGGGC
GAGGTGCCCT TCACCATGGC CGATATCGAC CGGCTGTCGC GGCGCATTCC CCATCTGTGC
AAGGTTTCGC CAAGCACGGC CGACTTCTAT ATGGAGGATG TTCACCGCGC CGGCGGCGTC
ATGGGCATCA TGGGCGAGTT GTCGCGGGCC GGCCTGCTCC ATGAGGACCT GCCGACGGTG
CATACGCCGA CGCTGAAGGC GGCGCTTGAT CACTGGGATA TCCGGCGGCC GGTCGACGAC
GCGGTGCGCG CCTTCTTTCG GGCGGCGCCC GGCGGCGTGC GCACCGTCGT GCCGTTTTCC
ACCGACCGCC TGTGGGACAG CCTGGACGAT GACCGCGAGA CCGGCTGCAT CCGCGATCTC
GACCACGCCT ATTCCCGCGA TGGCGGGCTG GCCGTGCTCT ATGGCAATCT GGCGCCCAAC
GGCTGCATCG TGAAAACCGC CGGAGTCGAC GCCTCGATCC TCACCTTCAC AGGCACCGTC
CGCCTCTGCG AAAGCCAGGA CGAGGCGGTG GCGCGTATTC TCGGCGGCGA AATTCAGGCC
GGCGACGTCG TGCTGGTGCG CTATGAGGGT CCCAAGGGCG GCCCGGGCAT GCAGGAAATG
CTTTATCCCA CCAGCTATCT GAAGTCGCGC GGCCTGGGCA AGGTCTGCGC CCTGGTCACC
GATGGCCGTT TCTCGGGCGG CAGCTCGGGC CTGTCGATCG GCCATGTCTC GCCGGAAGCC
GCCGCTGGCG GGCCGATCGG TCTGGTCGAG GAGGGGGATA TCATCGTCAT CGATATCCCG
GCGCGCAGCA TTGTCGTCGA CCTGTCCGAC GAGGAGCTGG CGGCGCGGCG AAGCGCCATG
GAGGCGCGGG GACGCGCGGG ATGGAAGCCG GCCAAACCGC GCAAGCGGGC GGTCTCTCCG
GCCCTGCGCG CCTATGCGGC CCTGACCACC AGCGCCGATC GCGGCGCCGT CCGCGACGTG
TCGCAGGTCG AGCGCTAA
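
The gene sequence is laid out in blocks of 10 bases, 60 per row, so the whitespace has to be stripped before any computation. As a rough sketch, the GC content field could be recomputed from the sequence as shown below. The helper names are hypothetical, and only the first row of the sequence is included here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence only; paste the full sequence to check the whole gene.
first_row = "GTGATCAAGA CAGGCTCAAG GCAAATGACC CCATACCGTT CCCGCACATC AACCCACGGG"
print(len(clean_sequence(first_row)))  # 60 bases in one row
print(round(gc_percent(first_row), 1))
```

Running `gc_percent` on the complete sequence gives the figure to compare against the reported GC content.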
 
Protein sequence
MIKTGSRQMT PYRSRTSTHG RTMAGARGLW RATGMKDEDF GKPIIAIANS FTQFVPGHVH 
LKDMGQLVAA EIAAAGGVAK EFNTIAVDDG IAMGHDGMLY SLPSRELIAD AVEYMVNAHC
ADALVCISNC DKITPGMLMA AMRLNIPTIF VSGGPMEAGK VVLGGTERSV DLIDAMVVAG
DAKVSDADVE TIERSACPTC GSCSGMFTAN SMNCLTEALG LSLPGNGSAL ATHVARRGLF
EEAGRRIVDL AKRRYEHDDE STLPRAIASF KAFENAMSVD IAMGGSTNTV LHLLAAAQEG
EVPFTMADID RLSRRIPHLC KVSPSTADFY MEDVHRAGGV MGIMGELSRA GLLHEDLPTV
HTPTLKAALD HWDIRRPVDD AVRAFFRAAP GGVRTVVPFS TDRLWDSLDD DRETGCIRDL
DHAYSRDGGL AVLYGNLAPN GCIVKTAGVD ASILTFTGTV RLCESQDEAV ARILGGEIQA
GDVVLVRYEG PKGGPGMQEM LYPTSYLKSR GLGKVCALVT DGRFSGGSSG LSIGHVSPEA
AAGGPIGLVE EGDIIVIDIP ARSIVVDLSD EELAARRSAM EARGRAGWKP AKPRKRAVSP
ALRAYAALTT SADRGAVRDV SQVER
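
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an alternative start codon such as GTG is read as methionine at the first position. The sketch below illustrates this without external libraries. The codon table is built from the standard NCBI amino-acid ordering, the start-codon set is the one NCBI lists for table 11, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; a recognised start codon is read as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"   # initiator codon is translated as methionine
    if protein and protein[-1] == "*":
        protein.pop()      # drop the terminal stop codon
    return "".join(protein)


# First 30 bases of the gene sequence, starting with the GTG start codon.
print(translate_cds("GTGATCAAGA CAGGCTCAAG GCAAATGACC"))  # MIKTGSRQMT
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence in the same way allows the whole 625-residue translation to be compared.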