Gene RPD_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1841 
Symbol 
ID4022323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2060790 
End bp2061917 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content62% 
IMG OID637962035 
Producttartrate dehydrogenase 
Protein accessionYP_568978 
Protein GI91976319 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR02089] tartrate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.473858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACGC TGAATCACTG CGCAGCTCAC GAATTCGAGG GACAAGCCTT GAATACCAAG 
AAGAACGAAT ATCGGATCGC GGTTATTCCG GGAGACGGCA TCGGCAAGGA AGTGATGCCG
GAGGGACTGC GCGCGCTCGA AGCCGCGGCG AAGAAGCACC GCGTCAAGCT CGCGTTCGAT
CATTTCGACT TCGCGAGCTA CGACTACTAC GAAAAGCACG GCCAGATGAT GCCGGACGAC
TGGAAGGAAG CGATCGGCGG GCACGACGCG ATCTTCTTCG GTGCTGTCGG CTGGCCGGAG
AAGATCCCCG ATCACATTTC GCTGTGGGGC TCGCTCATCA AGTTTCGCCG CGAGTTCGAT
CAGTACGTCA ATCTGCGCCC GGTGCGGCTG ATGCCGGGCG TGCCGTCGCC GCTCGCCGGC
CGCAAGCCGG GCGATATCGA TTTCTGGGTG GTGCGCGAGA ACACCGAGGG TGAGTATTCA
TCGGTCGGCG GCCGTATGTT TCCTGACACC GACCGCGAAT TCGTGACGCA GCAGACGGTG
ATGACCCGGA CCGGCGTCGA CCGGATTCTG AAATTCGCTT TCGAACTTGC GCTGTCGCGG
CCGAAGCGGC ATTTGACCTC GGCCACCAAA TCGAACGGCA TCTCGATCAC CATGCCGTAT
TGGGACGAGC GGGTCGAGGC GATGGCGAAA CGCTATCCCG ATGTGAAGTG GGACAAGTAT
CACATCGATA TTCTCACCGC GCATTTCGTT CTGCATCCGG ATTGGTTCGA CGTCGTGGTC
GGCTCCAATC TGTTCGGTGA CATTCTGTCG GACCTCGGTC CGGCCTGCAC CGGGACGATC
GGGATTGCGC CGTCCGGCAA CATCAATCCG GAAGGCAATT TCCCGAGCGT GTTCGAGCCG
GTGCACGGCT CGGCGCCGGA TATTGCGGGG CAGGGCATCG CCAATCCGAT CGGGATGATC
TGGTCGGGCG CGATGATGCT GGAGCATCTC GGCGAGAAGA CTGCCGCCGA CGCGATCGTC
AAGGCGATCG AACGCACGCT CGCCGAGCGG ACGCTCCGCA CCCGTGATCT CGGCGGTCAG
GCCGACACGA CCGCCTGCGG CAAGGCGGTG GCGGAGATGC TGGAGTAG
 
Protein sequence
MLTLNHCAAH EFEGQALNTK KNEYRIAVIP GDGIGKEVMP EGLRALEAAA KKHRVKLAFD 
HFDFASYDYY EKHGQMMPDD WKEAIGGHDA IFFGAVGWPE KIPDHISLWG SLIKFRREFD
QYVNLRPVRL MPGVPSPLAG RKPGDIDFWV VRENTEGEYS SVGGRMFPDT DREFVTQQTV
MTRTGVDRIL KFAFELALSR PKRHLTSATK SNGISITMPY WDERVEAMAK RYPDVKWDKY
HIDILTAHFV LHPDWFDVVV GSNLFGDILS DLGPACTGTI GIAPSGNINP EGNFPSVFEP
VHGSAPDIAG QGIANPIGMI WSGAMMLEHL GEKTAADAIV KAIERTLAER TLRTRDLGGQ
ADTTACGKAV AEMLE