Gene RPD_0619 details


Gene Information

Locus tag: RPD_0619
Symbol:
ID: 4021088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 701286
End bp: 702539
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 62%
IMG OID: 637960807
Product: putative L-sorbosone dehydrogenase
Protein accession: YP_567758
Protein GI: 91975099
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTCGA TCTTCAGACG ATCCGTTGTT GCATGCGCTG CAGTCCTGGC GATCGGGGCT 
CTCGATCAAG CTGTCGCTCA GGGCCTCAAG AAATACGATT CCGACAAGAA GGACTTCTGG
ACCAACCCGC CGCCGGATTG GTTCCTCGGC GACGAGACCG AGGCGCAGAA GGGTCTCGCG
CCGCCGGCCG GCCCGCCGAC CGGATCGTCC GATGCCGAAC TCGCCGCGAT GATGAAGAAG
ATCAAGCTGC CGCCGGGCTT CAAGATCGAA GTCTACGCCT CGGGCGTGCT GGCGGCGCGG
CAAATGGCCT GGGGCGACAA CGGCACGCTG TTCGTCGGCT CGTTCGGCCT CGGCAACGTC
TATGCGATCA CCGAGAAGGA CGCCAAGAAA CAGGTCAAGA CCGTCCTCAA GGGCATGAAG
ATGCCGACCG GCATCGCATT CCAGAACGGC GCGCTCTACG TGATCGATAT CGACAAGCTG
ATCCGCTACG ACAACGCCGA AGCCAATCTC GACAAGCTCG GCGACGGCAA GGTCGTCTAT
GACGACATGC CGTCTTACGT CGCGCACGGA TGGAAGTATC TCGCGGCGGA CAAGGACGGC
TGGTTCTACG TGCCGTTCGG CCCGCCCTTC AACATCGGCC TGCCGCCGAC CTCGCTGTCG
CAGATCCGCC GCATCGATCC CAAGACCGGC AACGCCGAAT TGGTCGCGCT CGGCGTGCGC
AATTCGGTCG GCGGCGACGT CGATCCGCGC ACCGGCAAAT ACTGGTTCAC CGAAAACGCC
CGCGACTGGA TCAGCGACGA CATGCCGAGC GACAAGCTCA ACATGATCTC GAAGCTCGGC
GAGCATTTCG GCTATCCGTA TTGCCATCAG GGCGACATGC CGGACCCGAA ATTCGCGATG
GGGCACAAAT GCTCCGAGTT CACGCCGCCG GTGCTGAACC TCGGCGCGCA TGTCGCTCCG
CTCGGCATGA AGTTCTACAC CGGCGACCAG TTCCCCGCCG AGTACAAGAA CAACATCTTC
ATCGCCGAGC ACGGCTCCTG GAATCGTCAC AAGTATCAGG GCGCGCTGAT CAAGCGCGTG
ATCGTCGATC CGGACGGCAA GAACGCCAAG CAGGAAAACT TCGCCACCGG GTGGATCGAG
GGCGACCAGG GCTATCTCGG CAGACCCGCC GACATCGTGC TGGCCAAAGA CGGTTCGATG
CTGGTGGCGG ACGATTGGGC CGGCGCGATC TATCGCATCA GCTACAGCAA GTGA
 
Protein sequence
MKSIFRRSVV ACAAVLAIGA LDQAVAQGLK KYDSDKKDFW TNPPPDWFLG DETEAQKGLA 
PPAGPPTGSS DAELAAMMKK IKLPPGFKIE VYASGVLAAR QMAWGDNGTL FVGSFGLGNV
YAITEKDAKK QVKTVLKGMK MPTGIAFQNG ALYVIDIDKL IRYDNAEANL DKLGDGKVVY
DDMPSYVAHG WKYLAADKDG WFYVPFGPPF NIGLPPTSLS QIRRIDPKTG NAELVALGVR
NSVGGDVDPR TGKYWFTENA RDWISDDMPS DKLNMISKLG EHFGYPYCHQ GDMPDPKFAM
GHKCSEFTPP VLNLGAHVAP LGMKFYTGDQ FPAEYKNNIF IAEHGSWNRH KYQGALIKRV
IVDPDGKNAK QENFATGWIE GDQGYLGRPA DIVLAKDGSM LVADDWAGAI YRISYSK
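
Consistency check (illustrative)

The reported lengths agree with each other: 1254 bp is 418 codons, that is 417 residues plus one stop codon, and End bp minus Start bp plus 1 is also 1254. The Python below is a minimal sketch, not part of the original record, of one way to check the listed sequences against the reported values. The helper names clean, gc_percent and translate are hypothetical. The amino-acid string is the NCBI translation table 11 mapping written in TCAG codon order.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0 and drop one trailing stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


# The first 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAAGTCGA TCTTCAGACG ATCCGTTGTT"))
```

Passing the full gene sequence to translate and comparing the result with the protein sequence (after clean) would test the whole record. gc_percent applied to the full gene sequence can be compared with the reported 62%.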