Gene RPD_3110 details

Gene Information

Locus tag: RPD_3110
Symbol:
ID: 4023615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3458902
End bp: 3461082
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 65%
IMG OID: 637963311
Product: Pyrrolo-quinoline quinone
Protein accession: YP_570237
Protein GI: 91977578
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4993] Glucose dehydrogenase
TIGRFAM ID: [TIGR01168] Gram-positive signal peptide, YSIRK family
            [TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family
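
The coordinate, gene-length, and protein-length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3458902, 3461082)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the record: 2181 bp and 726 aa.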


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.125237
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAGC CTGCCGAGCC GGTGTGGTTT GCGCACCGAT TCCTCCGTGA CAGCAAGCAA 
AAGCGATCCG CCAGCAGAGA CCTGAGCGCC GGCGTAGCCG CCGCCGTCGT CGCGGTTCTG
TTGCATGGAA GCGCTGTGGC GCAGGGCGCC AAGGGCTCCG CCGAACACAT CCGCGCGGTC
ACCGGCGCGA TCGACAGCGC GGCAATCGTC GCCAATGCCA AGACCACCAA CGATTGGCCG
AGCTACGGGC TCGATTACGC CGAGACCCGA TTCAGCAAGC TCGATCAGAT CAATGCCGAC
AACGTCAAGT CGCTCGGCCT GCAATGGAGC TACAGTCTCG GATCGGACCG CGGTGTCGAG
GCGACGCCGG TGGTGGTCGA CGGCATCATG TATGTGACCG CGTCGTGGAG CGTCGTCCAT
GCGGTCGACA CGCGAACCGG CAAGAAGCTC TGGACCTACG ACCCCGGCGT CGACCGGTCG
AAGGGTTATC GCGGCTGCTG CGACGTCGTG AATCGCGGCG TTGCGCTCTA CAAGGGCAAG
GTCTTCGTCG GCGCCTATGA TGGCCGCCTG GTTGCGCTCG ATGCTGCGAC CGGCGCGAAG
GTCTGGGAGA AGGATACGCT GATCGACCAC GAGCATTCCT ACACGATCAC CGGCGCGCCG
CGGGTGTTCA ACGGCAAGGT GGTGATCGGC AATGGCGGCG CCGAGTATGG CGTGCGCGGC
TATGTCACCG CTTACGACGC CGAGACCGGC AACCAGGCCT GGCGCTGGTT CACCGTTCCG
GGCGATCCAT CAAAGCCGTT CGAGGACGCC TCGATGGAAG CGGCGGCGAA GACCTGGGAT
CCGGCCGGCA AATGGTGGAT CAACGGCGGC GGCGGCACGG CGTGGGACAC CATCACCTTC
GATCCCGACC TCAACATGGT CTACATCGGC ACTGGCAACG GATCGCCCTG GAACAGGAGC
TTGCGCAGCC CGGCCGGCGG CGACAACCTC TATCTCGGCT CGATCGTCGC GCTCAACGCC
GACACCGGCA AATACGTCTG GCACTATCAG GAGACGCCCG GCGACAATTG GGACTACACA
TCCACCCAGC CGATGATCCT CGCCGACCTC ACGATCGACG GTCAGCCGCG CAAGGTGGTG
CTGCACGCGC CGAAGAACGG CTTCTTCTTC GTGATCGACC GCACCAACGG CAAGTTCATC
TCGGCGAAGA ACTTCGTCGA TGTGAACTGG GCCACCGGCT ACGACGCGGG CGGCAGGCCG
ATCGAGGCGC CGGAGGCGCG CTCGCCGGAC AAATCCTTCG ACAGCATTCC GGGGCCGTAC
GGCGCGCATA ACTGGCACCC GATGTCGTTC AATCCGCAGA CCGGCCTCGT CTATCTGCCG
GCGCAAGGCG TGCCCATCAA CCTGACCGGT GAAAAGGCGC TGACCCAGAA CAAGATGGAG
CCGTTCAAAT TCGGCAGCAC CACCGGCTGG AACGTCGGTT TTGCGCTGAA TGCGACGCCT
CCAAAGAACT TGCCGTTCGG CCGGCTGCTG GCCTGGGATC CTGTCCAGCA GAAGGAGGTC
TGGCGCGCGG AGTATGTCGC GCCCTGGAAC GGCGGCACCC TGACCACCGC AGGCAATCTC
GTATTCCAGG GCACGGCGGA CGGCCGCTTC GTCGCGTACA ACGCCAAGAC CGGCGAGAAA
TTGTGGCAAA GTCCGCTCGG CACCGGCGCG GTCGCTGCGC CGGCCACCTA CATGGTCGAC
GGCGTGCAAT ATGTCTCGAT CGCGGTCGGC TGGGGCGGCG TGTTCGGCAT CAGCCAACGC
GCGACCGAAA CCGAAGCGCC GGGCACGGTC TACACTTTCG CGGTGGGCGG CAAGGCGCCG
ATGCCGGAAT TCGCCAAGTA TCAGATGGGC AATCTGCTGA CCGGCATCGA ATACGATCCG
AAGGATGTCC TGGAGGGCAC CGCGATCTAT GTCGCGGCCT GCGCCACCTG CCACGGGGTG
CCCGGCGTCG ACCGCGGCGG CAACGTCAAG AACCTCGGCT ACAGCACCAC CGAAAAGATC
GGGCACCTGA AGGACATCGT GTTCAAGGGG CCATTCCGCG ACAAGGGGAT GCCGGACTTC
ACCGGCAAGC TCACCGAGGC GGATGTGGTG AAGATCCAGG CCTTTATCCA GGGCACCGCA
GACGCGATCC GGCCGAAATA G
 
Protein sequence
MKQPAEPVWF AHRFLRDSKQ KRSASRDLSA GVAAAVVAVL LHGSAVAQGA KGSAEHIRAV 
TGAIDSAAIV ANAKTTNDWP SYGLDYAETR FSKLDQINAD NVKSLGLQWS YSLGSDRGVE
ATPVVVDGIM YVTASWSVVH AVDTRTGKKL WTYDPGVDRS KGYRGCCDVV NRGVALYKGK
VFVGAYDGRL VALDAATGAK VWEKDTLIDH EHSYTITGAP RVFNGKVVIG NGGAEYGVRG
YVTAYDAETG NQAWRWFTVP GDPSKPFEDA SMEAAAKTWD PAGKWWINGG GGTAWDTITF
DPDLNMVYIG TGNGSPWNRS LRSPAGGDNL YLGSIVALNA DTGKYVWHYQ ETPGDNWDYT
STQPMILADL TIDGQPRKVV LHAPKNGFFF VIDRTNGKFI SAKNFVDVNW ATGYDAGGRP
IEAPEARSPD KSFDSIPGPY GAHNWHPMSF NPQTGLVYLP AQGVPINLTG EKALTQNKME
PFKFGSTTGW NVGFALNATP PKNLPFGRLL AWDPVQQKEV WRAEYVAPWN GGTLTTAGNL
VFQGTADGRF VAYNAKTGEK LWQSPLGTGA VAAPATYMVD GVQYVSIAVG WGGVFGISQR
ATETEAPGTV YTFAVGGKAP MPEFAKYQMG NLLTGIEYDP KDVLEGTAIY VAACATCHGV
PGVDRGGNVK NLGYSTTEKI GHLKDIVFKG PFRDKGMPDF TGKLTEADVV KIQAFIQGTA
DAIRPK
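
The gene and protein sequences above can be cross-checked against each other, and the GC content can be recomputed from the gene sequence. The sketch below is one way to do this in plain Python, not the method used by the database. It relies on NCBI translation table 11 using the same codon-to-amino-acid assignments as the standard code (it differs only in the permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGAAACAGC CTGCCGAGCC GGTGTGGTTT GCGCACCGAT TCCTCCGTGA CAGCAAGCAA"
print(translate(first_line))
```

The printed 20 residues should match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.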