Gene RPD_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3041 
Symbol 
ID4023544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3388051 
End bp3389340 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content67% 
IMG OID637963240 
ProductDNA polymerase IV 
Protein accessionYP_570168 
Protein GI91977509 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.678009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.273789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGAAG CGGGGCCGCC CGGGCCGCTG TGCTTCTGTC GCGACTGCCT CACCGATCTC 
GGGGCGGACG CGCGGCGCTG CAGCGCGTGC GGCTCTCCGC GACTGCTCCG CCATCCCGCG
CTGTCGACGC TGACGCTCGC GCATATCGAC TGCGATGCGT TCTACGCGAC CGTCGAGAAG
CGCGATAATC CCGACCTCGC CGACCGGCCG GTGATCATCG GCGGCGGCAG GCGCGGCGTC
GTCTCGGCCG CCTGCTACAT CGCGCGCACC TTCGGCGTTC GCTCGGCGAT GCCGATGTTC
AAGGCGCTGG CGCTGTGCCC CTCGGCTGCT GTCGTGCGCC CCGACATGGC GAAATACGTC
CGCGTCGGCC GCGAGGTTCG CCAGGCGATG CTGCAACTCA CGCCGCTGGT CGAGCCGCTG
TCGATCGATG AGGCGTTTCT CGATCTGTCC GGCACCGAGC GGATGCACGG CGCGATCGCC
GCCAAGGTAT TGGCGCGGTT CGCCCGCGAC ACCGAACGCG ACATCGGCAT CACCGTGTCG
GTGGGCCTGT CGTGCAACAA ATTCCTCGCC AAGATCGCCT CCGACCTCGA CAAGCCGCGC
GGTTTCGCCA CGCTCGATCA GGACGATGCG AAGGCGATGC TGGGGCCCCG CCCCGTAAGC
TTCATCTTCG GCGTCGGCCC CGCGACGGCG GCTCGGGTCG CTCAGTACGG CTTCCGCACC
ATCGCCGATC TGCAGAAGGC CGACGAGATC GACCTGATGC GGCAGTTCGG CGACGAAGGA
CGGCGACTGT GGCGGCTCGC CCGCGGCATC GACAATCGCA AGGTCGTGCC GGATCGCGGC
GCCAAGTCGA TCTCCAATGA AACCACCTTC GAAACCGACA TCCGCGATCT GGAGACGCTG
GAACGGATCC TGTGGCGACT GTCGGACAAG GTTTCGTCGC GGCTGAAAAG CGCCGGCCTC
GCCGGTTCGA CCATCACGTT GAAACTGAAG TCGAGCGACT TTCGCCAGCG CACCCGCTCG
CAGACGATTC ACGCGCCGAC TCAGCTCGCC AATCGCATTT TCGCGGTGTC GCGCGAGATG
CTGGTCAAGG AAATCGACGG CACCGCCTTC CGCCTGATCG GCACCGGCGT CAGCGCGCTG
ACCGAACAGG CACAGGCCGA CGAGACCGAC ATGCTGGATG CCCGCGCCGC GACAGCCGAG
CGCGCGATCG ACGATCTGCG CAAGAAGTTC GGTGACGCCG CGGTGATCCG CGGCCTCGCC
TATAACGGAC CGGACAAACC GCGGAGTTAG
 
Protein sequence
MSEAGPPGPL CFCRDCLTDL GADARRCSAC GSPRLLRHPA LSTLTLAHID CDAFYATVEK 
RDNPDLADRP VIIGGGRRGV VSAACYIART FGVRSAMPMF KALALCPSAA VVRPDMAKYV
RVGREVRQAM LQLTPLVEPL SIDEAFLDLS GTERMHGAIA AKVLARFARD TERDIGITVS
VGLSCNKFLA KIASDLDKPR GFATLDQDDA KAMLGPRPVS FIFGVGPATA ARVAQYGFRT
IADLQKADEI DLMRQFGDEG RRLWRLARGI DNRKVVPDRG AKSISNETTF ETDIRDLETL
ERILWRLSDK VSSRLKSAGL AGSTITLKLK SSDFRQRTRS QTIHAPTQLA NRIFAVSREM
LVKEIDGTAF RLIGTGVSAL TEQAQADETD MLDARAATAE RAIDDLRKKF GDAAVIRGLA
YNGPDKPRS