Gene RPD_3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3031 
Symbol 
ID4023534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3376534 
End bp3377802 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content70% 
IMG OID637963230 
ProductDNA processing protein DprA, putative 
Protein accessionYP_570158 
Protein GI91977499 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.461494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.318128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGGCC GCCGCAGATC TTGGCGCGTG GTCTCATCAG GACCGAAGTA CCACCGCCGA 
CCCTCCCGCC CCGCCTTCAT CGAACTTGCC CACACCTATA AGTTGCGCAA CACTCGCCTT
TCGGAGGTGG GCGTGAGTGA CAGCGGCGGA AACCAAGGCA CTACGCGCCT CACCGAGGCG
CAGCGGATCG ACTGGCTGCG GCTGATCCGC GCCGACAATG TCGGGCCGCG CACCTTTCGC
TCGCTGGTCA ATCATTTCGG TTCGGCGCGC GCCGCGCTGG AGCGGCTACC CGAACTCGCC
CGCCGCGGCG GCGCAGCGCG GGCCGGTCGC ATTCCCAGCG AAGACGAGGC GCGCCGGGAG
ATCGATGGCG GCCACCGGCT CGGCGTCGAA CTGGTCGCGC CGGGCGAACC CGGCTACCCG
CCTCGCCTCG CGCTGATCGA CGACGCGCCG CCGCTGCTCG GAGTTCATTG CGTGCCCGAT
GCGCTCGCCG AGATGCAGCG GCCAATGATC GCGATCGTCG GCTCGCGCAA CGCCTCCGGC
GCCGGATTGA AATTCGCCTC CGAACTCGCG CGCGATCTCG GCGCCGCCGG CTTCGTGGTG
ATCTCGGGAC TGGCCCGCGG CGTTGATCAG GCTGCGCATC GCGCGAGCCT CGCCAATGGC
ACGGTCGCCG TGCTGGCCGG CGGTCACGAC AAGATCTATC CGCCCGAACA CGAAGACCTG
CTGCTCGACA TCGTCGAGGC GCGCGGCGCA GCGATTTCAG AGATGCCGCT CGGCCACGTC
CCGCGCGGCA AGGATTTTCC CCGCCGCAAC CGGTTGATTT CCGGCGCGGC TGTCGGGGTC
GCGGTGATCG AGGCGGCCTA TCGCTCCGGC TCGCTGATCA CCGCCCGCCG CGCCGCCGAC
CAAGGCCGCG AGGTATTTGC CGTGCCGGGC TCGCCACTCG ATCCGCGCGC CGCCGGAACC
AACGATCTGA TCAAGCAGGG GGCGACGCTG ATCACCTCGG CCGACGACAT TATCCAGGCC
GTCGCCCCGA TCATGGACCG GCCGGTGGAA TTGCCGGGCC GCGAGCCGGA ACACCCGGCT
CCGGCGAGCG AGCCGGATGC CAGCCACCGC GGCCGTATCG TCAACCTGCT CGGGCCGAGC
CCGATCGGCA TCGACGATCT GATCCGGCTG TCCGGCATCC CGCCGGCTGT CGTGCGTACC
GTGCTGCTCG AACTGGAACT CGCCGGCCGC CTCGACCGCC ACGGCGGCGG ATTGGTGTCG
CTGCTCTAG
 
Protein sequence
MIGRRRSWRV VSSGPKYHRR PSRPAFIELA HTYKLRNTRL SEVGVSDSGG NQGTTRLTEA 
QRIDWLRLIR ADNVGPRTFR SLVNHFGSAR AALERLPELA RRGGAARAGR IPSEDEARRE
IDGGHRLGVE LVAPGEPGYP PRLALIDDAP PLLGVHCVPD ALAEMQRPMI AIVGSRNASG
AGLKFASELA RDLGAAGFVV ISGLARGVDQ AAHRASLANG TVAVLAGGHD KIYPPEHEDL
LLDIVEARGA AISEMPLGHV PRGKDFPRRN RLISGAAVGV AVIEAAYRSG SLITARRAAD
QGREVFAVPG SPLDPRAAGT NDLIKQGATL ITSADDIIQA VAPIMDRPVE LPGREPEHPA
PASEPDASHR GRIVNLLGPS PIGIDDLIRL SGIPPAVVRT VLLELELAGR LDRHGGGLVS
LL