Gene RPD_3950 details

Gene Information

Locus tag: RPD_3950
Symbol:
ID: 4024466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4393639
End bp: 4396758
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 66%
IMG OID: 637964152
Product: conjugal transfer relaxase TraA
Protein accession: YP_571070
Protein GI: 91978411
COG category: [L] Replication, recombination and repair
COG ID: [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member
TIGRFAM ID: [TIGR02768] Ti-type conjugative transfer relaxase TraA
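
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. The helper names span_length and coding_codons are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 4393639
end_bp = 4396758
gene_length_bp = 3120
protein_length_aa = 1039


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate interval (assumed 1-based)."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Compare the computed values with the fields listed in the record.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the listed coordinates, gene length, and protein length agree with one another.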


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.630291
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGACGGC GTAGTCCCTC CCGAGATTTT TTCGAGGGAG GGCGCGCTTA TACGTCGTTC 
CGACGTACGC TCCGCCGTGG CAATGATCTG GTCGCGATGG CGATCTATCA TCTTCACGTC
AAGGTCATTG GCCGCAAGGC CGGCTCCAGC GCGGTGGCGT CGGCCGCCTA CCGCTCGGGC
TCGCGGTTGC GTGACGAGCG TCTGGACCGC AGCCATGATT TTTCCGCCAA GCGCGGCGTC
GTTCATTCCG AGGTCATGCT GCCGGAGAAT GCGCCGGAGT CGTGGAGCGA CCGTGAACGG
CTCTGGAACG ATGTCGAAGC CGTCGAGATC CGCAAGGATG CGCAGCTTGC CCGCGAGGTC
GAGTTCGCCA TTCCGCGCGA GATGACACAG GTGCAGGGCA TCGAACTGGC CCGCGACTTC
GCGCAGTCGG AGTTCGTGGA CCGGGGCATG ATCGCCGATC TCAATGTGCA TTGGGATATG
GCCGAGGATG GAAGCCCCAA ACCGCATGCG CATATCATGC TGACCATGCG CGCCGTGGAC
GAGAATGGTT TCGGCCCGAA GGTGAGGGAG TGGAACCGCA CCGAGATAGT CGAGCGCTGG
CGGGAGCGCT GGGCCGAGAT CGCCAATCAG CGGCTGGCCG AACTCGACAT CGACGCTCGG
ATCGACCATC GCAGCTTCGA GGCGCAGGGC ATCGGGCTGG AGCCGCAAAG TCAGATCGGC
GCGCCCGCGC AGCGGATCGA AAGCCAAAGG ATCGAGGGGC AAGGGATCGA GGCCGCCGCC
AATGTGGCCG ACCGCGCGGA GATGCACCGC GAGATCGCGC GCGGCAACGG TGAGCGGATT
ATTGCCGATC CTTCCATCGC ATTGGACGCG ATCACGCATC AGCAATCGAC CTTCACGCAG
CGCGACATGG CAAAGTTTGC GCATCGCCAC AGCGACGGGA TCGACCAGTT CAACGCCGTG
ATCGGCGCGA TGCGCAGCGC GCCCGATCTG GTCGAACTCG GACAGGATGG TCGCGGCGAG
GACCGCTTCA CCACCCGCGA CATGATCGAG GCCGAGCAGC GCCTGCACCA CGCCGCCAAA
CTCATGGCCG AGCGGGAACG CCATGAGGTG AACGACGCTG ACAGGCAAGC GGCTCTGACG
CGTGCGGAAG CGCGCGGCCT TGTGCTGTCC GGCGAGCAGG CCGACGCGCT GGCGCATGTC
ACGGACGGGC GCGATCTCGG GATCGTGGTC GGCTATGCCG GAGCGGGAAA GAGCGCGATG
TTGGGCGTGG CGCGGGAGGC ATGGGAAGCG GCGGGTTACG AGGTTCGGGG CGTGGCGTTG
TCCGGCATCG CCGCCGAGAA TCTGGAAAGC GGATCGGGCA TTGCGTCACG CACCATTGCC
AGCCTGGAAC ATGGCTGGGG CCAGGGCCGC GACCTGCTCA GCGCGCGCGA TGTGTTGGTG
ATCGACGAGG CCGGCATGGT CGGCACGCGG CAATTGGAGC GCGTGCTGTC CCATGCCGCC
GAGGCTGGCG CCAAGGTCGT GCTGGTCGGC GATCCGCAAC AGCTCCAAGC AATAGAAGCC
GGTGCGGCGT TCCGCTCCAT TCACGAACGC CACGGCGGCG TCGCGATCGG CGAGGTACGC
CGCCAACGCG AGGAATGGCA GCGCGACGCC ACGCGCGATC TGGCGACTGG CAAGACCGGC
GCAGCGATCC ATGCCTATGA CGAAAACGGC ATGGTGCATG CTGCGGCGTC CCGCGAGCGG
GCACGGGGCG AACTGATCGA CCGCTGGGAC CGCGACCGGC AAGCATCGCC GGACCGCAGC
CGCATCATCC TCACCCATAC CAACGACGAG GTTCGCGCGC TCAACGAAGG GGCGCGGGAA
CGGATGCGGG CCGCGGGTGA TCTCGGTGAG GATGTGCATG TCACGGTCGA ACGCGGCGCG
CGCAGCTTCG CCACTGGGGA TCGCGTCATG TTCCTGCAAA ACGAACGCAG TCTTGGCGTG
AAGAACGGCA CGCTCGGCGC CATTGAGCAG GTCAGTCAGC AATCCATATC CGTGCGCATT
GATGACGGGC GCAGCGTCTC GTTCGACCTG AAGGATTACA ACCGCATCGA CCACGGCTAT
GCTGCGACCA TCCACAAGGC GCAGGGCATG ACGGTGGACC GCACCCATGT GCTGGCAACG
CCGGGCATGG ATGCCCATAG CAGCTATGTC GCGCTGTCGC GACATCGCGA CGGCATGGAC
CTGCATTATG GGCGTGACGA CTTCGCCAGC CAGGATCGGC TGGTCCGCAC CCTGTCGCGC
GACCGCGCTA AGGACATGGC GTCGGATTAC GAGCGCGCCG ATCCGCTGCA GGGCTACGCA
GAGCGGCGCG GCATCACTTT CCGCGAGCGT GTCGCCGAAA TCGTCCAGAA ACTCGTGCCG
GAGAAGCTAC GCGGTATGTT TGACGGCCTG CGCTTTCCTG GAGATGTGTC CGGCCACGAT
GGTGGACGGA GGCCAGAGCG GGAAACGCCG GAAAGGAGAA GCGCCGTGGC GGCGGTCCGG
CGAACGACGG ATACGCCGGA AAGGACCGTG GCGGCCGACA GGGACGCGGC GCTGCGCCAA
ACTCGCACCA AGGCGCTCAT TCGCCACGCC CGCGCCGTGA ATGCGATCTT CGAGATGCAG
CAGCAGGGCG GCAAGGCTAG CCCGGAGCAG GTAAAGGAAT TGCAGGAGGC CCGCAAGGAA
TTCGAGGCGG TGCGGCCCTA TGGATCGCAC GACGCCGAAG CCGCCTACAA GACGAACCCG
GAACTCGCAG AGCAAGCCGC TTCGGGCAAT CCCGCCCGCG CGATCCGCGC TCTCCAGTTC
GAAACCGAGC TGCGGACCGA CCCCGCCCGG CGCGCCGACC GTTTCGTGGA ACGTTGGCAT
AAGCTTGATC AGACCAGCCA GCACCAGTAT CAGGCGGGCG ACATGTCGGG CTACAAATCC
ACGCGGGCGG CGATGGGCGA CCTGGCCAAG AGCTTAGAGC GCGATCCGCA GCTTGAATCC
ATTCTCGCGG ACCGCAAGCA ACAACTCGGC ATCGCGTTCG AAACAGGCCG CCGGCTCGGC
CAGGAATTGG CATTCACCCA CGGCATCGAT CTCGGCAGAG GCCGGGGTAT CGGGCTCTGA
 
Protein sequence
MRRRSPSRDF FEGGRAYTSF RRTLRRGNDL VAMAIYHLHV KVIGRKAGSS AVASAAYRSG 
SRLRDERLDR SHDFSAKRGV VHSEVMLPEN APESWSDRER LWNDVEAVEI RKDAQLAREV
EFAIPREMTQ VQGIELARDF AQSEFVDRGM IADLNVHWDM AEDGSPKPHA HIMLTMRAVD
ENGFGPKVRE WNRTEIVERW RERWAEIANQ RLAELDIDAR IDHRSFEAQG IGLEPQSQIG
APAQRIESQR IEGQGIEAAA NVADRAEMHR EIARGNGERI IADPSIALDA ITHQQSTFTQ
RDMAKFAHRH SDGIDQFNAV IGAMRSAPDL VELGQDGRGE DRFTTRDMIE AEQRLHHAAK
LMAERERHEV NDADRQAALT RAEARGLVLS GEQADALAHV TDGRDLGIVV GYAGAGKSAM
LGVAREAWEA AGYEVRGVAL SGIAAENLES GSGIASRTIA SLEHGWGQGR DLLSARDVLV
IDEAGMVGTR QLERVLSHAA EAGAKVVLVG DPQQLQAIEA GAAFRSIHER HGGVAIGEVR
RQREEWQRDA TRDLATGKTG AAIHAYDENG MVHAAASRER ARGELIDRWD RDRQASPDRS
RIILTHTNDE VRALNEGARE RMRAAGDLGE DVHVTVERGA RSFATGDRVM FLQNERSLGV
KNGTLGAIEQ VSQQSISVRI DDGRSVSFDL KDYNRIDHGY AATIHKAQGM TVDRTHVLAT
PGMDAHSSYV ALSRHRDGMD LHYGRDDFAS QDRLVRTLSR DRAKDMASDY ERADPLQGYA
ERRGITFRER VAEIVQKLVP EKLRGMFDGL RFPGDVSGHD GGRRPERETP ERRSAVAAVR
RTTDTPERTV AADRDAALRQ TRTKALIRHA RAVNAIFEMQ QQGGKASPEQ VKELQEARKE
FEAVRPYGSH DAEAAYKTNP ELAEQAASGN PARAIRALQF ETELRTDPAR RADRFVERWH
KLDQTSQHQY QAGDMSGYKS TRAAMGDLAK SLERDPQLES ILADRKQQLG IAFETGRRLG
QELAFTHGID LGRGRGIGL
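
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, together with a GC-fraction helper that could be run over the full gene sequence and compared with the GC content field. The function names clean_sequence and gc_fraction are hypothetical, and only the first and last lines of the gene sequence are included here for brevity.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace and upper-case the residues."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "TTGAGACGGC GTAGTCCCTC CCGAGATTTT TTCGAGGGAG GGCGCGCTTA TACGTCGTTC"
last_line = "CAGGAATTGG CATTCACCCA CGGCATCGAT CTCGGCAGAG GCCGGGGTAT CGGGCTCTGA"

sample = clean_sequence(first_line + "\n" + last_line)
print(len(sample), round(gc_fraction(sample), 2))
```

To check the whole gene, paste all lines of the gene sequence into one string and pass it through the same two functions.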