Gene RPD_3934 details


Gene Information

Locus tag: RPD_3934
Symbol:
ID: 4024450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4373401
End bp: 4374600
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 64%
IMG OID: 637964138
Product: transposase, IS204/IS1001/IS1096/IS1165
Protein accession: YP_571056
Protein GI: 91978397
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
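
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4373401, 4374600)
print(length, protein_length_aa(length))  # prints "1200 399"
```

Under these assumptions the values agree with the Gene Length and Protein Length fields listed in the record.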


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCGAG CCCTACGTCC ATCGGCGCTG ATCCCTCGTG GGTTTGATGT CGAGAGCGCC 
ATCTGCGACG GCACCACGAC CGTGATCACC GTTCGCTCCA CCAGCGACAC GAGTCGCTGC
CCGGGGTGTG GAGAAAGCTC GAGACGAATC CACAGCCGAT ATCGCCGATG CATTGCGGAT
TTGCCGCTGG CAGGGCGGAG GGTTCGGCTT GTGGTCGCGG CGCGGCGATT TCGCTGCGAT
GCAGTTCTGT GCGGCCGACG TGTCTTCACG GAACGCTTCG TTGACGGCGT CCTGGCGCCC
TGGGCGCGAC GAACAGCTCG ACTCGACTAT GTCGTCCATC AGCTTGGCCT GGCATTGGGC
GGGCGCCCGG CGGCAACTAT CGCCCGCCGA CTTATGCTGC CCGTGAGCAA TGATACCTTG
CTCCGTGTCG TTCGGAGGCG CGGCTGCCCA CTGTTTCCTG CACCAAGCGT TGTCGGTATT
GACGATTGGG CCTGGCGCCG CAATCAGCGA TACGGAACGA TCATTTGCGA CCTTGAACGC
CGGCGGCCGA TCACCCTCCT TCCGGACAGG GAGGCCGCCA CCGCCCAAGC CTGGCTCGCA
GGGCAGCCGC AGATCGCTGT GGTCGCACGC GACCGCGGCG GCAGCTACGC TCTTGCCGCG
GCCAAGGCGC TACCACACGC CACCCAGGTC GCCGATCGCT GGCATCTCAT GGAGAATGCC
AGCCACGCGT TTCTCGATGC GGTTCGCAAA TCCATGCGAC AGATTCGCGC CGCGGTCGGC
GCCGCCACGA TCAATCCGGG CCTGCTCACC GCCGCCGAGC GCCTCCAATA CGAAGGCTTT
CTCCGGCGGG AGGACGCCAA TGCGGCAATC CTCAAGCTGG TCGCGACGGG CACCTCCATC
AAAGAGATCG TACGGCTCAG CGGACATAGC CGGGGCCTGG TCCGTCGCAT TCTTCGGGGG
CAACGAACCG ATGTGTTCCG AATTCGCGAA AGCTCCCTCG AACCTCATCT GCAATGGCTT
GATGCGCAAT GGCTTGCGGG TCATCGCAAT GGTGCCGAAC TATGGCGCCG CCTCAAGAGC
CTGGGATTCA GAGGCTCACT GCGGGTCGTT GCAGAATGGG CGACACATCG CCGGCGTGTA
GAAACGGTCG ATGACCAGGC GCTACATCGG GTGCCGTCGG CCAGAACCAT CGCCGGCTGA
 
Protein sequence
MQRALRPSAL IPRGFDVESA ICDGTTTVIT VRSTSDTSRC PGCGESSRRI HSRYRRCIAD 
LPLAGRRVRL VVAARRFRCD AVLCGRRVFT ERFVDGVLAP WARRTARLDY VVHQLGLALG
GRPAATIARR LMLPVSNDTL LRVVRRRGCP LFPAPSVVGI DDWAWRRNQR YGTIICDLER
RRPITLLPDR EAATAQAWLA GQPQIAVVAR DRGGSYALAA AKALPHATQV ADRWHLMENA
SHAFLDAVRK SMRQIRAAVG AATINPGLLT AAERLQYEGF LRREDANAAI LKLVATGTSI
KEIVRLSGHS RGLVRRILRG QRTDVFRIRE SSLEPHLQWL DAQWLAGHRN GAELWRRLKS
LGFRGSLRVV AEWATHRRRV ETVDDQALHR VPSARTIAG
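
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how that could be done, with a GC-fraction helper and a stop-codon check. The helper names are hypothetical. The stop codon set (TAA, TAG, TGA) is the one used by translation table 11. The two sample strings are the first and last lines of the gene sequence as shown above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


first_line = clean_sequence(
    "ATGCAGCGAG CCCTACGTCC ATCGGCGCTG ATCCCTCGTG GGTTTGATGT CGAGAGCGCC "
)
last_line = clean_sequence(
    "GAAACGGTCG ATGACCAGGC GCTACATCGG GTGCCGTCGG CCAGAACCAT CGCCGGCTGA"
)

print(first_line.startswith("ATG"))  # gene begins with an ATG start codon
print(ends_with_stop(last_line))     # gene ends with a TGA stop codon
```

Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field reported in the Gene Information section.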