Gene RPD_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0203 
Symbol 
ID4020661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp229283 
End bp232432 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content69% 
IMG OID637960382 
Producthypothetical protein 
Protein accessionYP_567344 
Protein GI91974685 
COG category[L] Replication, recombination and repair 
COG ID[COG3893] Inactivated superfamily I helicase 
TIGRFAM ID[TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.587798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTTT TCAGTGTTCC GCCCTCCGCG CCGTTTCTGC GCACGACGAT CAAGGCTCTG 
GTCGATGGCG AACTGATCGC GGGGTTCGAC GCGCGCGCAA AGCCCGAGCG CCTTGCCGAA
GCCACGCTGT ATCTGCCGAC CCGCCGCGCC GGTCGCCTCG CGCGCGACAT CTTCCTCGAT
GTCCTGGGCG CCGATGCGGT GTTGCTGCCG CGCATCGTCC CACTCGGCGA TGTCGACGAG
GACGAACTGG CGTTCGCACA GGCCGCAACG GGCGCGGCCG ATCTCGACAT TCCGCCGGCG
CTCGAAGGGC TGCCACGGCG CCTGCAGCTG GCGCAATTGA TCGCGGTCTG GGCAAAAGGG
CTGCGGTCCG GCGATCCGCA GCAGTCGCCA TTGGTGGTCG GCGGCCCGGC GTCGACATTG
GCGCTCGCCG ACGATCTGGC GCGGCTGATC GACGACATGG CGACGCGCGG CGTCGACTGG
GCCGCGCTCG ACTCGCTGGT GCCGGATGCG TTCGATCGCT ACTGGCGACT GACGCTGGAT
TTCCTCAAGA TCGCCGGCCA GTGGTGGCCG CAGCATCTGC GCGAGAGCGA CCAGATCGAA
CCGGCGGCGC GGCGCGACTT GCTGATCGAA GCGGAAGCAG CGCGGCTTGC GGCGCATCGC
GGCGGTCCGG TGATCGCGGC AGGCTCGACC GGCTCGATGC CGGCGACCGC GAAGCTGCTG
CACGCCATCG CCCGATTGCC GAACGGCGCC GTTATCCTGC CGGGGCTGGA CACCGAACTC
GACGAACAGG CGTGGCGACT GATCGGCGCC GTGCGCGACA AGCAGGGCCA ATTGATCTCG
CCGCCGTCGC CGAATCATCC GCAATTCGCG ATGCACGGCC TGCTGACCCG GATGGGACTC
GAGCGGCGCG AAATTGTCCG GCTCGGCGAA GCCGTGCGCC ATGGCCGCGA AGTGCTGGCC
TCCGAGGCGA TGCGGCCGTC GGCGGCGACG GCGTCGTGGC ACGAGCGGCT CGCCGATCCC
GAGGTCGATC GACTGATCGA ACAAGGCGTC AATGGCCTGA CGGTGATCGA GGCGCCGAAT
TCCGAGATCG AGGCGCTGGC GATCGCGGTG GCGCTGCGCG AGGCGCGCGA ACGCGGCCAA
TCCGCCGCAT TGGTGACGCC GGACCGCGCG CTGGCCCGGC GCGTGGTCGC CGCGCTCGGC
CGCTGGAATC TGCCGGTCGA TGATTCCGGC GGCGATTCGC TGATGGAGAC CCAGGCCGGC
ATCTTCGCCC GGCTCGCCGC TGAAGCGGCC CTGCATGGCT GCGAGCCGGC GACGCTGTTG
GCGCTGCTGA AGCATCCGTT GCTGCGGCTC GGCCGCGCCG CGGGCGGCTG GCGGCACGCC
ATCGAGACGC TGGAACTGGC GCTGCTGCGC GGAACGCGTC CGGCTGCCGG CAGCGAAGGC
CTGGTCAAGG AGTTCGCGAA ATATCGCGCC GAACTGACCA GGCTGAAGCG CGGGGAACTC
AGCGCGCTGC ATCCCTCGGA GCCGCGCGCG CGGCTCGGCG ACGAGAGCCT CGACGCCGCG
CAGGAGCTGA TCGAGGCATT GCGCGCGGCG CTGGCGCCGT TGGAGACGGT GGGCGCAGAG
CCGCTCGATC TGTGCGCCTT CGGGCGTCGA CACCGCGATG TGCTGATCGC GCTGTCGATC
GATCACGACG AGATCGCGGT CGCTTTCGAA GGATCGCAGG GATCGGCCCT GCTTAAAGCG
TTCGACGATC TCGCGGCGGT CGAGCCGCTG AGCGGCGTGC TGGTGCCGCC ACACGACTAC
GCGGACGTGT TCGAAACCGC GTTCAGCGAC CGCATCGTGC GACGGCCCGA ACTCGCCGGC
GCTGCGCTGC GCATCTACGG CCCGCTCGAA GCGCGGTTGA CGCAGCATGA TCGCGTCATT
CTCGGTGGCC TGGTCGAAGG CGTCTGGCCG CCGGCGCCGC GGATCGATCC GTGGCTGTCG
CGGCCGATGC GCCATGACCT CGGCCTCGAC CTGCCGGAGC GGCGGATCGG CCTGTCGGCG
CACGACTTCG CGCAACTGCT CGGCGCCGAT GAGGTGATCC TCACTTACGC CAACAAGGTC
GGCGGCGCGC CGGCGGTGGT GTCGCGCTTC CTGCACCGGC TCGAAGCCGT GACCGGCAAG
GCGCGCTGGA GCGCCGTCAA GGCGCGCGGG CAAAGCTATC TCGACTACGC GCAGGCGCTC
GATCGTCCCG AACAGGTCAC GCCGATCGCC CAGCCGGCGC CGAGGCCGCC GCGCGAGGCG
CGGCCGCTGA AGTTGTCGGT CACCGCGATC GAGGACTGGC TGCGTGATCC GTACACGATC
TACGCCAAGT TCATTCTCGG CCTGTCGGCG ATCGATCCGG TCGACATGCC GCTGTCCGCG
GCGGATCGCG GCTCGGCGAT CCACGAAGCG CTCGGCGAAT TCACCGAGCT GTTCCCCGAC
GAGCTGCCCG ACGATCCGGC GCAGGTGCTT CGTGAGATCG GCGAAAAGCA CTTCGCGCCG
CTGATGGCTC ATCCGGAAGC GCGCGCGCTG TGGTGGCCTC GCTTCGCCCG CATCGCCGCG
TGGTTCGGCA ATTGGGAGCA GGCGAGGCGC GCCGACGGAC TCCGCGTGTT TGCCGAGCGC
GACGGTAGCC TCTCCATCCC GCTCGACGGC GGTCGCAACT TCATCCTGTC CGCGCGCGCC
GATCGCATCG AGCATCGCGC CGACGGCAGC TTCGCGATTT TGGACTACAA GACCGGAAAT
CCGCCGACCG GCAAGCAGGT GCGGATGGGG CTGTCGCCGC AACTCACGCT GGAAGCCGCA
ATCCTGCGCG ACGGCGGCTT CGAGGGCATC GACGCCGGTT CGTCGGTGAG CGAACTTACT
TACGTCAAGC TCAGCGGCAA CTCGCCGCCC GGCGACGAAT GCGTGCTGGA ATTGAGGATC
GAGCGCAAGG ACGAGCGGCA GTCTCCCGAC GACGCGGCCG CCGAAGCGCG TAGCAAGCTC
GAAACCCTGA TCCGGCGCTT CGACGACGAG GCGCAGCCGT ATCACGCGCT GGTGCTGTCG
ATGTGGTCGC GTCGCTATGG CCGCTACGAC GATCTGGCGC GGATCAAGGA ATGGTCGGCC
GCCGGCGGCG GTGCGGAGGA TCGGCTGTGA
 
Protein sequence
MRVFSVPPSA PFLRTTIKAL VDGELIAGFD ARAKPERLAE ATLYLPTRRA GRLARDIFLD 
VLGADAVLLP RIVPLGDVDE DELAFAQAAT GAADLDIPPA LEGLPRRLQL AQLIAVWAKG
LRSGDPQQSP LVVGGPASTL ALADDLARLI DDMATRGVDW AALDSLVPDA FDRYWRLTLD
FLKIAGQWWP QHLRESDQIE PAARRDLLIE AEAARLAAHR GGPVIAAGST GSMPATAKLL
HAIARLPNGA VILPGLDTEL DEQAWRLIGA VRDKQGQLIS PPSPNHPQFA MHGLLTRMGL
ERREIVRLGE AVRHGREVLA SEAMRPSAAT ASWHERLADP EVDRLIEQGV NGLTVIEAPN
SEIEALAIAV ALREARERGQ SAALVTPDRA LARRVVAALG RWNLPVDDSG GDSLMETQAG
IFARLAAEAA LHGCEPATLL ALLKHPLLRL GRAAGGWRHA IETLELALLR GTRPAAGSEG
LVKEFAKYRA ELTRLKRGEL SALHPSEPRA RLGDESLDAA QELIEALRAA LAPLETVGAE
PLDLCAFGRR HRDVLIALSI DHDEIAVAFE GSQGSALLKA FDDLAAVEPL SGVLVPPHDY
ADVFETAFSD RIVRRPELAG AALRIYGPLE ARLTQHDRVI LGGLVEGVWP PAPRIDPWLS
RPMRHDLGLD LPERRIGLSA HDFAQLLGAD EVILTYANKV GGAPAVVSRF LHRLEAVTGK
ARWSAVKARG QSYLDYAQAL DRPEQVTPIA QPAPRPPREA RPLKLSVTAI EDWLRDPYTI
YAKFILGLSA IDPVDMPLSA ADRGSAIHEA LGEFTELFPD ELPDDPAQVL REIGEKHFAP
LMAHPEARAL WWPRFARIAA WFGNWEQARR ADGLRVFAER DGSLSIPLDG GRNFILSARA
DRIEHRADGS FAILDYKTGN PPTGKQVRMG LSPQLTLEAA ILRDGGFEGI DAGSSVSELT
YVKLSGNSPP GDECVLELRI ERKDERQSPD DAAAEARSKL ETLIRRFDDE AQPYHALVLS
MWSRRYGRYD DLARIKEWSA AGGGAEDRL