Gene Rpal_0078 details

Gene Information

Locus tag: Rpal_0078
Symbol:
ID: 6407721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 83407
End bp: 86556
Gene Length: 3150 bp
Protein Length: 1049 aa
Translation table: 11
GC content: 69%
IMG OID: 642709987
Product: double-strand break repair protein AddB
Protein accession: YP_001989116
Protein GI: 192288511
COG category: [L] Replication, recombination and repair
COG ID: [COG3893] Inactivated superfamily I helicase
TIGRFAM ID: [TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type
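
The coordinates and lengths listed above agree with each other, and a short calculation shows how. The sketch below assumes 1-based, inclusive coordinates (an assumption, though it is the convention that reproduces the listed gene length) and a CDS ending in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(83407, 86556)
print(length_bp)                  # 3150
print(protein_length(length_bp))  # 1049
```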


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.160312
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCACCTTT TCAGCGTTCC ATCTTCGGCG CCGTTCTTGC GCACGACCAT CGCGGCCCTG 
GTCGACGGCC GGCTGGTCGA GGGCTTCGAT GTGCGCACCG CGCCAGAGCG GCTCGCCGAC
GCCACGCTGT ATCTGCCGAC CCGGCGCGCC GGCCGGATGG CGCGCGAGAT CTTCTTGGAC
GTGCTCGGCG CTGACGCCGT GCTGCTGCCG CGGATCGTCG CGCTCGGCGA TATCGACGAA
GATGAGCTGG CATTCGCGCA GGCTGCGTCG GGACTCGCCG ATCTGGAAAT TCCCCCAGCG
CTCGACGGGC TGCCGCGCAG GTTGTTGCTG ACGCAGCTGA TCGCGAGCTG GGCGGCGCGG
CTGAAGCCGG ACGATCCGAC ACAGGCGCCA CTGGTTGTCG GTGGACCGGC CTCGGCGCTG
GCGCTCGCCG ACGATCTGGC GCGGCTGATC GACGACATGT CGACCCGGGG CGTCGATTGG
TCGGCGCTCG AGACACTGGT GCCCGATGCG TTTGATCGCT ACTGGAGCCT GACGCTCGAT
TTCCTCAAGA TCGCCGGCCA CGCCTGGCCG CAATATCTGA AAGAGACCGA CCGGATCGAA
CCGGCCGCGC GGCGCGACCG GCTGATCGAT GCCGAGGCGC AGCGGCTCGC GACCCAGCGC
AGCGGGCCGG TGATTGCAGC CGGCTCGACC GGCTCGATGC CGGCGACCGC GAAGCTGCTC
ACCGCGATCG CGCGTCTGCC GCACGGCGCC GTGGTGCTCC CCGGCCTCGA CATCGATCTC
GACGACGCGG CGTGGGATCT GATCGGCGGC AGCCGGGACC GGCAGGGCAA GCTCGTCACG
CCCCCATCGC CGAACCATCC GCAATACGCC ATGCATGGCC TGCTCGACCG GATGGGCGTG
ACACGTCGGG ATGTGCAGCA GCTCGGCACG CCCGCTCGGC ACGGCCGCGA ACTCCTCTCC
TCCGAGGCGC TGCGGCCGTC ATCGGCGACG GCGGTGTGGC ACGACCGGCT GAAGGATGCC
GAGGTCGATC GTCTGATCGG CGAAGGTACG GACGGACTGA CGCTGATCGA AGCGCCGAAT
TCGGAAGCCG AGGCGTTGGC GATCGCGGTG GTGCTGCGCG AGGCCCGCGA GCAGGGCAAG
ACCGCGGCGC TGGTCACGCC GGATCGCGCG CTGGCGCGCC GCGTCGTCGC CGCGCTCGGC
CGCTGGCATC TGCCGGTCGA CGATTCCGGC GGCGACTCGC TGATGGAGAC GCAGCCCGGC
ATCTTCGCAC GGCTCGCCGC CGAGACCGCG CTCGACGGAT GCGAACCGGC GACGCTGCTG
GCGCTGCTCA AGCATCCGCT GCTTCGGCTC GGACGCACGG GCTATGGCTG GCGCGGCGCG
ATCGAGACCT TGGAGCTGGC GCTGCTGCGA GGCCCCCGGC CATCCGCCGG CTGCGATGGG
CTCGCCAAGG AATTTGCGAT CTTCCGGGGC GAGCTTGCCA AGCTGAAGCG CGGCGAGGCC
AGCGCGCTGC ATCCGTCGGA GCCGCGCACG CGATTGTCCG AACCGAGCCT CGATGAAGCG
CAGGACCTCA TCGCCCAGCT CAAGGCCGCG CTCGCACCAT TGGAAACCGT TGGGCCGAAT
CCGCTCGATC TGTGCGAGAT CGGCGCACGG CATTGCGCGG CGCTGAAGGC GCTGACGACG
GATGACGAGG GCATCGCCGA AGTGTTCGAA GGGCCGCAAG GCTCGGCACT GCTGCGCGCG
TTCGACGATC TCGCCGCGGT CGGACCGGCG AGCGGCGTAT TGGTCGCGGC GCGCGACTAT
GGCGAAGTGT TCGAGACCGC GTTCGGCGAT CGCGTCGTCC GCCGTCCCGA ACTCGCCGAG
GCGCCGATCC GGATCTACGG CCCGCTCGAA GCGCGACTGA CACAGCAGGA CCGGGTTGTG
CTCGGCGGAT TGGTCGAAGG CGTGTGGCCG CCGGCGCCGC GGATCGATCC GTGGCTGAGC
CGGCCGATGC GGCACGAACT TGGGCTCGAT CTGCCGGAGC GCCGCATCGG CCTGTCGGCG
CACGATTTCG CCCAGGCACT CGGCGCCGAT GAGGTGTTCA TCACCCATGC CGCCAAGGTC
GGCGGTGCGC CCGCAGTGGC GTCGCGCTTC CTGCACCGTC TCGAAGCGGT CGCCGGCGAG
GATCGCTGGA AGGCGATGAA AGCGCGCGGG CAGATCTATC TGGACTACAC GCACGAACTC
GATCGCCCCG AGAGCGTCAC GCCGATCGCG CAGCCTGCAC CGAGGCCACC GCGGATCGCG
CGGCCGCTGA AACTGCCGGT GACCGCGATC GAAGATTGGC TGCGCGATCC CTATACGATC
TACGCCAAGT ACATCCTGCG CCTGTCGCCG CTCGATCCGG TCGACATGCC GCTGTCGGGC
GCCGATCGCG GCTCGGCGAT CCATGCGGCG CTGGGCGAAT TCACCGAGCG CTACCAGGAC
GCTCTGCCGG ACGATCCGGT GACGGCGCTG CGCCAAATCG GCCAGAAACA TTTTGCGCCG
CTGATGGATC ATGCCGAGGC GCGCGCGCTG TGGTGGCCGC GCTTCTTGCG GATTGCCACC
TGGTTCGCCG CGTGGGAGCA GCAGCGCCGC ACCGGCGTCG TGCAGGTACA GGCGGAGCGT
CGCGGCACGC TGATGATCCC GCTTGGTGGA GAGCGCAATT TCGAACTGTC GGCGCGCGCC
GACCGCATCG AACGGCGCGA TGATGGCAGC TACGCGATCC TCGACTACAA GACCGGCCAT
CCGCCGACCG GCAAGCAGGT GCGGATGGGG CTGTCGCCGC AGCTCACATT GGAAGCGGCG
ATTCTGCGCG ATGGAGGCTT TGAGGATATT CCGGCGGCCG CCTCAGTGAG CGAGCTCACT
TACGTCAAGC TCAGCGGCAA CACGCCGCCC GGCGATGAGC GCGTGCTTGA GCTCAAGATC
GAGCGCAAGG ATGAGCCGCA GCATCCCGAC GATGCCGCCG ATGAAGCGCG CGCCAAGCTG
GAAGGATTAG TCCGCCGCTT CGAAGACGAG AAGCAGTCCT ATCGCTCGCT GGTGCTGTCG
ATGTGGTCGC AGCGCTACGG CACCTATGAT GATCTGGCGC GGATCAAGGA ATGGTCGGCG
GCGGGCGGTC GCGGAGACGC GTCGTCATGA
 
Protein sequence
MHLFSVPSSA PFLRTTIAAL VDGRLVEGFD VRTAPERLAD ATLYLPTRRA GRMAREIFLD 
VLGADAVLLP RIVALGDIDE DELAFAQAAS GLADLEIPPA LDGLPRRLLL TQLIASWAAR
LKPDDPTQAP LVVGGPASAL ALADDLARLI DDMSTRGVDW SALETLVPDA FDRYWSLTLD
FLKIAGHAWP QYLKETDRIE PAARRDRLID AEAQRLATQR SGPVIAAGST GSMPATAKLL
TAIARLPHGA VVLPGLDIDL DDAAWDLIGG SRDRQGKLVT PPSPNHPQYA MHGLLDRMGV
TRRDVQQLGT PARHGRELLS SEALRPSSAT AVWHDRLKDA EVDRLIGEGT DGLTLIEAPN
SEAEALAIAV VLREAREQGK TAALVTPDRA LARRVVAALG RWHLPVDDSG GDSLMETQPG
IFARLAAETA LDGCEPATLL ALLKHPLLRL GRTGYGWRGA IETLELALLR GPRPSAGCDG
LAKEFAIFRG ELAKLKRGEA SALHPSEPRT RLSEPSLDEA QDLIAQLKAA LAPLETVGPN
PLDLCEIGAR HCAALKALTT DDEGIAEVFE GPQGSALLRA FDDLAAVGPA SGVLVAARDY
GEVFETAFGD RVVRRPELAE APIRIYGPLE ARLTQQDRVV LGGLVEGVWP PAPRIDPWLS
RPMRHELGLD LPERRIGLSA HDFAQALGAD EVFITHAAKV GGAPAVASRF LHRLEAVAGE
DRWKAMKARG QIYLDYTHEL DRPESVTPIA QPAPRPPRIA RPLKLPVTAI EDWLRDPYTI
YAKYILRLSP LDPVDMPLSG ADRGSAIHAA LGEFTERYQD ALPDDPVTAL RQIGQKHFAP
LMDHAEARAL WWPRFLRIAT WFAAWEQQRR TGVVQVQAER RGTLMIPLGG ERNFELSARA
DRIERRDDGS YAILDYKTGH PPTGKQVRMG LSPQLTLEAA ILRDGGFEDI PAAASVSELT
YVKLSGNTPP GDERVLELKI ERKDEPQHPD DAADEARAKL EGLVRRFEDE KQSYRSLVLS
MWSQRYGTYD DLARIKEWSA AGGRGDASS
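
Both sequences are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that clean-up, together with a GC-fraction helper and a stop-codon check. The helper names are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Stop codons of translation table 11 (the table listed for this gene).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases of a cleaned sequence form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


first_line = "ATGCACCTTT TCAGCGTTCC ATCTTCGGCG CCGTTCTTGC GCACGACCAT CGCGGCCCTG "
last_line = "GCGGGCGGTC GCGGAGACGC GTCGTCATGA"

print(len(clean_sequence(first_line)))            # 60
print(ends_with_stop(clean_sequence(last_line)))  # True
print(gc_content("ATGCACCTTT"))                   # 0.4
```

Applying `clean_sequence` and `gc_content` to the complete gene sequence gives a value that can be compared with the GC content listed under Gene Information.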