Gene RPB_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2007 
Symbol 
ID3909513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2281414 
End bp2283099 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content71% 
IMG OID637883901 
ProductDNA repair protein RecN 
Protein accessionYP_485626 
Protein GI86749130 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.518502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCGC GCCTGTCGAT CCGCGATATC GTGTTGATCG AGCGGCTCGA TATCGAGTTC 
TCCCGTGGCC TCGCCGTGCT CACCGGCGAG ACCGGGGCGG GCAAATCGAT CCTGCTCGAT
GCGTTTGCGC TGGCGCTCGG CGGCCGCGGC GATGCCGCGC TGGTCCGCCA CGGCGCCGAA
CACGGCCAGG TCACCGCGAC CTTCGACCTC GCCAAAGGCC ATCCCGCCTT CGGCATTCTG
TCCGCCAACG GGCTCGACGA CCGTGAGATC GAGGATTCCG GTGAGCTGAT CCTGCGCCGG
ATCCAGCTCG CCGACGGCCG CACCCGCGCG TTCATCAACG ACCAGTCGGT CAGCGTGCAG
ACCCTGAAAT CGGTCGGCGC CACGCTGGTC GAGATCCACG GCCAGCACGA CGAGCGCGCG
CTGGTCGACG CCGCCACCCA TCGGCGGCTG CTCGATGCCT TCGCCGGGCT GGAGAAGGAC
GTCGCCGCGC TGGAGACGCT GTGGGAGGGT CGCCGCAGCG CCCGCGCCGC GCTCGACGCC
CACCGCGCCG GGATGGAGCG GGCCGCGCGC GAGGCGGATT ATCTGCGCCA CGCCTCCGAC
GAATTGAAGA AGCTGGCGCC GCAGGACGGC GAGGAGACCG TGCTGGCCGA GCGCCGCTCC
GTGATGATGC AGGGCGAGAA GATCGCCTCC GATCTGCGCG AGGCGCAGGA CGCGGTCGGC
GGCCATCATT CGCCGGTGGC GGCGCTGGCC GCCGCGGTGC GCCGGCTGGA GCGTCGCGCC
GGCTCCGCGC CGCAACTGGT CGAGCCCGCG GTGCGGGCGA TCGACGCCGC CATCAATGCG
CTGGAGGAAG CCGACCAGCA TCTCAACGCC GCACTCGCCG CCGCCGATTT CGACCCGCTC
GAACTGGAGC GGATCGAGGA GCGGCTGTTT GCTTTACGCG CGGCAGCGCG GAAGTATTCG
ACGCCGGTCG ATGGGCTCGC CGCACTCGCT ACGCAATACG CCGCCGACGT GGCGCTGATC
GACGCCGGCG CCGACCGGCT GAAGGCGCTG GAGAAGGCCG CCGGCGACGC CGATGCGCGC
TACGGCGCCG CCGCGGCGAA GCTCTCCGCA TCGCGCAGCA AGGCCGCCGA CAAGCTCAAC
AAGGCGGTCA ATGGCGAACT CGCGCCGCTC AAGCTCGAGC GCGCCAAATT CATGACCCAG
GTCGAGGCCG ATCCGGCGAC GCCGGGGCCG CAGGGCATCG ACCGCGTCGA ATTCTGGGTG
CAGACCAACC CCGGCACGCG GCCCGGCCCG ATGATGAAGG TCGCCTCCGG CGGCGAGCTG
TCACGCTTCC TGCTGGCGCT GAAAGTGGTG CTGTCGGACA AGGGCTCGGC GCCGACGCTG
GTGTTCGACG AGATCGACAC CGGGGTCGGC GGCGCGGTCG CCGATGCGAT CGGCGGGCGC
TTGGCGCGGC TCGCCACCAA GGTGCAGGTG ATGGCCGTGA CCCACGCCCC GCAAGTCGCC
GCCCGCGCCG ACCAGCATCT GCTGATTTCC AAAGCCGCGC TCGACAAGGG CAAACGCGTC
GCCACCCGCG TCGCCGCCCT GGAACAGGAC CACCGCCGCG AAGAAATCGC GAGGATGCTG
GCGGGCGCCG AGATCACCGC CGAGGCAAGG GCGGCGGCGG ACCGGCTGAT CAAGGCGGCG
GGGTGA
 
Protein sequence
MLSRLSIRDI VLIERLDIEF SRGLAVLTGE TGAGKSILLD AFALALGGRG DAALVRHGAE 
HGQVTATFDL AKGHPAFGIL SANGLDDREI EDSGELILRR IQLADGRTRA FINDQSVSVQ
TLKSVGATLV EIHGQHDERA LVDAATHRRL LDAFAGLEKD VAALETLWEG RRSARAALDA
HRAGMERAAR EADYLRHASD ELKKLAPQDG EETVLAERRS VMMQGEKIAS DLREAQDAVG
GHHSPVAALA AAVRRLERRA GSAPQLVEPA VRAIDAAINA LEEADQHLNA ALAAADFDPL
ELERIEERLF ALRAAARKYS TPVDGLAALA TQYAADVALI DAGADRLKAL EKAAGDADAR
YGAAAAKLSA SRSKAADKLN KAVNGELAPL KLERAKFMTQ VEADPATPGP QGIDRVEFWV
QTNPGTRPGP MMKVASGGEL SRFLLALKVV LSDKGSAPTL VFDEIDTGVG GAVADAIGGR
LARLATKVQV MAVTHAPQVA ARADQHLLIS KAALDKGKRV ATRVAALEQD HRREEIARML
AGAEITAEAR AAADRLIKAA G