Gene RPB_3564 details


Gene Information

Locus tag: RPB_3564
Symbol:
ID: 3911366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4083820
End bp: 4085286
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 74%
IMG OID: 637885466
Product: hypothetical protein
Protein accession: YP_487170
Protein GI: 86750674
COG category: [L] Replication, recombination and repair
COG ID: [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair
TIGRFAM ID:
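
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4083820
END_BP = 4085286


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the coordinates give 1467 bp and 488 aa, which agrees with the Gene Length and Protein Length fields.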


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.677515
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCGCCG CCAAGCGCGA CAATGCACTG GTCGTCGTCG CCTGCGATGC ATGCGCGACG 
CAGGGCGGGT TGATGCCGGG GATGCCGCTC GCCACCGCGC GGGCGATGCA TCCGTCGCTC
GACGTGATCG ATCACGATCC GCATGCCGAC GCCGCGCTGC TCGCGTCTGT CGCCGACTGG
TGCGACCGCT TCACGCCGCT GGTGGCGTTC GACGGCGCCG ACGGATTGCT GCTCGACATC
ACCGGCTGCG CGCATCTGTT CGGCGACGAG GCCGAACTGC TGCGCATGCT CACCACTGCG
CTGACGCGGC AGGGCTTTGC GGTGAGCGCG GCGATCGCCG GCACCGCGGT GGCGGCGCGG
GCACTGACCC GCGGCGCGCC GGGCAGGATC GTGGCGCCGG GCGAGGAAGC CGCGGCGGTC
GCGCCGTTGC CGGTGGCGGC GCTCGGCGTC AGCGAGGCGA TCGTGCGCGG CCTTTGCCGC
GCCGGCCTCA CCACCATCGG CGATGTGCTG GCGCGGCAGC CGTCCGAACT CGCGGCGCGG
TTCGGCGAAG CCTTCATCGC GGTGCTGCGT CAGGCGACCG GCGAGGACGA CGCGCCGATT
TCGCCGCGCA AGCCGGCGCC GGACTATGTC GTGGACAAGC GCTTTGCCGA GCCGGTCGCC
ACCACCGAGG TGATCCTGCC GACGCTGCTG GCGCTGGCGC GGCTGCTGAT CGCCGCGATG
GAACGCAGCG GCAAGGGCGC GCGGCAGCTC ACCGCCTCGT TCTTCCGCAG CGATGGGGCG
GTGCGCAGCC TTGTGGTGGA GGCCGGACAG CCGGTGACGC GGGTCGAGGT GGTGCAGCGG
CTGTTTGCGG AGCGGCTCGA TGCGCTGGCG GACCCGCTCG ATCCCGGCTT CGGCTACGAC
CTGATCCGCC TCGCCGCGAG CCGGTGCGTT GCCATCGCCG AGGCGCAGCG CGGCTTCGAC
ACCACCGCGC ACCAGGCCGA AGACGTCGCT CTGCTGGCCG ACACGCTGTC GGCGCGGCTC
GGGGCGCGGC GCGTGGTGCG CTATCTGCCG CAGAACACGC ACATCCCCGA GCGTGCGGCG
CTCGCCGTGC CGGTGCAGCA TTGCCCGCCG GACGCGGACG ATGCGCCGTG GCCGGCGCGC
GCCGACGAGC CGCCGCTGCG GCCGCTGCGC CTGCTGCAGC CGCCGGAGCC GATCGAGGTG
CTGGCCGGCG TGCCGGACGG GCCGCCGGCG CAATTCACCT GGCGCCGCGT TCTCCACCGC
GTCGCCCGCG CCGAAGGCCC GGAGCGGATC GCGATGGAGT GGTGGCGCGC CGCCGAGCCC
GGCCTGACCC GCGATTACTT CCGCATCGAG GACGAATCCG GCACGCGGTT CTGGCTGTAT
CGCGACGGCC TGTATGGCCG CGAGGTGATG CCGCAGCCGG ACGGCGGCGG CCAGCCGCGC
TGGTACATGC ACGGCCTGTT CGCGTGA
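
The GC content field reported above can in principle be recomputed from the gene sequence. The following is a rough sketch, assuming the sequence is pasted in as a string in the 10-base block layout shown here. The function name `gc_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First block row of the gene sequence above, as an example input.
    first_row = "GTGGTCGCCG CCAAGCGCGA CAATGCACTG GTCGTCGTCG CCTGCGATGC ATGCGCGACG"
    print(round(gc_percent(first_row), 1))
```

Running it over the full sequence, rather than a single row, is what would be compared against the reported 74%.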
 
Protein sequence
MVAAKRDNAL VVVACDACAT QGGLMPGMPL ATARAMHPSL DVIDHDPHAD AALLASVADW 
CDRFTPLVAF DGADGLLLDI TGCAHLFGDE AELLRMLTTA LTRQGFAVSA AIAGTAVAAR
ALTRGAPGRI VAPGEEAAAV APLPVAALGV SEAIVRGLCR AGLTTIGDVL ARQPSELAAR
FGEAFIAVLR QATGEDDAPI SPRKPAPDYV VDKRFAEPVA TTEVILPTLL ALARLLIAAM
ERSGKGARQL TASFFRSDGA VRSLVVEAGQ PVTRVEVVQR LFAERLDALA DPLDPGFGYD
LIRLAASRCV AIAEAQRGFD TTAHQAEDVA LLADTLSARL GARRVVRYLP QNTHIPERAA
LAVPVQHCPP DADDAPWPAR ADEPPLRPLR LLQPPEPIEV LAGVPDGPPA QFTWRRVLHR
VARAEGPERI AMEWWRAAEP GLTRDYFRIE DESGTRFWLY RDGLYGREVM PQPDGGGQPR
WYMHGLFA
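
The protein sequence starts with M although the gene sequence starts with GTG, which normally encodes valine. This is consistent with translation table 11, in which GTG can act as an alternative start codon and is then read as methionine. The sketch below illustrates this. It builds the standard codon assignments (which table 11 shares) and treats the first codon specially. The set of start codons is written from the NCBI table 11 definition as best recalled and should be treated as an assumption. The function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base cycling T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for translation table 11 (bacterial).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(sequence: str) -> str:
    """Translate a coding sequence; stops at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("GTGGTCGCCG CCAAGCGCGA CAATGCACTG"))
```

Applied to the first 30 bases of the gene sequence, this gives MVAAKRDNAL, the first block of the protein sequence shown above.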