Gene RPB_1070 details

Gene Information

Locus tag: RPB_1070
Symbol:
ID: 3908922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1229206
End bp: 1230279
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 72%
IMG OID: 637882963
Product: A/G-specific adenine glycosylase
Protein accession: YP_484691
Protein GI: 86748195
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
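
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1229206
END_BP = 1230279
GENE_LENGTH_BP = 1074
PROTEIN_LENGTH_AA = 357


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1074
print(protein_length(GENE_LENGTH_BP))   # 357
```

Both results agree with the Gene Length and Protein Length fields of this record.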


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCACAG CGGACGACGA CGGCGCCTCC GGCGGCCGCC CGGCGGCGCT GCTGGTCTGG 
TACGACCGCC ATCGCCGCGT GTTGCCGTGG CGCCCGCCCG CCGGCGTCGC CGCCGACCCC
TATGCGGTGT GGCTGTCGGA AATCATGCTG CAGCAAACCA CCGTCCGCGC GGTCGGCCCG
TATTTCGAGA AGTTCATGGC GCGCTGGCCC AGCGTGACGG CGCTCGGCCA AGCCTCGCTC
GACGACGTGC TGCGGATGTG GGCCGGGCTC GGCTACTACT CGCGGGCGCG CAATCTGCAC
GCCTGCGCGG TCGCGGTGGC GACGCAGCAC GGCGGCCGCT TTCCCGATAC GGAAGACGGC
CTGCGCGCCT TGCCGGGCGT CGGGCCTTAC ACCGCGGCGG CAATCGCGGC GATCGCCTTC
GGCCGGCAGA CCATGCCGGT CGACGGCAAT ATCGAGCGCG TGGTGTCGCG GCTCTACGCG
GTCGAAGACG AGATGCCGAA GGCCAAGCCG CGGATCCAGG AACTGGCGCG CACGCTGCTC
GGGCCGTCGC GCGCCGGCGA CAGCGCGCAG GCGCTGATGG ATCTCGGCGC CACCATCTGC
ACGCCGAAGA AGCCGGCCTG CGCGCTGTGC CCGATCGACG ATGATTGCGC GGCGAGGCGC
CGCGGCGACG CGGAGACGTT TCCGCGCAAG GCGCCGAAGA AGACCGGCGC GCTGCGCCGC
GGCGCGGCCT TCGTGGTGAT CCGCGGCGAT CAGGTGCTGC TCCGCAGCCG CGTCGCCAAA
GGCCTGCTCG GCGGCATGAC CGAGGTGCCG AATTCGGACT GGCTGTCCGA TCAGGACGAC
GCCGCGGCGC GTGCGCAGGC GCCGGCTGTG ACAGGCGCCA CGCGCTGGCA TCGCAAGGCA
GGAGTCGTCA GCCACGTGTT CACGCATTTC CCGCTCGAGC TCGTGGTCTA CACCGCGCAG
GCGCCGGCCG GGACGCACGC GCCGAAGGGC ATGCGCTGGG AGAAGATCGC CACGCTCGCC
GGCGAAGCGC TGCCCAATCT GATGCGCAAG GTGATCGCGC ATGCGCTCGA CTAA
 
Protein sequence
MITADDDGAS GGRPAALLVW YDRHRRVLPW RPPAGVAADP YAVWLSEIML QQTTVRAVGP 
YFEKFMARWP SVTALGQASL DDVLRMWAGL GYYSRARNLH ACAVAVATQH GGRFPDTEDG
LRALPGVGPY TAAAIAAIAF GRQTMPVDGN IERVVSRLYA VEDEMPKAKP RIQELARTLL
GPSRAGDSAQ ALMDLGATIC TPKKPACALC PIDDDCAARR RGDAETFPRK APKKTGALRR
GAAFVVIRGD QVLLRSRVAK GLLGGMTEVP NSDWLSDQDD AAARAQAPAV TGATRWHRKA
GVVSHVFTHF PLELVVYTAQ APAGTHAPKG MRWEKIATLA GEALPNLMRK VIAHALD
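
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It makes several assumptions: the codon-to-residue assignments are those of the standard bacterial code (the record lists translation table 11), the input starts at the initiation codon, and the first codon is read as Met whatever its sequence (here GTG). The names `translate_cds` and `FIRST_ROW` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon assignments for the standard/bacterial code, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumes the sequence begins at the start codon, so the first
    codon is always reported as M (for example GTG in this gene).
    """
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        residue = CODON_TABLE[seq[index:index + 3]]
        if residue == "*":
            break
        protein.append("M" if index == 0 else residue)
    return "".join(protein)


# First row of the gene sequence shown above (60 nt).
FIRST_ROW = "GTGATCACAG CGGACGACGA CGGCGCCTCC GGCGGCCGCC CGGCGGCGCT GCTGGTCTGG"
print(translate_cds(FIRST_ROW))  # MITADDDGASGGRPAALLVW
```

The output matches the first 20 residues of the protein sequence listed above.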