Gene RPC_4368 details


Gene Information

Locus tag: RPC_4368
Symbol:
ID: 3970845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4871079
End bp: 4872194
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 69%
IMG OID: 637927477
Product: A/G-specific adenine glycosylase
Protein accession: YP_534210
Protein GI: 90425840
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
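
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4871079
end_bp = 4872194
gene_length_bp = 1116
protein_length_aa = 371

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
coding_codons = codons - 1

print(span_bp, remainder, coding_codons)  # prints: 1116 0 371
```

Under these assumptions the span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.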


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.357853
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0220919
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGTAT CGCCAGCCAG ACAAAAAGCC CGCCGCCAGG CTGCCCCGGG CTCTGCTTAT 
CCCCGGCTGC TGCTGGCCTG GTACGATCGG CATCGCCGCG CGCTGCCGTG GCGGGCGCTG
CCGGGGCAGG CCGCCGACCC CTACCGGGTC TGGCTCAGCG AAATCATGCT GCAGCAGACC
ACGGTGAAAG CGGTCGGGCC GTATTTCGAG AAATTTCTGG CGCGCTGGCC GAATGTTGCG
GCACTTGGCC GCGCCAGCCA GGACGACGTG CTGCGGATGT GGGCCGGGCT CGGCTATTAT
TCGCGGGCGC GCAATCTTTT CGCCTGCGCG GTGGCGGTTT CGCGCGACCA TGGCGGCGCC
TTCCCCGACA CCGAGGCCGG CCTGCGGGCG CTGCCGGGGA TCGGGCCCTA CACCGCTGCG
GCGATCGCGG CGATCGCGTT CGGCCGTCAT TGCATGCCGG TCGACGGCAA TATCGAGCGG
GTGGTGTCGC GGCTGTTCGC GGTCGAAGAC GCGCTGCCGC AGGCCAAGCC GAAGATTTCG
GAGTTAGCGC TGACGCTGGC GGGCGAAGCG CGCGCCGGAG ACTCGGCGCA GGCCCTGATG
GATCTCGGCG CCACCATCTG CACGCCGAAA AAGCCGGCCT GTGCGCTGTG CCCTTTAAAC
GAAGATTGCG TCGCCCGCAG CCGCGGCGAT CAGGAGACGT TTCCGCGCAA GGCTGCCAAG
ACCACCGGCA AACTGCGGCG CGGCGCCGCC TTCGTGGTGA GGCGCGGCGA CGAGCTTCTG
GTGCGCAGCC GGGCGGAAAA AGGCCTGCTC GGCGGCATGA CCGAAGTGCC GGGCTCCGAC
TGGATCGCCG ACCAGGACGA CACTATCGCG CGACAACAGG CGCCGGCGCT GCCGGGCGTG
GCGCGCTGGC AGCGCAAGCC GGGCGTGGTC AATCACGTCT TCACGCATTT TCCGCTGGAA
CTCGTGGTCT ACACCGCAAC CATGCCGGCG CGTAGCCGAG CGCCGATCGG CATGCGCTGG
GTCAAGATCG CGACCTTGCA GCACGAGGCG TTGCCGAACG TGATGCGCAA GGTGATCGCG
CACGGGCTCG GTGACGGCAA ACCGATCAGA CGCTGA
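
The GC content field can be recomputed from the gene sequence as the fraction of G and C bases. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of ten bases separated by spaces and line breaks). The function name `gc_content` is hypothetical and not part of the source database.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 60 bases of the gene sequence, copied from this record.
fragment = "GTGCCCGTAT CGCCAGCCAG ACAAAAAGCC CGCCGCCAGG CTGCCCCGGG CTCTGCTTAT"
print(f"{gc_content(fragment):.1%}")
```

Passing the full 1116 bp sequence instead of the fragment would give the value to compare with the GC content field.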
 
Protein sequence
MPVSPARQKA RRQAAPGSAY PRLLLAWYDR HRRALPWRAL PGQAADPYRV WLSEIMLQQT 
TVKAVGPYFE KFLARWPNVA ALGRASQDDV LRMWAGLGYY SRARNLFACA VAVSRDHGGA
FPDTEAGLRA LPGIGPYTAA AIAAIAFGRH CMPVDGNIER VVSRLFAVED ALPQAKPKIS
ELALTLAGEA RAGDSAQALM DLGATICTPK KPACALCPLN EDCVARSRGD QETFPRKAAK
TTGKLRRGAA FVVRRGDELL VRSRAEKGLL GGMTEVPGSD WIADQDDTIA RQQAPALPGV
ARWQRKPGVV NHVFTHFPLE LVVYTATMPA RSRAPIGMRW VKIATLQHEA LPNVMRKVIA
HGLGDGKPIR R
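
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative initiation codons, and an initiation codon is read as methionine. The sketch below illustrates this with a hand-built codon table. The function name `translate_cds` is hypothetical, and the code is a simplified illustration, not the pipeline used to produce this record.

```python
# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        codon = bases[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCCCGTAT CGCCAGCCAG ACAAAAAGCC"))  # prints: MPVSPARQKA
```

The output matches the first ten residues of the protein sequence above. The sketch does not handle ambiguous bases such as N.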