Gene Rpal_4978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4978 
Symbol 
ID6412670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5357814 
End bp5358863 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content71% 
IMG OID642714861 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_001993942 
Protein GI192293337 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.183286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGCGCGC GGCCCGCCGC ATTGCTGGCC TGGTACGACC GGCATCGCCG CACCCTGCCG 
TGGCGGGCGC CGCCCGGCGC GACCGCCGAT CCCTATGCGG TGTGGCTGTC GGAGATCATG
CTGCAGCAGA CCACCGTTCG TGCGGTCGGG CCGTATTTCG ACAAGTTCAT GGCGCGGTGG
CCGACGGTGA CGGCGCTGGG CGAGGCCTCG CTCGACGACG TGCTGAAGAT GTGGGCCGGG
CTCGGCTACT ATTCACGCGC CCGCAACCTG CACGCCTGCG CGGTGGCGGT GACGCGCCAG
CACGGCGGCC GCTTTCCCGA CACCGAGGAG GGGCTGCGGG CGCTGCCCGG CGTCGGGCCC
TACACAGCAG CCGCCATCGC CGCGATCGCG TTCAGCCGCC GGACCATGCC GGTCGACGGC
AATATCGAGC GGGTGGTGTC GCGGCTGTAC GCGGTCGAGG ACGAACTGCC GAAGGCCAAG
CCGCGCATCA AGGCGCTGGC CGAGACGCTG CTCGGCCCGT CCCGCGCCGG TGACAGCGCC
CAGGCGCTGA TGGATCTCGG CGCCACCATC TGCACGCCGA AGAAGCCGGC CTGCGCGCTG
TGCCCGCTGA TGCAGGGCTG CACCGCACGG CTGCGCGGTG ATGCCGAGAG CTTTCCGCGC
AAGGCGCCGA AGAAGACCGG GGCGCTGCGC CGCGGTGCCG CCTTCGTGGT GATCCGCGGC
GATCAGGTGC TGGTCCGCAG CCGCCCCGCC AAGGGCCTGC TCGGCGGCAT GACCGAGGTG
CCGAACTCCG ACTGGTTGCC CGATCAGGAC GAAGCCGCCG CCAAGGCGCA GGCCCCGGCG
CTGAAAGGCG TCGGTCGCTG GCATCGCAAA GCCGGCGTCG TCAGCCATGT GTTCACGCAC
TTCCCGCTGG AGCTGGCCGT GTATGTGGCG CATGCCTCGG CCGGCACCCG AGCCCCCACC
GGCATGCGCT GGACGCAGAT CTCGACGCTG TCGGACGAAG CTTTGCCCAA TCTGATGCGC
AAGGTGATCG CCCACGGCCT CGGTGATTGA
 
Protein sequence
MRARPAALLA WYDRHRRTLP WRAPPGATAD PYAVWLSEIM LQQTTVRAVG PYFDKFMARW 
PTVTALGEAS LDDVLKMWAG LGYYSRARNL HACAVAVTRQ HGGRFPDTEE GLRALPGVGP
YTAAAIAAIA FSRRTMPVDG NIERVVSRLY AVEDELPKAK PRIKALAETL LGPSRAGDSA
QALMDLGATI CTPKKPACAL CPLMQGCTAR LRGDAESFPR KAPKKTGALR RGAAFVVIRG
DQVLVRSRPA KGLLGGMTEV PNSDWLPDQD EAAAKAQAPA LKGVGRWHRK AGVVSHVFTH
FPLELAVYVA HASAGTRAPT GMRWTQISTL SDEALPNLMR KVIAHGLGD