Gene RPD_1899 details


Gene Information

Locus tag: RPD_1899
Symbol:
ID: 4022381
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2132717
End bp: 2134159
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 67%
IMG OID: 637962092
Product: phage SPO1 DNA polymerase-related protein
Protein accession: YP_569035
Protein GI: 91976376
COG category: [L] Replication, recombination and repair
COG ID: [COG1573] Uracil-DNA glycosylase
TIGRFAM ID: [TIGR00758] uracil-DNA glycosylase, family 4
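
The coordinates and lengths listed above are consistent with one another. As a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the function names are hypothetical and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2132717, 2134159)
print(length)                  # 1443
print(protein_length(length))  # 480
```

Both values match the Gene Length and Protein Length fields of this record.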


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.530135
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCATTCCA TCCGGCTCGA CAGCGACACC GATTTCCATG GCTGGCGCAA GGCGGCGCGG 
GAGCTTGTGC TGGCGGAGGT CGCGCCGGCT GATATCAGTT GGACGGTGGC AGGCGACGAG
CCCGAATTGT TCGATGCGCT AGCGCCGCCT GCGCCCTCCC CGAGTGCGCC GTCGTCCGGC
ACCTTCAACG TTCCCGCTCG CTTCGTCGAG CTTGCGGCGA CCGCGATCCT GCATCGCGAT
CCGCAGCGCT TCGCCTGGCT GTATCAAGCG CTGTGGCGGC TGCGCGCCAA CCCGGAGCTG
TTGCAGATCG CGACCGATCC GGACGTTGCG CGGCTGCAGG CGATGGTGAA GGCGGTGCGC
CGCGATGAGC ACAAAATGCA CGCCTTCGTC CGCTTTCGCG AGATCGGCCG CGAGCCAAAG
TCGCGCTACG TCGCCTGGTT CGAGCCCGAG CACCATATCG TCGAGGCTGC GGCACCGTTC
TTCGCCCGGC GCTTCGCCGA CATGGCGTGG TCGATCCTGA CGCCGGACGT CTGCGCGCAT
TGGGACGGAC ACGCCATTGC GATCACGCCG GGCGTCGCCA AGGCGATGGC CCCATCCGAG
GATCGATTGG AAGAAACCTG GCTGACCTAC TACGCCAGCA TCTTCAATCC AGCGCGGCTG
AAGACCAAGG CGATGCAGGC GGAAATGCCG AAGAAATACT GGCGCAATCT ACCGGAAGCA
GCATTGATCA AGCCGCTGAT CGAGCATGCC GAGCGCGCCG CGCACGCAAT GATCGCCGCG
GAGGCGACCG CGCCGAAGAA ACCACAACGG CAGGAACAGC CGATGAGCCG AGCCGGACAC
GAAGGCGATA GGCTCGAAAC CTTGCGCGAA CAGGCGCGCG ACTGCCGCGC CTGCGATCTG
TGGAAGGACG CGACGCAGAC CGTGTTCGGC GAAGGTCCCC CGCATGCAAG CGTGATGCTG
GTCGGCGAGC AGCCCGGCGA CAAGGAAGAC CTCGCCGGCC ACCCGTTCGT CGGCCCGGCC
GGGCAGATGC TCGACCGCGC GCTGGCGGAA GCCGGGATCG ATCGCGCCGA GACCTACGTC
ACCAACGCGG TGAAGCATTT CAAATTCGTG CCGCGCGGCA AGATCCGCCT GCACCAGAAG
CCGGCGATGC CGGAAATCAA GGCGTGCCGG CCTTGGTACG AGCGCGAGCT CGCCGCGGTG
CGCCCGCAGC TCGTGGTGGC GATGGGCGCG ACCGCGGCGC AGAGCGTGCT CGGCAGGATC
ACACCGATCA ACAAGAACCG TGGCCATCTG ATCGATCGCG ACGGCGGCCC GCAGGTGCTG
GTCACGGTGC ACCCATCCTA TCTGCTCCGG CTGCCCGACG ACGACGCAAA GGCCCGCGAA
TACGCACGGT TCGTCGACGA CCTGAAGATC GCCGCCGCGC ATCTGAAGGC TGGGGCGGCC
TAG
 
Protein sequence
MHSIRLDSDT DFHGWRKAAR ELVLAEVAPA DISWTVAGDE PELFDALAPP APSPSAPSSG 
TFNVPARFVE LAATAILHRD PQRFAWLYQA LWRLRANPEL LQIATDPDVA RLQAMVKAVR
RDEHKMHAFV RFREIGREPK SRYVAWFEPE HHIVEAAAPF FARRFADMAW SILTPDVCAH
WDGHAIAITP GVAKAMAPSE DRLEETWLTY YASIFNPARL KTKAMQAEMP KKYWRNLPEA
ALIKPLIEHA ERAAHAMIAA EATAPKKPQR QEQPMSRAGH EGDRLETLRE QARDCRACDL
WKDATQTVFG EGPPHASVML VGEQPGDKED LAGHPFVGPA GQMLDRALAE AGIDRAETYV
TNAVKHFKFV PRGKIRLHQK PAMPEIKACR PWYERELAAV RPQLVVAMGA TAAQSVLGRI
TPINKNRGHL IDRDGGPQVL VTVHPSYLLR LPDDDAKARE YARFVDDLKI AAAHLKAGAA
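
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last 10-base groups of the gene sequence only. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, and this gene starts with ATG). The `translate` helper is a hypothetical name introduced here for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is
    ignored, and a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCATTCCA TCCGGCTCGA"))  # MHSIRL
print(translate("GGGGCGGCC TAG"))          # GAA
```

The two outputs agree with the first six and last three residues of the protein sequence above.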