Gene Rpal_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2002 
Symbol 
ID6409662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2166398 
End bp2167885 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content67% 
IMG OID642711888 
Productphage SPO1 DNA polymerase-related protein 
Protein accessionYP_001991000 
Protein GI192290395 
COG category[L] Replication, recombination and repair 
COG ID[COG1573] Uracil-DNA glycosylase 
TIGRFAM ID[TIGR00758] uracil-DNA glycosylase, family 4 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.515023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCGCA TCCGGCTCGA CAGCGACACC GACTTCGACG GCTGGCGCAA GGCGGCGCGG 
GAGCTGGTGC TGGCGGACGT GGCACCGGCG GATGTGACCT GGACGGTGGC GGGCGACGAG
CCGGAGCTGT TCGACGCGCC AGGGCGGGCG CCGGATGCTG GGTCCTTTGC TGAAACGGCC
GCGCCGTTCA TTCATCCTGC ACAATCCCCC TCCCCCGCGC AGACCTTCAA CGTCCCTGCC
CGCTTCGTCG AGCTGGCGCA CATTGCCATC CTGCATCGCG ATCCGCAGCG CTTTGCGTAC
CTGTATCAGC TGCTGTGGCG GCTCCGCGCC AATCCCGAGC TGATGCAGGT GGCAACCGAT
CCGGACGTGG CACGGCCACA GGCAATGGCC AAGGCGGTAA GGCGCGACGA ACACAAGATG
CACGCCTTCG TTCGCTTCCG CGAGATCGGC CGCGAGCCGA AGTCGCGTTA CGTCGCCTGG
TTCGAGCCCG AGCATCATAT CGTCGAGCTC GCCGCGCCTT TCTTCGCACG GCGCTTCGCC
GATATGGAAT GGTCGATCCT GACACCCGAC GTCTGCGCAC ATTGGGACGG CCACGCGATC
GCAATCACGC CCGGCGTCAG CAAGGCAATG GCGCCGTCGG AGGATCGGCT GGAGGAAACC
TGGCTGACTT ACTACGCCAG TATCTTCAAT CCGGCGCGGC TGAAGACCAA GGCGATGCAG
GCCGAAATGC CGAAGAAATA TTGGCGCAAT CTGCCCGAAG CGGCGCTGAT CAAACCGCTG
ATCGAACACG CCGAGCGCAA GGCGCATGCG ATGGTCGCGG CCGAGGCGAC CGCGCCGAGG
AAACCGCAGC GACAGGAGCC GCCGATGACG AGAGCCGAGC CGAAGGCCGA CACGCTGGCG
CATCTGCGCG AGGAAGCGAA AGATTGCCGC GCCTGCGACC TGTGGAAGGA CGCCACCCAG
ACCGTGTTCG GCGAAGGCCC GCAGCACGCC ACCGTGATGC TGGTCGGCGA GCAGCCCGGC
GACAAGGAAG ACCTCGCCGG CAAGCCGTTC GTCGGCCCCG CCGGCCAGAT GCTCGACCGC
GCACTGGCGG AGGCCGGCGT CGACCGCAGC AAGACCTACG TCACCAACGC GGTGAAGCAC
TTCAAATTCG TGCCGCGTGG CAAGATCCGC CTGCACCAGA AGCCGGCCAC GCCGGAGATT
AAGGCTTGCC GGCAGTGGTA CGAGCGCGAG CTCGCCGCGG TGCAGCCGCT GCTGGTGGTG
GCGATGGGCG CCACCGCGGC GCAGAGCGTG CTCGGCAGGA TCACCCCCAT CAACAAAACC
CGCGGCCGCC TGATCGACCG CGACGACGGT CCGCAAGTGC TGGTCACCGT CCACCCGTCC
TACTTGCTGC GGCTGCCCGA CGCCGACGCC AAGGCGCGCG AATACGCTCG CTTCGTCGAA
GATCTGAAGC TCATCGCCGC ACATCTGAAG AAGGCGCATG CGGCGTAG
 
Protein sequence
MNRIRLDSDT DFDGWRKAAR ELVLADVAPA DVTWTVAGDE PELFDAPGRA PDAGSFAETA 
APFIHPAQSP SPAQTFNVPA RFVELAHIAI LHRDPQRFAY LYQLLWRLRA NPELMQVATD
PDVARPQAMA KAVRRDEHKM HAFVRFREIG REPKSRYVAW FEPEHHIVEL AAPFFARRFA
DMEWSILTPD VCAHWDGHAI AITPGVSKAM APSEDRLEET WLTYYASIFN PARLKTKAMQ
AEMPKKYWRN LPEAALIKPL IEHAERKAHA MVAAEATAPR KPQRQEPPMT RAEPKADTLA
HLREEAKDCR ACDLWKDATQ TVFGEGPQHA TVMLVGEQPG DKEDLAGKPF VGPAGQMLDR
ALAEAGVDRS KTYVTNAVKH FKFVPRGKIR LHQKPATPEI KACRQWYERE LAAVQPLLVV
AMGATAAQSV LGRITPINKT RGRLIDRDDG PQVLVTVHPS YLLRLPDADA KAREYARFVE
DLKLIAAHLK KAHAA