Gene RPD_4252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4252 
Symbol 
ID4024773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4720356 
End bp4721633 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content65% 
IMG OID637964458 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_571370 
Protein GI91978711 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.82541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.253409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA AGCCTCCCTC CGACATCCTC GATCGCCGCC GGTTTCTCGG CGCAGCAGGC 
CTTGCGGGCG CCGGCGCATT GCTGCCCCTC GCGGCGAAGG CGGGCGAGGC GGCGAAGCCG
GACCCGGCGA TCACCGAGGT GCAGGACTGG AATCGCTATC TCGGCGACGG CGTCGACAAG
AAGCCCTATG GCGTTCCGTC CAAGTTCGAA AAGGATGTGA TCCGCCGCGA CGTGTCGTGG
CTCACCGCCT CGCCGGAATC CTCGGTCAAT TTCACGCCGC TGCATGCGAT CGACGGCATC
ATCACGCCGT CCGGCGTGTG TTTCGAGCGC CACCACGGCG GCGTCGCCGA GATCAACCCG
GCGGAGCACC GGCTGATGAT CAATGGCCTG GTCGACACCC CGATGGTGTT CACCATGGAC
GACATCAAGC GGATGCCGCG GGTCAACAAG GTGTACTTCC TGGAATGCGC GGCGAACTCC
GGCATGGAGT GGCGCGGCGC GCAGCTCAAC GGCTGCCAGT TCACCCACGG CATGATCCAC
AACGTGATGT ACACCGGCGT GCCCCTGAAG GTGCTGCTCG AACAGGCCGG GCTGAAGCCG
AACGCGAAAT GGCTGATGCT GGAGGGCGCG GACAGCGCCG GCATGAATCG CTCGCTGCCG
GTTGCGAAGG CGCTCGACGA CGTGCTGATC GCGTTCGCGA TGAATGGCGA GGCGCTGCGC
CCCGAGAACG GCTATCCGCT GCGCGCGGTG ATCCCCGGCT GGCAGGGTAA TCTCTGGGTG
AAATGGCTGC GCCGGATCGA AGCCGGCGAC CAGCCCTGGC AGGCCCGCGA GGAAACCTCG
AAATACACCG ATCTGATGCC CGACGGCCGC GCCCGCAAAT ACACCTTCGT GATGGATGCG
AAGTCGGTGA TCACCAACCC GTCGCCGCAA GCGCCGCTGA AGTTCAAGGG CCGCAACGTG
CTGAGCGGCG TCGCCTGGTC GGGCCGCGGC ACCGTCAAGC GCGTCGACGT CACGATGGAC
GGCGGTCGGA ACTGGCGTGA GGCGCGGATC GACGGACCGG TGCTGGACAA GTCGTTGGTG
CGTTTCTACG TCGATTTCGA CTGGAACGGT CAGGAACTGA TGCTGCAGTC GCGCGCCATC
GACGAGACCG GCTACGTACA GCCGACCAAG GCCGAGCTGC GCAAGGTCCG CGGCGTCAAC
TCGATCTATC ACAACAACGG CATCCAGACT TGGCTCGTGC ATCCGGACGG AGTGACTGAA
AATGTCGAGA TCGCTTAG
 
Protein sequence
MSQKPPSDIL DRRRFLGAAG LAGAGALLPL AAKAGEAAKP DPAITEVQDW NRYLGDGVDK 
KPYGVPSKFE KDVIRRDVSW LTASPESSVN FTPLHAIDGI ITPSGVCFER HHGGVAEINP
AEHRLMINGL VDTPMVFTMD DIKRMPRVNK VYFLECAANS GMEWRGAQLN GCQFTHGMIH
NVMYTGVPLK VLLEQAGLKP NAKWLMLEGA DSAGMNRSLP VAKALDDVLI AFAMNGEALR
PENGYPLRAV IPGWQGNLWV KWLRRIEAGD QPWQAREETS KYTDLMPDGR ARKYTFVMDA
KSVITNPSPQ APLKFKGRNV LSGVAWSGRG TVKRVDVTMD GGRNWREARI DGPVLDKSLV
RFYVDFDWNG QELMLQSRAI DETGYVQPTK AELRKVRGVN SIYHNNGIQT WLVHPDGVTE
NVEIA