Gene RPC_4089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4089 
Symbol 
ID3973178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4542803 
End bp4544344 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content67% 
IMG OID637927193 
Productthymidine phosphorylase 
Protein accessionYP_533934 
Protein GI90425564 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02645] putative thymidine phosphorylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0836613 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCG CCGACGCGCC CCGCCCGCAA CTCAAGATCC GCCTGATCCA TCTCGACACC 
GGCCGCGAAA ACGTCGCGGT GATGTCGCGG CGCTCAAAGG CGCTGCGCCC GGAGGTGTTC
AGCGGCTTCA GCCGGGTCGA GATCCGCCGC AATGCGAAGT CGGTGCTGGC GACCTTGCTG
ATTACCGACG ACGACGCGCT GGTCGGGCCG GACGAACTCG GCCTCGCCGA GCCGGCGTTC
CGGCGTTTCG GCGAACCCGC CGGGACCTTC GTCACGGTGT CGCCGGCGAC GCCGCCGGAC
AGTCTCGACG CGGTGCGCGG CAAGATCCAG GGCCGCACGC TGAGCGCGGC GGAGATCACC
GCCATCGTCA ACGATCTCAC CCGCTATCGT TATTCCGACA TGGAGATCGC CGCCTTCCTG
ATCGGCGCGG CGCGGTTCAT GACTTCGGAC GAACTGCTGG CGCTGGTGTC GGCGATGGCC
TCGGTCGGCA CTCAGTTGCG CTGGGATCGC CCGATCGTGG TCGACAAGCA TTGCATCGGC
GGCATTCCCG GCAATCGCAC CTCGATGATC CTGGTGCCGA TCGTCGCCGC GCACGGGCTG
ACGATTCCGA AAACCTCGTC GCGCGCCATC ACCTCGCCGG CCGGCACCGC CGATACCATG
GAGGTGTTGG CGCGGGTCGA TGTCAGCGTC GCGGAAATGA AAGAGATCGT CGCCGCCTGC
AACGGCTGTC TGATCTGGGG CGGCCACGTC AATCTCTCGC CCGCCGATGA CATCTTGATC
TCGGTGGAGC GGCCGCTGTG TCTCGACACC CGCGAGCAGA TGGTGGCCTC GATCATGTCG
AAGAAGCTCG CCGCCGGCTC GACGCATCTG TTGGTCGATC TGCCGGTCGG CCCGACCGCC
AAGGTCGCCA GCGCACTCGA CGCGATGCGG CTGCGCAAGC TGTTCGAATT CGTCGGCGAT
CATTTCGGCA TCGCGGTGGA GACCATCACC ACCGACGGCC GGCAGCCGAT CGGCAACGGC
ATCGGCCCGG TGCTGGAGGC GCAGGACGTC ATGGCGGTGC TCGGCAACGA TCCGAAGGCT
CCCGCCGATC TGCGCGAAAA ATCGCTACGG TTGGCGGCGC ATCTGTTGGA ATACGATCCG
CATCTGCGCG GCGGCGCGGG CTACGCGCGG GCCCGCGAGT TGCTGGAAAG CGGCGCGGCG
CTGAAGCAGA TGCAGAAGAT CATCGACAAT CAGGGGCCTT CGACCTGTCA CAAGGACCTC
GGAACGCTGA CCGCCGAGGT GACGGCGGAG CGCGACGGCG TGGTGTCGGC GATCGATTGC
CTGCAGCTCA ATCGGCTGGC GCGCACCGCC GGCGCGCCGA TCGACAAGGG CGCCGGCATC
CGGCTGTTCA AGAAGGTCGG CGATCGTGTC GAGGCCGGCG AGCCGCTGTA TCGAATTTAC
GCCTTCGATC CGGCCGAGCG CGAACTCGCG GTGGCTGCCG CCAAGCTCGC CTGCGGTTAC
ACGGTCGACG ACGCGCAAAC CTTTCGCGAG CAGGTGATGT AG
 
Protein sequence
MTSADAPRPQ LKIRLIHLDT GRENVAVMSR RSKALRPEVF SGFSRVEIRR NAKSVLATLL 
ITDDDALVGP DELGLAEPAF RRFGEPAGTF VTVSPATPPD SLDAVRGKIQ GRTLSAAEIT
AIVNDLTRYR YSDMEIAAFL IGAARFMTSD ELLALVSAMA SVGTQLRWDR PIVVDKHCIG
GIPGNRTSMI LVPIVAAHGL TIPKTSSRAI TSPAGTADTM EVLARVDVSV AEMKEIVAAC
NGCLIWGGHV NLSPADDILI SVERPLCLDT REQMVASIMS KKLAAGSTHL LVDLPVGPTA
KVASALDAMR LRKLFEFVGD HFGIAVETIT TDGRQPIGNG IGPVLEAQDV MAVLGNDPKA
PADLREKSLR LAAHLLEYDP HLRGGAGYAR ARELLESGAA LKQMQKIIDN QGPSTCHKDL
GTLTAEVTAE RDGVVSAIDC LQLNRLARTA GAPIDKGAGI RLFKKVGDRV EAGEPLYRIY
AFDPAERELA VAAAKLACGY TVDDAQTFRE QVM