Gene RSc0204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc0204 
SymboldeoA 
ID1219007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp225922 
End bp227445 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content63% 
IMG OID637236561 
Productthymidine phosphorylase 
Protein accessionNP_518325 
Protein GI17544923 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02645] putative thymidine phosphorylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.61192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCTC CGCCTGAGGT GGCAGCACTG CCCGATCGGC TTACCTTCAA GCCGTTGGGC 
ATCGACACTT GGCAGGAGCA CGTCATCTAC ATGCATCCGG ATTGCGCAAT CTGCCGGGCT
GAGGGGTTCA CTGCACAAGC TCGGGTGGAG GTGCGGATCG GTCTGCGCTC CTTGATTGCC
ACACTCAATC TCGTCGGCTC AGGCTTGCTG GAGATGTGTG AGGTCAGCTT GTCCGTCAGC
GCCGTCGAGA CGCTGATGGC GAGGCCCGGC GATATTGTGA CTGTCAGTCA CGCGCCCGCG
CTGGAGTCGC TGCGGGCGGT ACGCGCCAAG ATCTATGGGG CGCATCTGGA TACGCATCAA
CTGGCTAGCG TCGTCGGTGA TATTGCCAAA GAGCGGTATG CCGACGTCCA CATTGCGGCG
TTCCTGAGCG CCTGCGCAGG CGGGCGAATG AGCGTCAAGG AGACAATCGA TCTCACCCAG
GCCATGGTCG ACTCGGGCGA ATGTCTCGAA TGGGATCGCG AGATCGTCGC GGACAAGCAC
TGTGTGGGCG GCTTGCCGGG CAACCGTACC AGTCCCATCG TGGTCGCCAT CGCTGCCGCT
GCAGGCTTGT TGCTACCTAA GACCTCGTCG CGCGCCATCA CGTCACCCGC CGGTACCGCC
GACACGATGG AGACACTCAC GCGTGTTGCT CTGAGCGCCA CGGAGTTGCG ACGCGTCGTT
GATCGGGTTG GAGCTTCGCT CGCGTGGGGC GGTGCGCTCA GCCTTAGCCC CGCCGACGAC
GTGCTCATTC GCGTGGAGCG GGCGTTGGAT GTGGATAGCG ATGCCCAACT TGCGGCCTCC
ATTCTGTCGA AGAAGATCGC GGCCGGGTCA ACCCATGTCT TGATCGACGT GCCCGTGGGG
CCGACGGCCA AGGTGCGCAG TCTGCAGGAT TTGGAGCGCC TGCGTATGCT GCTCGAGCGC
GTAGCGCGGT CGTTCGGCGT GCGCGTCACG ATCGTGCGCA CGGACGGCTC GCAGCCGGTT
GGCAGGGGAA TTGGTCCGGC GCTTGAAGCA CGAGACGTCT TGGCCGTGCT TCAACGCTCT
CCTGCGGCGC CGTTCGACCT GCGGGAGCGG TCGTTGTTGC TGGCTGCGAC TCTGCTAGAG
TTTTGTGGGG CGGTGGAGCA GGGGGCAGGG CTTGAGATGG CCACAGGCGT GCTGGACAGT
GGTGCGGCGT GGCGGAAGTT CGAGGAAATC TGCGAAGCGC AGGGAGGCCT GCGTGTGCCA
GGTGAGGCCA TCTTCCGTCG TGATGTGGTA GCTGAGCAAG ACGGCATCGT CACCGAGATC
GACAACCGAC ATCTTGCTCG TATCGCGAAA CTCGCGGGGG CTCCGATGCG CCAAGTGGCA
GGCGTGGAGA TGCACGTGAG ACTACACGAC CAGGTTAAGG CGGGGCGGCC TCTCTTTACC
ATCCATGCCC AGGCTTCAGG TGAACTGGAA TATTCCGTAG CTTATGCACT GATGCACCCA
GCGGTTTCCA TCGCCCCGAC TTGA
 
Protein sequence
MLAPPEVAAL PDRLTFKPLG IDTWQEHVIY MHPDCAICRA EGFTAQARVE VRIGLRSLIA 
TLNLVGSGLL EMCEVSLSVS AVETLMARPG DIVTVSHAPA LESLRAVRAK IYGAHLDTHQ
LASVVGDIAK ERYADVHIAA FLSACAGGRM SVKETIDLTQ AMVDSGECLE WDREIVADKH
CVGGLPGNRT SPIVVAIAAA AGLLLPKTSS RAITSPAGTA DTMETLTRVA LSATELRRVV
DRVGASLAWG GALSLSPADD VLIRVERALD VDSDAQLAAS ILSKKIAAGS THVLIDVPVG
PTAKVRSLQD LERLRMLLER VARSFGVRVT IVRTDGSQPV GRGIGPALEA RDVLAVLQRS
PAAPFDLRER SLLLAATLLE FCGAVEQGAG LEMATGVLDS GAAWRKFEEI CEAQGGLRVP
GEAIFRRDVV AEQDGIVTEI DNRHLARIAK LAGAPMRQVA GVEMHVRLHD QVKAGRPLFT
IHAQASGELE YSVAYALMHP AVSIAPT