Gene Dshi_0828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0828 
SymboldeoA 
ID5711454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp835752 
End bp837059 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content74% 
IMG OID641266737 
Productthymidine phosphorylase 
Protein accessionYP_001532174 
Protein GI159043380 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.61021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCGC GGACGGTGAT CGCGGCGGTG CGGGACGGCG CGCCGGTGGT GGCGGAGGAC 
CTGCGCGCCT TCGCCCGGGG CCTCGCCTCC GGCACGGTCA GCGATGCCCA GGCCGGGGCC
TTCGCCATGG CGGTCCTGTT GCGGGGCTTG GGCGATGCGG GGCGCGTGGC GCTGACGCAA
GCCATGCGCG ACAGTGGCGA TGTGCTGGCC TGGGACCTGC CCGGGCCGGT GCTCGACAAG
CATTCCACGG GCGGGATCGG GGATACGGTG TCGCTCCTGC TGGCCCCGGC GCTGGCGGCC
TGCGGGGCTT ACGTGCCGAT GATCTCGGGC CGGGGGCTGG GTCATACGGG CGGCACGCTC
GACAAACTCG AGGCGCTGGA GGGCTACCGC ACCGACATTT CCGAGGATGC GCTGAGGGCG
GTGGTGGCCG AGACCGGCTG CGCCATCGTC GGGGCCACGG GCCAGATCGC GCCCGCGGAC
CGGCGGCTCT ATGCCATCCG CGACGTCACC GCCACGGTCG AAAGCATCGA CCTGATCACC
GCCTCGATCC TGTCCAAGAA GCTCGCGGCG GGGCTGGGCG GGCTGGTGCT GGATGTCAAA
TGCGGCTCGG GCGCGTTTCT GACCGACCCC GCAGCGGCCC GGGATCTGGC GCAGGCGCTG
GTGCGCACGG CCAATGGCGC GGGCTGTCCC ACCTGGGCGC TGCTGACGGA TATGGATGAG
AGCCTCGCCA GTGCCGCGGG CAATGCGCTG GAGGTGGCGG AGGTGCTGCG TATCCTGCGG
GGCGACGCGA CGAGCCCGCG CCTGCGCGCG GCGACCGTGG CCCTGGGCGG GGCGCTGCTG
GCCCTGGGCG GGCTGGCGGA CGACCCCGCG GACGGGGCTG CACGGATCGA CGCCACCCTG
CGCGATGGCC GCGCGGCGGA GGTTTTCGCC CGCATGGTCG CTGCCCTTGG CGGGCCGCGC
GACGTGGTGG CCACGGGCCT TGCCGGTTTG CCCGAGGCGC CGGTGGTGCG CGAAGTGCCC
GCACCCGCGG CGGGCGTGCT GACCGCCTGC GATGCCAAGG CGCTGGGCTG GGCGGTGGTG
TCCCTGGGCG GCGGACGGCA GGTGGAAACC GACCCGGTCG ACCCGGCGGT GGGCCTGTCG
GATATCGCCC CGCTCGGCAC CCGCGTCGTC CGGGGCGAGC CCATCGCCCG GGTCCATGCC
GCCCGCGAAG AGGCCGCCGA CCGCGCCGTG GCCGAGGTTC AGGCGGCCTT CACCCTCGGC
ACGAGTTTCG ACGCGAAGCC GCTTGTCCTC GGAGAGGTCC GCCCATGA
 
Protein sequence
MTARTVIAAV RDGAPVVAED LRAFARGLAS GTVSDAQAGA FAMAVLLRGL GDAGRVALTQ 
AMRDSGDVLA WDLPGPVLDK HSTGGIGDTV SLLLAPALAA CGAYVPMISG RGLGHTGGTL
DKLEALEGYR TDISEDALRA VVAETGCAIV GATGQIAPAD RRLYAIRDVT ATVESIDLIT
ASILSKKLAA GLGGLVLDVK CGSGAFLTDP AAARDLAQAL VRTANGAGCP TWALLTDMDE
SLASAAGNAL EVAEVLRILR GDATSPRLRA ATVALGGALL ALGGLADDPA DGAARIDATL
RDGRAAEVFA RMVAALGGPR DVVATGLAGL PEAPVVREVP APAAGVLTAC DAKALGWAVV
SLGGGRQVET DPVDPAVGLS DIAPLGTRVV RGEPIARVHA AREEAADRAV AEVQAAFTLG
TSFDAKPLVL GEVRP