Gene Sare_0756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0756 
SymboldeoA 
ID5706960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp841700 
End bp842980 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content72% 
IMG OID641270275 
Productthymidine phosphorylase 
Protein accessionYP_001535666 
Protein GI159036413 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0537084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGATT TTACGGCAGT TGACATCATT CGGGCCAAAC GGGACGGCGA GGCGTTCTCC 
GACGCGCAGG TCGACTGGGT GGTGGATGCC TATACCCGAG GTCAGGTGGC GGACGAACAG
ATGGCCGCGC TGGCGATGGC GATCCTGCTG AACGGGATGA CCACGCCGGA GATCGCCCGG
TGGACCGCGG CGATGATCGC CAGCGGTGAG CGACTGGACC TCGCCGCGGT CGATCGGCCG
ACCGTCGACA AACACTCCAC CGGTGGTGTC GGAGACAAGA TCACCCTGCC GCTCACCCCG
CTGGTGGCGG CCTGCGGCGC GGCGGTGCCG CAGCTGAGCG GTCGGGGACT CGGGCACACC
GGTGGCACGC TCGACAAGCT GGAGTCCATC CGGGGCTGGC GGGCGTCGCT GGGCAACGAG
GAGTTCATCG CCCAGCTCCG CGGTGTCGGC GCGGTGATCT GCGCGGCCGG CGCCGGTCTC
GCGCCGGCCG ACCGCAAGCT GTACGCGCTG CGGGACGTGA CCGGCACGGT GGAGGCCATC
CCACTCATCG CCAGTTCGAT TATGAGCAAG AAGATCGCTG AGGGGACCGG TGCCCTGGTC
CTGGACGTCA AGGTCGGCTC CGGTGCGTTC ATGAAGTCGG TCGACCAGGC CCGGCAACTG
GCCCGCACCA TGGTCGAGTT GGGCAGCGCG CACGACGTAC GCACGGTTGC CCTGCTCACC
GACATGTCCA CCCCACTAGG GCTGGCCATC GGCAACGCGG TCGAGGTGGC CGAATCGGTC
GAGGTACTGG CGGGTGGTGG ACCGGCCGAC GTGGTCGAGC TGACCCTGGC CCTGGCCCGG
GAGATGCTCG AAGCCGCCGG TCTACCGGAT GCCGACCCGG CGACGGCGCT GCGCGACGGC
CGGGCGATGG ACTCCTGGCG GGCGATGCTG CGGGCCCAGG GCGGGGATCC GGATGCCCCG
CTGCCCACGG CGCCGGAGAC CGAGGTGGTG CGCGCCGACA CCGACGGTGT GGTGGCGGAG
GTCGACGCGT TCGGCATGGG GGTGGCCGCG TGGCGGCTCG GCGCCGGGCG GGCCCGCAAG
GAGGACCCGG TGTCCGCGCC GGCGGGCGTG CTGCTGCGCA AACGCCCCGG CGATCCGGTC
CGGGCCGGTG ACCCGCTCTT CGAGCTGCGG GCCGAGGACG CCGCTCGGAT CCCGGCGGCC
CGGGAGGAGG CGGTGCGGGC GGTGCGTATT GCGGCGTCGA CCCCGGAGCC GAGGCCACTG
GTGCTCGAAC GAATCGGCTG A
 
Protein sequence
MVDFTAVDII RAKRDGEAFS DAQVDWVVDA YTRGQVADEQ MAALAMAILL NGMTTPEIAR 
WTAAMIASGE RLDLAAVDRP TVDKHSTGGV GDKITLPLTP LVAACGAAVP QLSGRGLGHT
GGTLDKLESI RGWRASLGNE EFIAQLRGVG AVICAAGAGL APADRKLYAL RDVTGTVEAI
PLIASSIMSK KIAEGTGALV LDVKVGSGAF MKSVDQARQL ARTMVELGSA HDVRTVALLT
DMSTPLGLAI GNAVEVAESV EVLAGGGPAD VVELTLALAR EMLEAAGLPD ADPATALRDG
RAMDSWRAML RAQGGDPDAP LPTAPETEVV RADTDGVVAE VDAFGMGVAA WRLGAGRARK
EDPVSAPAGV LLRKRPGDPV RAGDPLFELR AEDAARIPAA REEAVRAVRI AASTPEPRPL
VLERIG