Gene YpsIP31758_3497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3497 
SymboldeoA 
ID5387753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3946043 
End bp3947365 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content54% 
IMG OID640866510 
Productthymidine phosphorylase 
Protein accessionYP_001402452 
Protein GI153946986 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTTCTGG CACAAGAAAT TATCCGTAAA AAACGCGACG GTCAGCCATT GAGCGAAGAA 
GAGATTCGTT TTTTTATCAA TGGGATCCGC GATAACGTTG TTTCTGAAGG GCAAATTGCC
GCTTTAGCGA TGACCATTTA TTTCCACGAT ATGAGTATGC CTGAGCGCGT TGCGCTGACC
ATGGCGATGC GTGATTCCGG TACTGTGCTG AATTGGAAGA GCCTGAATCT GAATGGCCCG
CTGGTCGATA AGCACTCCAC TGGTGGCGTG GGTGATGTGA CGTCACTGAT GCTTGGCCCG
ATGGTGGCAG CTTGCGGCGG CTATGTGCCG ATGATCTCTG GCCGTGGTCT TGGTCATACC
GGCGGCACAC TGGATAAACT GGAGGCGATC CCCGGTTTTG ATATTTTCCC GGATGATAAT
GCGTTCCGCA AAATTATTCA GAATGTTGGT GTGGCGATTA TCGGCCAAAC CAGCTCGCTG
GCCCCTGCCG ATAAGCGTTT TTACGCGACC CGCGATATTA CGGCAACAGT AGATTCTATT
CCATTGATTA CGGCCTCTAT CCTGGCCAAA AAATTGGCGG AAGGCCTGGA TGCATTGGTC
ATGGACGTGA AGGTCGGCTC TGGTGCCTTT ATGCCAACCT ACTCGTTGTC GGCTGATTTG
GCGCAGGCGA TTGTTGGTGT GGCAAACGGG GCGGGTTGCA AAACCACGGC GCTCCTGACG
GACATGAACC AAGTCCTGGC ATCCAGCGCC GGTAATGGGG TCGAAGTCCG CGAAGCTGTG
CGTTTCCTGA CGGGCGAATA TCGCAACCCA CGTTTGCTGG AAGTGACTAT GGCGCTGTGT
GTTGAAATGT TGCTGTCAGG CGGTTTAGCG CACGATGAAG CCGATGCCCG TGCCAAGCTG
CAAGCCGTTT TGGATAACGG CAAAGCGGCA GAAGTCTTTG GCCGCATGGT GGCCGCGCAA
AAAGGCCCGG TAGACTTTGT TGAACGCTAT GACAGCTACC TGCCCGTTGC TACCCTAAGC
AAACCGGTAT TTGCTGAACA GACGGGAATC ATTACTGCAA TGGATACCCG CGCCTTGGGT
ATGGCGGTGG TCGCCCTCGG CGGGGGACGC CGTCGGGCAA CGGATCCCAT TGATTATAGT
GTAGGGCTGA CGGAAATGGC CCGCTTGGGT ACCCGTGTTG ACGGGCAGCA GCCACTTGCG
GTGATCCATG CCAATAACGA AGATGACTGG CAACAGGCGG CAGAGGCTGT GCGTGCGGCC
ATCACCTTAG GGAATAACGC GCCAGAAGAA ACGCCAGTGA TTTATCGCCG TATCACTGAA
TAA
 
Protein sequence
MFLAQEIIRK KRDGQPLSEE EIRFFINGIR DNVVSEGQIA ALAMTIYFHD MSMPERVALT 
MAMRDSGTVL NWKSLNLNGP LVDKHSTGGV GDVTSLMLGP MVAACGGYVP MISGRGLGHT
GGTLDKLEAI PGFDIFPDDN AFRKIIQNVG VAIIGQTSSL APADKRFYAT RDITATVDSI
PLITASILAK KLAEGLDALV MDVKVGSGAF MPTYSLSADL AQAIVGVANG AGCKTTALLT
DMNQVLASSA GNGVEVREAV RFLTGEYRNP RLLEVTMALC VEMLLSGGLA HDEADARAKL
QAVLDNGKAA EVFGRMVAAQ KGPVDFVERY DSYLPVATLS KPVFAEQTGI ITAMDTRALG
MAVVALGGGR RRATDPIDYS VGLTEMARLG TRVDGQQPLA VIHANNEDDW QQAAEAVRAA
ITLGNNAPEE TPVIYRRITE