Gene YpAngola_A0830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0830 
SymboldeoA 
ID5799292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp853815 
End bp855230 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content55% 
IMG OID641338827 
Productthymidine phosphorylase 
Protein accessionYP_001605405 
Protein GI162420216 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGTA GGCTGAAGGC CTGCTGTGTC GCTATTTCCG AAAGCGCCTG TGGGCGGCTT 
TCATCCCATA TTCCGGTCAA GCAGGGGGAT GCCTTGTTTC TGGCACAAGA AATTATCCGT
AAAAAACGCG ACGGTCAGCC ATTGAGCGAA GAAGAGATTC GTTTTTTTAT CAATGGGATC
CGCGATAACG TTGTTTCTGA AGGGCAAATT GCCGCTTTAG CGATGACCAT TTATTTCCAC
GACATGAGCA TGCCTGAGCG CGTTGCGCTG ACCATGGCGA TGCGTGATTC CGGTACTGTG
CTGAATTGGA AGAGCCTGAA TCTGAATGGC CCGCTGGTCG ATAAGCACTC CACTGGTGGC
GTGGGTGATG TGACGTCACT GATGCTTGGC CCGATGGTGG CAGCCTGCGG CGGCTATGTG
CCGATGATCT CTGGCCGTGG TCTTGGTCAT ACCGGCGGCA CACTGGATAA ACTGGAGGCG
ATCCCCGGTT TTGATATTTT CCCGGATGAT AATGCGTTCC GCAAAATTAT TCAGAATGTT
GGTGTGGCGA TTATCGGCCA AACCAGCTCG CTGGCCCCTG CCGATAAGCG TTTTTACGCG
ACCCGCGATA TTACGGCAAC AGTCGATTCT ATTCCATTGA TTACGGCCTC TATTCTGGCC
AAAAAATTGG CGGAAGGGCT GGATGCATTG GTCATGGACG TGAAGGTCGG CTCCGGTGCC
TTTATGCCAA CCTACTCGTT GTCGGCTGAT TTGGCGCAGG CGATTGTTGG TGTGGCAAAC
GGTGCGGGTT GCAAAACCAC GGCGCTACTG ACGGACATGA ACCAAGTCCT GGCGTCCAGC
GCCGGTAATG GGGTCGAAGT CCGCGAAGCT GTGCGTTTCC TGACGGGCGA ATATCGCAAC
CCACGTTTGC TGGAAGTGAC TATGGCGCTG TGTGTTGAAA TGTTGCTGTC AGGCGGTTTA
GCGCACGATG AAGCCGATGC CCGTGCCAAG CTGCAAGCCG TTTTGGATAA TGGCAAAGCG
GCGGAAGTCT TTGGCCGTAT GGTGGCCGCG CAAAAAGGCC CGGCAGACTT TGTTGAACGC
TATGACAGCT ACCTGCCCGT TGCTACCCTA AGCAAACCGG TATTTGCTGA ACAGACGGGA
ATCATTACTG CAATGGATAC CCGCGCCTTG GGTATGGCGG TGGTCGCCCT CGGCGGGGGA
CGCCGTCGGG CAACGGATCC CATTGATTAT AGTGTAGGGC TGACGGAGAT GGCCCGTTTG
GGTACCCGTG TTGATGGGCA GCAGCCGCTT GCGGTGATCC ATGCCAATAA CGAAGATGAC
TGGCAGCAGG CGGCAGAGGT TGTGCGTGCG GCCATCACCG TAGGGAATAA CACGCCAGAA
GAAACGCCAG TGATTTATCG CCGTATCACT GAATAA
 
Protein sequence
MHSRLKACCV AISESACGRL SSHIPVKQGD ALFLAQEIIR KKRDGQPLSE EEIRFFINGI 
RDNVVSEGQI AALAMTIYFH DMSMPERVAL TMAMRDSGTV LNWKSLNLNG PLVDKHSTGG
VGDVTSLMLG PMVAACGGYV PMISGRGLGH TGGTLDKLEA IPGFDIFPDD NAFRKIIQNV
GVAIIGQTSS LAPADKRFYA TRDITATVDS IPLITASILA KKLAEGLDAL VMDVKVGSGA
FMPTYSLSAD LAQAIVGVAN GAGCKTTALL TDMNQVLASS AGNGVEVREA VRFLTGEYRN
PRLLEVTMAL CVEMLLSGGL AHDEADARAK LQAVLDNGKA AEVFGRMVAA QKGPADFVER
YDSYLPVATL SKPVFAEQTG IITAMDTRAL GMAVVALGGG RRRATDPIDY SVGLTEMARL
GTRVDGQQPL AVIHANNEDD WQQAAEVVRA AITVGNNTPE ETPVIYRRIT E