Gene SeSA_A4822 details


Gene Information

Locus tag: SeSA_A4822
Symbol: deoA
ID: 6515749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 4678746
End bp: 4680068
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 56%
IMG OID: 642749754
Product: thymidine phosphorylase
Protein accession: YP_002117483
Protein GI: 194734315
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02643] thymidine phosphorylase
            [TIGR02644] pyrimidine-nucleoside phosphorylase
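
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4678746
end_bp = 4680068

# Inclusive span: both endpoints count as bases.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop and is not part of the protein.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1323 0 440
```

The result agrees with the listed values of 1323 bp and 440 aa.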


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.524384
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTCTCG CACAAGAAAT TATTCGTAAA AAGCGTGATG GTCATGCGTT GAGTGACGAA 
GAAATTCGTT TCTTTATCAA TGGTATTCGT GACAATACTA TCTCTGAAGG GCAGATTGCC
GCCCTGGCGA TGACCATCTT CTTCCACGAT ATGACCATGC CGGAGCGTGT TTCGCTGACC
ATGGCGATGC GGGATTCCGG TTCTGTCCTT GACTGGAAAA GCCTGAATCT CAATGGCCCG
ATTGTCGATA AGCATTCGAC CGGCGGCGTA GGGGACGTGA CGTCTCTGAT GCTGGGGCCA
ATGGTAGCGG CCTGCGGCGG TTATGTACCG ATGATCTCCG GTCGCGGCCT CGGACATACC
GGCGGTACGC TCGATAAACT GGAAGCGATC CCGGGCTTCG ACATCTTCCC GGACGACAAC
CGTTTCCGCG AAATTATTCA AGACGTGGGT GTGGCGATTA TTGGGCAAAC CAGCTCGCTT
GCACCGGCGG ACAAACGTTT TTACGCCACC CGCGATATTA CCGCGACGGT GGACTCTATT
CCGCTGATCA CTGGTTCCAT CCTCGCCAAG AAACTGGCCG AAGGGCTGGA TGCGCTGGTA
ATGGACGTCA AAGTCGGCAG CGGCGCGTTT ATGCCAACCT ATGAACTTTC TGAAGCCCTT
GCTGAAGCGA TTGTCGGCGT GGCAAACGGC GCGGGAGTTC GCACTACGGC GTTGTTAACC
GATATGAATC AGGTGCTGGC TTCAAGCGCC GGTAACGCGG TGGAAGTGCG TGAAGCCGTG
CAGTTCCTGA CCGGTGAATA CCGCAATCCG CGCTTGTTTG ACGTCACTAT GGCGCTATGT
GTGGAGATGC TGATCTCCGG CCAGCTGGCG AAAGACGACG CCGAAGCGCG TGCCAAACTG
CAGGCGGTGC TGGATAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT GGCCGCGCAG
AAAGGGCCGA GCGATTTCGT TGAGAACTAC GATAAATACC TGCCGACCGC CATGTTGAGC
AAAGCGGTAT ATGCTGATAC CGAAGGGTTT ATCAGCGCAA TGGATACGCG TGCGCTGGGG
ATGGCGGTCG TCTCGATGGG CGGTGGGCGT CGTCAGGCGT CTGATACCAT TGATTACAGC
GTTGGCTTTA CCGACATGGC CCGTCTGGGC GACAGCATCG ACGGGCAGCG CCCGCTGGCG
GTGATTCATG CCAAAGACGA AACCAGTTGG CAGGAAGCGG CGAAGGCCGT CAAAGCGGCA
ATTATCCTTG ACGATAAAGC GCCAGCAAGC ACACCTTCGG TCTATCGTCG AATTACTGAA
TAG
 
Protein sequence
MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT 
MAMRDSGSVL DWKSLNLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYVP MISGRGLGHT
GGTLDKLEAI PGFDIFPDDN RFREIIQDVG VAIIGQTSSL APADKRFYAT RDITATVDSI
PLITGSILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT
DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGQLA KDDAEARAKL
QAVLDNGKAA EVFGRMVAAQ KGPSDFVENY DKYLPTAMLS KAVYADTEGF ISAMDTRALG
MAVVSMGGGR RQASDTIDYS VGFTDMARLG DSIDGQRPLA VIHAKDETSW QEAAKAVKAA
IILDDKAPAS TPSVYRRITE
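
The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted start codon and an initiating codon is read as methionine. The following is a minimal sketch of that rule, not the tool the database used: the `translate` function and its constants are hypothetical names, and the amino acid assignments are the standard ones that table 11 shares with the standard code.

```python
from itertools import product

# Standard NCBI codon ordering: bases T, C, A, G with the third position
# varying fastest. Table 11 uses the same amino acid assignments as the
# standard code and differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons of translation table 11; an initiating codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence that begins at its start codon (hypothetical helper)."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGTTTCTCG CACAAGAAAT TATTCGTAAA"))  # MFLAQEIIRK
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence should reproduce all 440 residues, stopping at the terminal TAG.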