Gene SNSL254_A4924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4924 
SymboldeoA 
ID6482244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4792543 
End bp4793865 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content56% 
IMG OID642740135 
Productthymidine phosphorylase 
Protein accessionYP_002043809 
Protein GI194443952 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.118094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTCTCG CACAAGAAAT TATTCGTAAA AAGCGTGATG GTCATGCGTT GAGTGACGAA 
GAAATTCGTT TCTTTATCAA TGGTATTCGT GACAATACTA TCTCTGAAGG GCAGATTGCC
GCCCTGGCGA TGACCATCTT CTTCCACGAT ATGACCATGC CGGAGCGTGT TTCGCTGACC
ATGGCGATGC GGGATTCCGG TACTGTCCTT GACTGGAAAA GCCTGAATCT CAATGGCCCG
ATTGTCGATA AGCATTCGAC CGGCGGCGTA GGGGACGTGA CGTCTCTGAT GTTGGGGCCA
ATGGTAGCGG CCTGCGGCGG TTATGTGCCG ATGATCTCCG GTCGCGGCCT CGGACATACC
GGCGGTACGC TCGACAAACT GGAAGCGATC CCGGGCTTCG ATATCTTCCC GGACGACAAC
CGTTTCCGCG AAATTATTCA AGACGTGGGT GTGGCGATTA TTGGGCAAAC CAGCTCGCTT
GCACCGGCGG ACAAACGTTT TTACGCCACC CGCGATATTA CCGCGACGGT GGACTCTATT
CCGCTGATCA CCGGCTCCAT CCTCGCCAAG AAACTGGCCG AAGGGCTGGA TGCGCTGGTA
ATGGACGTCA AAGTCGGCAG CGGCGCGTTT ATGCCAACCT ATGAACTTTC TGAAGCCCTT
GCTGAAGCGA TTGTCGGCGT GGCAAACGGC GCGGGAGTTC GCACTACGGC GTTGTTAACC
GATATGAACC AGGTGCTGGC TTCAAGCGCC GGTAACGCGG TGGAAGTGCG TGAAGCCGTG
CAGTTCCTGA CCGGTGAATA CCGCAATCCG CGCTTGTTTG ACGTCACTAT GGCGCTATGC
GTGGAGATGC TGATCTCCGG CCAGCTGGCG AAAGACGACG CCGAAGCGCG TGCCAAACTG
CAGGCGGTGC TGGATAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT GGCCGCGCAG
AAAGGGCCAA GCGATTTCGT TGAGAACTAC GATAAATACC TGCCGACCGC CATGTTGAGC
AAAGCGGTAT ATGCTGATAC CGAAGGGTTT ATCAGCGCAA TGGATACGCG TGCGCTGGGG
ATGGCGGTCG TCTCGATGGG CGGCGGCCGT CGTCAGGCGT CAGATACCAT TGATTACAGC
GTTGGCTTTA CCGACATGGC CCGTCTGGGC GACAGCATCG ACGGGCAGCG CCCGCTGGCG
GTGATTCATG CCAAAGACGA AGCCAGTTGG CAGGAAGCGG CGAAGGCCGT CAAAGCGGCA
ATTATCCTTG ACGATAAAGC GCCAGCAAGC ACACCTTCGG TCTATCGTCG AATTACTGAA
TAG
 
Protein sequence
MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT 
MAMRDSGTVL DWKSLNLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYVP MISGRGLGHT
GGTLDKLEAI PGFDIFPDDN RFREIIQDVG VAIIGQTSSL APADKRFYAT RDITATVDSI
PLITGSILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT
DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGQLA KDDAEARAKL
QAVLDNGKAA EVFGRMVAAQ KGPSDFVENY DKYLPTAMLS KAVYADTEGF ISAMDTRALG
MAVVSMGGGR RQASDTIDYS VGFTDMARLG DSIDGQRPLA VIHAKDEASW QEAAKAVKAA
IILDDKAPAS TPSVYRRITE