Gene SeD_A4983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4983 
SymboldeoA 
ID6872012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4811862 
End bp4813184 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content56% 
IMG OID642787853 
Productthymidine phosphorylase 
Protein accessionYP_002218443 
Protein GI198242938 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.615857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones98 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTCTCG CACAAGAAAT TATTCGTAAA AAGCGTGATG GTCATGCGTT GAGTGACGAA 
GAAATTCGTT TCTTTATTAA TGGTATTCGT GACAATACTA TCTCTGAAGG GCAGATTGCC
GCCCTGGCGA TGACCATCTT CTTCCACGAT ATGACCATGC CGGAGCGTGT TTCGCTGACC
ATGGCGATGC GGGATTCCGG TACTGTCCTT GACTGGAAAA GCCTGAATCT CAATGGCCCG
ATTGTCGATA AGCATTCGAC CGGCGGCGTA GGGGACGTGA CGTCTCTGAT GTTGGGGCCA
ATGGTAGCGG CCTGCGGCGG TTATGTGCCG ATGATCTCCG GTCGCGGTCT CGGACATACC
GGCGGTACGC TCGACAAACT GGAAGCGATC CCGGGCTTCG ACATCTTCCC GGACGACAAC
CGTTTCCGCG AAATTATTCA AGACGTGGGT GTGGCGATTA TTGGGCAAAC CAGCTCGCTT
GCACCGGCGG ACAAACGTTT TTACGCCACC CGCGATATTA CCGCGACGGT GGACTCTATT
CCGCTGATCA CTGGTTCCAT CCTCGCCAAG AAACTGGCCG AAGGGCTGGA TGCGCTGGTA
ATGGACGTCA AAGTCGGCAG TGGCGCGTTT ATGCCAACCT ATGAACTTTC TGAAGCCCTT
GCTGAAGCGA TTGTTGGCGT GGCAAACGGC GCGGGAGTTC GCACTACGGC TTTGTTAACC
GATATGAACC AGGTGCTGGC TTCGAGCGCC GGTAACGCGG TGGAAGTGCG TGAAGCCGTG
CAGTTCCTGA CCGGAGAATA CCGCAATCCG CGCTTGTTTG ACGTTACCAT GGCGCTATGC
GTGGAGATGC TGATCTCCGG CCAGCTGGCG AAAGACGACG CCGAAGCGCG TGCGAAATTA
CAGGCGGTGC TGGATAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT GGCCGCGCAG
AAAGGGCCAA GCGATTTCGT TGAGAACTAC GATAAATACT TGCCGACCGC CATGTTGAGC
AAAGCGGTAT ATGCTGATAC CGAAGGGTTT ATCAGCGCAA TGGATACGCG TGCGCTGGGG
ATGGCGGTCG TCTCGATGGG CGGCGGCCGT CGTCAGGCGT CAGATACCAT TGATTACAGC
GTTGGCTTTA CCGACATGGC CCGTCTGGGC GACAGCATCG ACGGGCAGCG CCCGCTGGCG
GTGATTCATG CCAAAGACGA AGCCAGTTGG CAGGAAGCGG CGAAGGCCGT CAAAGCGGCA
ATTATCCTTG ACGATAAAGC GCCAGCAAGC ACACCTTCGG TCTATCGTCG AATTACTGAA
TAG
 
Protein sequence
MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT 
MAMRDSGTVL DWKSLNLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYVP MISGRGLGHT
GGTLDKLEAI PGFDIFPDDN RFREIIQDVG VAIIGQTSSL APADKRFYAT RDITATVDSI
PLITGSILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT
DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGQLA KDDAEARAKL
QAVLDNGKAA EVFGRMVAAQ KGPSDFVENY DKYLPTAMLS KAVYADTEGF ISAMDTRALG
MAVVSMGGGR RQASDTIDYS VGFTDMARLG DSIDGQRPLA VIHAKDEASW QEAAKAVKAA
IILDDKAPAS TPSVYRRITE