Gene SeHA_C4976 details


Gene Information

Locus tag: SeHA_C4976
Symbol: deoA
ID: 6491537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4853680
End bp: 4855002
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 56%
IMG OID: 642745019
Product: thymidine phosphorylase
Protein accession: YP_002048588
Protein GI: 194447406
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02643] thymidine phosphorylase
            [TIGR02644] pyrimidine-nucleoside phosphorylase
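
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes one terminal stop codon. Both assumptions are inferred from the numbers in the record rather than stated by it, and the function name is illustrative only.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(4853680, 4855002)
print(gene_len, protein_len)  # prints: 1323 440
```

Under these assumptions the result matches the Gene Length (1323 bp) and Protein Length (440 aa) fields.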


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.718604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 94
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTCTCG CACAAGAAAT TATTCGTAAA AAGCGTGATG GTCATGCGTT GAGTGACGAA 
GAAATTCGTT TCTTTATTAA TGGTATTCGT GACAATACTA TCTCTGAAGG GCAGATTGCC
GCCCTGGCGA TGACCATCTT CTTCCACGAT ATGACCATGC CGGAGCGTGT TTCGCTGACC
ATGGCGATGC GGGATTCCGG TACTGTCCTT GACTGGAAAA GCCTGAATCT CAATGGCCCG
ATTGTCGATA AGCATTCGAC CGGCGGCGTA GGGGACGTGA CGTCTCTGAT GTTGGGGCCA
ATGGTAGCGG CCTGCGGCGG TTATGTGCCG ATGATCTCCG GTCGCGGCCT CGGACATACC
GGCGGTACGC TCGACAAACT GGAAGCGATC CCGGGCTTCG ATATCTTCCC GGACGACAAC
CGTTTCCGCG AAATTATTCA AGACGTGGGT GTGGCGATTA TTGGGCAAAC CAGCTCGCTT
GCACCGGCGG ACAAACGTTT TTACGCCACC CGCGATATTA CCGCGACGGT GGACTCTATT
CCGCTGATCA CCGGCTCCAT CCTCGCCAAG AAACTGGCCG AAGGGCTTGA TGCGCTGGTA
ATGGACGTAA AAGTCGGCAG CGGCGCGTTT ATGCCAACCT ATGAACTTTC TAAAGCCCTT
GCTGAAGCGA TTGTCGGCGT GGCAAATGGC GCGGGAGTTC GCACTACGGC GTTGTTAACC
GATATGAACC AGGTGCTGGC TTCAAGCGCC GGTAACGCGG TGGAAGTGCG TGAAGCCGTG
CAGTTCCTGA CCGGTGAATA CCGCAATCCG CGCTTGTTTG ACGTCACTAT GGCGCTATGC
GTGGAGATGC TGATCTCCGG CCAGCTGGCG AAAGACGACG CCGAAGCGCG TGCCAAACTG
CAGGCGGTGC TGGATAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT GGCCGCGCAG
AAAGGGCCAA GCGATTTCGT TGAGAACTAC GATAAATACT TGCCGACCGC CATGTTGAGC
AAAGCGGTAT ATGCTGATAC CGAAGGGTTT ATCAGCGCAA TGGATACGCG TGCGCTGGGG
ATGGCGGTCG TCTCGATGGG CGGCGGCCGT CGTCAGGCGT CTGACACCAT TGATTACAGC
GTTGGCTTTA CCGACATGGC CCGTCTGGGC GACAGCATCG ACGGGCAGCG CCCGCTGGCG
GTGATTCATG CCAAAGACGA AGCCAGTTGG CAGGAAGCGG CGAAGGCCGT CAAAGCGGCA
ATTATCCTTG ACGATAAAGC GCCAGCAAGC ACACCTTCGG TCTATCGTCG AATTACTGAA
TAG
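
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any composition statistic is computed. The following is a minimal sketch of a GC-percentage calculation on such a listing; the function name is hypothetical, and the demo input is only the first two blocks of the sequence, not the whole gene.

```python
def gc_percent(spaced_sequence):
    """Percentage of G and C bases in a sequence written in spaced blocks."""
    # Remove spaces and line breaks, then normalise case.
    seq = "".join(spaced_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used as a short demo.
demo = "GTGTTTCTCG CACAAGAAAT"
print(round(gc_percent(demo), 1))
```

Passing the full listing to the same function gives a whole-gene value that can be compared with the GC content field in the record.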
 
Protein sequence
MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT 
MAMRDSGTVL DWKSLNLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYVP MISGRGLGHT
GGTLDKLEAI PGFDIFPDDN RFREIIQDVG VAIIGQTSSL APADKRFYAT RDITATVDSI
PLITGSILAK KLAEGLDALV MDVKVGSGAF MPTYELSKAL AEAIVGVANG AGVRTTALLT
DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGQLA KDDAEARAKL
QAVLDNGKAA EVFGRMVAAQ KGPSDFVENY DKYLPTAMLS KAVYADTEGF ISAMDTRALG
MAVVSMGGGR RQASDTIDYS VGFTDMARLG DSIDGQRPLA VIHAKDEASW QEAAKAVKAA
IILDDKAPAS TPSVYRRITE
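
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is one of several alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first three blocks of the gene sequence. The codon table is the standard assignment string, the start-codon set is the one NCBI lists for table 11, and the function and constant names are illustrative rather than taken from any library.

```python
BASES = "TCAG"
# Standard amino-acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))
# Start codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(spaced_sequence):
    """Translate a CDS given in spaced blocks, stopping at the first stop codon."""
    seq = "".join(spaced_sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate_cds("GTGTTTCTCG CACAAGAAAT TATTCGTAAA"))  # prints: MFLAQEIIRK
```

The output agrees with the first block of the protein sequence. Applied to the full gene listing, the same function can be used to check the remaining residues against the protein sequence above.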