Gene SeAg_B4890 details


Gene Information

Locus tag: SeAg_B4890
Symbol: deoA
ID: 6794683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 4763564
End bp: 4764886
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 56%
IMG OID: 642778952
Product: thymidine phosphorylase
Protein accession: YP_002149510
Protein GI: 197251400
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02643] thymidine phosphorylase
            [TIGR02644] pyrimidine-nucleoside phosphorylase
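
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4763564, 4764886)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the coordinates give 1323 bp and 440 aa, which agrees with the Gene Length and Protein Length fields.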


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00649353
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTCTCG CACAAGAAAT TATTCGTAAA AAGCGTGATG GTCATGCGTT GAGTGACGAA 
GAAATTCGTT TCTTTATCAA TGGTATTCGT GACAATACTA TCTCTGAAGG GCAGATTGCC
GCCCTGGCGA TGACCATCTT CTTCCACGAT ATGACCATGC CGGAGCGTGT TTCGCTGACC
ATGGCGATGC GGGATTCCGG TACTGTCCTT GACTGGAAAA GCCTGAATCT CAATGGCCCG
ATTGTCGATA AGCATTCGAC CGGCGGCGTA GGGGACGTGA CGTCTCTGAT GCTGGGGCCA
ATGGTAGCGG CCTGCGGCGG TTATGTGCCG ATGATCTCTG GTCGCGGCCT TGGACATACC
GGCGGTACGC TCGACAAACT GGAAGCGATC CCGGGCTTCG ATATCTTCCC GGATGACAAC
CGTTTCCGCG AAATTATTCA AGACGTGGGT GTGGCGATTA TTGGGCAGAC CAGCTCGCTT
GCACCGGCGG ACAAACGTTT TTACGCCACC CGCGATATTA CCGCGACGGT GGACTCTATT
CCGCTGATCA CCGGTTCCAT CCTCGCCAAG AAACTGGCCG AAGGACTGGA TGCGCTGGTA
ATGGACGTGA AAGTCGGCAG CGGTGCGTTT ATGCCAACCT ATGAACTTTC TGAAGCCCTT
GCTGAAGCGA TTGTCGGCGT GGCAAACGGC GCGGGAGTTC GCACTACGGC GTTGTTAACC
GATATGAACC AGGTGCTGGC TTCGAGCGCC GGTAACGCGG TGGAAGTGCG TGAAGCCGTG
CAGTTCCTGA CCGGTGAATA CCGCAATCCG CGCTTGTTTG ATGTCACTAT GGCGCTATGC
GTGGAGATGC TGATCTCCGG CCAGCTGGCG AAAGACGACG CCGAAGCGCG TGCGAAATTA
CAGGCGGTGC TGGATAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT GGCCGCGCAG
AAAGGGCCAA GCGATTTCGT TGAGAACTAC GATAAATACT TGCCGACCGC CATGTTGAGC
AAAGCGGTAT ATGCTGATAC CGAAGGGTTT ATCAGCGCAA TGGATACGCG TGCGCTGGGG
ATGGCGGTCG TCTCGATGGG CGGCGGCCGT CGTCAGGCGT CAGATACCAT TGATTACAGC
GTTGGCTTTA CCGACATGGC CCGTCTGGGC GACAGCATCG ACGGGCAGCG CCCGCTGGCG
GTGATTCATG CCAAAGACGA AACCAGTTGG CAGGAAGCGG CGAAGGCCGT CAAAGCGGCA
ATTATCCTTG ACGATAAAGC GCCAGCAAGC ACACCTTCGG TCTATCGTCG AATTACTGAA
TAG
 
Protein sequence
MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT 
MAMRDSGTVL DWKSLNLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYVP MISGRGLGHT
GGTLDKLEAI PGFDIFPDDN RFREIIQDVG VAIIGQTSSL APADKRFYAT RDITATVDSI
PLITGSILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT
DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGQLA KDDAEARAKL
QAVLDNGKAA EVFGRMVAAQ KGPSDFVENY DKYLPTAMLS KAVYADTEGF ISAMDTRALG
MAVVSMGGGR RQASDTIDYS VGFTDMARLG DSIDGQRPLA VIHAKDETSW QEAAKAVKAA
IILDDKAPAS TPSVYRRITE
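
The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The following is a minimal sketch of how the listed gene sequence could be translated and its GC fraction computed. The helper names (clean, translate, gc_content) are hypothetical, the codon table is built by hand rather than taken from a library, and the rule of reading an alternative start codon as M only at the first position is a simplification.

```python
import itertools

# Standard amino acid assignments in NCBI codon order (bases T, C, A, G).
# Table 11 uses the same assignments; it differs in its set of start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon at position 0 as M."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGTTTCTCG CACAAGAAAT TATTCGTAAA"))
```

Applied to the full gene sequence, the output of translate can be compared with the protein sequence listed above, and gc_content with the GC content field.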