Gene EcSMS35_4931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4931 
SymboldeoA 
ID6145192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp5046159 
End bp5047481 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content55% 
IMG OID641619734 
Productthymidine phosphorylase 
Protein accessionYP_001746838 
Protein GI170682854 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTTCTCG CACAAGAAAT TATTCGTAAA AAACGTGATG GTCATGCGCT GAGTGATGAA 
GAAATTCGTT TCTTTATCAA CGGCATTCGC GACAACACTA TCTCCGAAGG GCAGATTGCC
GCCCTCGCGA TGACCATTTT CTTCCACGAT ATGACAATGC CTGAGCGTGT CTCGCTGACC
ATGGCGATGC GAGATTCAGG AACCGTTCTC GACTGGAAAA GCCTGCATCT GAATGGCCCG
ATTGTTGATA AACACTCGAC CGGCGGTGTC GGCGATGTGA CTTCGCTGAT GTTGGGGCCG
ATGGTCGCTG CCTGTGGCGG CTATATTCCG ATGATCTCCG GTCGCGGCCT CGGTCATACT
GGCGGTACGC TCGACAAACT GGAATCCATC CCTGGCTTCG ACATTTTCCC GGATGACAAC
CGTTTCCGCG AAATTATTAA AGACGTCGGC GTGGCGATTA TCGGTCAGAC CAGCTCACTG
GCTCCGGCGG ATAAACGTTT CTACGCGACC CGTGATATTA CCGCAACCGT GGACTCCATC
CCGCTGATCA CTGCCTCGAT CCTGGCGAAG AAACTGGCGG AAGGTCTGGA TGCGCTGGTG
ATGGACGTGA AAGTGGGTAG CGGCGCGTTT ATGCCGACCT ACGAACTTTC TGAAGCCCTT
GCCGAAGCGA TTGTTGGCGT AGCTAACGGC GCTGGCGTGC GTACCACCGC GCTGCTCACC
GATATGAATC AGGTACTGGC CTCCAGTGCA GGTAACGCGG TTGAAGTTCG TGAAGCGGTG
CAGTTCCTGA CGGGTGAGTA TCGTAACCCG CGTCTGTTTG ATGTCACAAT GGCGCTGTGC
GTAGAGATGC TTATCTCCGG CAAACTGGCG AAAGATGACG CCGAAGCGCG CGCGAAATTG
CAGGCGGTGC TGGACAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT AGCGGCACAA
AAAGGCCCGA CTGACTTCGT TGAGAACTAC GCGAAGTATC TGCCGACAGC GATGCTGACG
AAAGCAGTCT ATGCTGATAC CGAAGGGTTT GTCAGTGAAA TGGATACCCG CGCGCTGGGG
ATGGCAGTGG TTGCAATGGG CGGCGGACGC CGTCAGGCAT CTGACACCAT CGATTACAGC
GTCGGCTTTA CTGATATGGC GCGTCTGGGC GACCAGGTAG ACGGTCAGCG TCCGCTGGCA
GTTATCCACG CGAAAGACGA AAACAGCTGG CAGGAAGCGG CGAAAGCGGT GAAAGCGGCA
ATTAAACTTG CCGATAAAGC ACCGGAAAGC ACACCAACTG TCTATCGTCG TATCAGTGAA
TAA
 
Protein sequence
MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT 
MAMRDSGTVL DWKSLHLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYIP MISGRGLGHT
GGTLDKLESI PGFDIFPDDN RFREIIKDVG VAIIGQTSSL APADKRFYAT RDITATVDSI
PLITASILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT
DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGKLA KDDAEARAKL
QAVLDNGKAA EVFGRMVAAQ KGPTDFVENY AKYLPTAMLT KAVYADTEGF VSEMDTRALG
MAVVAMGGGR RQASDTIDYS VGFTDMARLG DQVDGQRPLA VIHAKDENSW QEAAKAVKAA
IKLADKAPES TPTVYRRISE