Gene EcHS_A4617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4617 
SymboldeoA 
ID5593925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4621240 
End bp4622562 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content55% 
IMG OID640923711 
Productthymidine phosphorylase 
Protein accessionYP_001461148 
Protein GI157163830 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones66 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTTCTCG CACAAGAAAT TATTCGTAAA AAACGTGATG GTCATGCGCT AAGCGATGAA 
GAAATTCGTT TCTTTATCAA CGGTATTCGC GACAACACTA TCTCCGAAGG GCAGATTGCC
GCCCTCGCGA TGACCATTTT CTTCCACGAT ATGACAATGC CTGAGCGTGT CTCGCTGACC
ATGGCGATGC GAGATTCAGG AACCGTTCTC GACTGGAAAA GCCTGCATCT GAATGGCCCG
ATTGTTGATA AACACTCCAC CGGCGGCGTC GGCGATGTGA CTTCGCTGAT GTTGGGGCCG
ATGGTCGCAG CCTGCGGCGG CTATATTCCG ATGATCTCCG GTCGCGGCCT CGGTCATACT
GGCGGTACGC TCGACAAACT GGAATCCATC CCTGGCTTCG ACATTTTCCC GGATGACAAC
CGTTTCCGCG AAATTATTAA AGACGTCGGC GTAGCGATTA TCGGCCAGAC CAGCTCACTG
GCTCCGGCGG ATAAACGTTT CTACGCGACC CGTGATATTA CCGCAACCGT GGACTCCATC
CCGCTGATCA CTGCCTCGAT CCTGGCGAAG AAACTGGCGG AAGGTCTGGA TGCGCTGGTG
ATGGACGTGA AAGTGGGTAG CGGCGCGTTT ATGCCGACCT ACGAACTCTC TGAAGCCCTT
GCCGAAGCGA TTGTTGGCGT GGCTAACGGC GCTGGCGTGC GCACCACCGC GCTGCTCACC
GATATGAATC AGGTACTGGC CTCCAGTGCA GGTAACGCGG TTGAAGTTCG TGAAGCGGTG
CAGTTCCTGA CGGGTGAATA TCGTAACCCG CGTCTGTTTG ATGTCACGAT GGCGCTGTGC
GTGGAGATGC TTATCTCCGG CAAACTGGCG AAAGATGACG CCGAAGCGCG CGCGAAATTG
CAAGCAGTGC TGGACAACGG TAAAGCGGCA GAAGTCTTTG GTCGTATGGT AGCGGCACAA
AAAGGCCCAA CCGACTTCGT TGAGAACTAC GCGAAGTATC TGCCGACAGC GATGCTGACG
AAAGCAGTCT ATGCTGATAC CGAAGGTTTT GTCAGTGAAA TGGATACCCG CGCGCTGGGG
ATGGCAGTGG TTGCAATGGG CGGCGGTCGT CGTCAGGCAT CTGACACCAT TGATTACAGC
GTCGGCTTTA CTGATATGGC GCGTTTGGGC GACCAGGTAG ACGGTCAGCG TCCGCTGGCT
GTTATCCACG CGAAAGACGA AAACAGCTGG CAGGACGCGG CGAAAGCGGT GAAAGCGGCA
ATTAAACTTG CCGATAAAGC ACCGGAAAGC ACACCAACTG TCTATCGCCG TATCAGCGAA
TAA
 
Protein sequence
MFLAQEIIRK KRDGHALSDE EIRFFINGIR DNTISEGQIA ALAMTIFFHD MTMPERVSLT 
MAMRDSGTVL DWKSLHLNGP IVDKHSTGGV GDVTSLMLGP MVAACGGYIP MISGRGLGHT
GGTLDKLESI PGFDIFPDDN RFREIIKDVG VAIIGQTSSL APADKRFYAT RDITATVDSI
PLITASILAK KLAEGLDALV MDVKVGSGAF MPTYELSEAL AEAIVGVANG AGVRTTALLT
DMNQVLASSA GNAVEVREAV QFLTGEYRNP RLFDVTMALC VEMLISGKLA KDDAEARAKL
QAVLDNGKAA EVFGRMVAAQ KGPTDFVENY AKYLPTAMLT KAVYADTEGF VSEMDTRALG
MAVVAMGGGR RQASDTIDYS VGFTDMARLG DQVDGQRPLA VIHAKDENSW QDAAKAVKAA
IKLADKAPES TPTVYRRISE