Gene Shewmr4_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1038 
SymboldeoA 
ID4251111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1211341 
End bp1212672 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content50% 
IMG OID638117611 
Productthymidine phosphorylase 
Protein accessionYP_733175 
Protein GI113969382 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.101588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000837764 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTCTAG CTCAGGAAAT TATACGTAAG AAACGCAATG GGTTAGCCTT AAGCGCCGAA 
GAGATCCAGT TCTTCGTTAA GGGTATAACC ACTAATGCAG TGTCGGAAGG TCAGATCGCC
GCATTAGGCA TGGCTGTGTA TTTTAATGAC ATGAATATGG ATGAAAGAAT CGCTTTGACC
ACGGCAATGC GCGATTCTGG CACTGTACTC AATTGGCAAT CACTTGGTCT TAATGGCCCA
GTCATCGATA AACACAGTAC TGGTGGTGTC GGTGATGTGA TTAGTCTCAT GCTCGGCCCC
ATGGCTGCGG CTTGCGGTGG TTATGTGCCG ATGATTTCGG GTCGCGGACT CGGACACACA
GGCGGTACGC TCGATAAGTT CGACGCTATT CCCGGTTATC AAACCGAACC TTCGAGTGAA
TTGTTCCGCA AAGTAGTTAA AGACGTTGGT GTGGCGATTA TCGGCCAAAC TGGCGATCTG
GTTCCCGCCG ATAAACGTTT TTATTCCATC CGTGACAACA CTGCGACCGT TGAATCCATC
TCCCTCATTA CCGCCTCTAT TCTCTCTAAG AAATTAGCTT GTAGTCTCGA TGCATTGGCG
ATGGACGTCA AAGTCGGTAG CGGCGCATTT ATGCCAACCT ACGAAGCCTC TGAAGAGCTT
GCACGCAGTA TTGCGGCGGT AGCTAATGGC GCAGGCACTA AAACGACGGC CTTACTCACC
GACATGAACC AAGTATTGGC CTCATGTGCG GGTAATGCGG TTGAAGTGAA AGAAGCCATC
GACTTCCTAA CTGGTGCTTA TCGTAATCCT CGTCTGTACG CTGTGACTAT GGGGCTTTGT
GCCGAGATGT TACTCCTAGG CGGCCTCGCC ACCGATGAAG CGGATGCCCG TGCCAAGTTA
AATCGAGTAT TAGATAACGG CCGCGCTGCC GAGATCTTTG GCAAGATGGT TTCAGGCCTC
GGTGGCCCAG TCGATTTTGT TGAAAATTAC AGTAAGTACT TACCGCAATC GCAAATTATT
CGCCCTGTCT TTGCGGATAC CCAAGGTTAT GCCCACAGCA TGGACACCCG TGAACTCGGT
TTAGCCGTGG TTACCTTAGG TGGTGGTCGT CGCAAGCCTG GTGATGCACT CGACTACAGT
GTTGGTCTGA CGCAAGTCTG TGCCCTTGGC GATAAGATTG ATGCTTCAAC GCCGATTGCC
GTGATCCACG CGCAATCTGA AGATGCCTTT GCTCAGGCGG AAGAAGCCGT GAAAAAAGCG
ATTCGTATTG ATGAAGTCGC TCCAGAAAAA ACACCTGAGA TCTATGCTTA TATCCGAGCA
GCGGATCTTT AA
 
Protein sequence
MFLAQEIIRK KRNGLALSAE EIQFFVKGIT TNAVSEGQIA ALGMAVYFND MNMDERIALT 
TAMRDSGTVL NWQSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT
GGTLDKFDAI PGYQTEPSSE LFRKVVKDVG VAIIGQTGDL VPADKRFYSI RDNTATVESI
SLITASILSK KLACSLDALA MDVKVGSGAF MPTYEASEEL ARSIAAVANG AGTKTTALLT
DMNQVLASCA GNAVEVKEAI DFLTGAYRNP RLYAVTMGLC AEMLLLGGLA TDEADARAKL
NRVLDNGRAA EIFGKMVSGL GGPVDFVENY SKYLPQSQII RPVFADTQGY AHSMDTRELG
LAVVTLGGGR RKPGDALDYS VGLTQVCALG DKIDASTPIA VIHAQSEDAF AQAEEAVKKA
IRIDEVAPEK TPEIYAYIRA ADL