Gene Sbal223_1141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1141 
SymboldeoA 
ID7087594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp1342473 
End bp1343804 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content48% 
IMG OID643460052 
Productthymidine phosphorylase 
Protein accessionYP_002357079 
Protein GI217972328 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000198917 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTCTAG CTCAAGAGAT TATTCGTAAA AAACGTAATG GTTTAGCGCT AAGTTCCGAA 
GAAATACAGT TCTTTGTTCA AGGTATTACC ACCAACTCCG TATCTGAAGG TCAGATCGCC
GCATTAGGCA TGGCGGTGTA TTTTAATGAC ATGAATATGG ATGAAAGAAT CGCTTTAACG
ACAGCAATGC GTGATTCTGG CACTGTGCTT AATTGGCAAT CATTGGGACT CAACGGTCCA
GTTATCGACA AACACAGCAC TGGTGGTGTC GGCGATGTGA TTAGTCTCAT GCTTGGCCCT
ATGGCTGCGG CTTGTGGCGG TTATGTGCCT ATGATTTCTG GGCGCGGCCT AGGTCATACT
GGCGGTACAC TCGATAAGTT TGATGCCATT CCGGGTTACC AAACAGAGCC TTCAAGTGAA
TTGTTCCGCA AAGTGGTTAA AGATGTCGGG GTGGCGATTA TTGGCCAGAC GGGCGATCTC
GTTCCAGCCG ATAAACGCTT CTATTCTATT CGCGATAATA CCGCCACCGT TGAATCCATT
TCCCTCATCA CAGCATCGAT TTTGTCTAAG AAATTAGCCT GTAATTTAGA TGCGTTGGCG
ATGGACGTAA AAGTCGGTAG CGGCGCTTTC ATGCCAACCT ATGAGGCATC TGAAGAATTA
GCTCGCAGTA TTGCAGCTGT TGCTAATGGT GCGGGTACTA AAACGACGGC TTTACTTACC
GACATGAATC AAGTGCTTGC ATCTTGTGCA GGTAACGCGG TTGAAGTGAA AGAAGCCATC
GACTTTTTAA CGGGTGCTTA CCGTAACCCG CGTTTATATG AAGTCACTAT GGGTCTTTGT
GCTGAGATGC TGCTCCTTGG CGGTCTTGCA AGCAATGAAG CCGATGCTCG CGCTAAACTG
AATCGTGTAC TCGACAATGG TCGCGCTGCA GAACTCTTTG GCAAGATGGT GTCGGGTCTT
GGTGGTCCGG TTGATTTTGT TGAAAACTAC AGTAAATACC TGCCGCAGTC ACAAATTATT
CGTCCCGTCT TTGCCGATAT GCAAGGTTAT GCCTATAGCA TGGATACCCG TGAGTTAGGT
TTAGCGGTTG TGACCTTAGG TGGCGGCCGC CGTAAGCCCG GCGATGCACT AGACTATAGT
GTTGGCTTAA CCCAAGTGTG TGCCCTAGGC GATAAAGTGG ATTCATCGAC GCCGATTGCC
GTTATCCATG CACAATCTGA AGCCGCGTTC GCAGAAGCTG AACTTGCGGT GAAAAAAGCG
ATTCACATTG GTGAAACCGC TCCAGAAAAA ACACCTGAGA TCTATGCCTA TATTCGTGCA
TCGGATCTTT AA
 
Protein sequence
MFLAQEIIRK KRNGLALSSE EIQFFVQGIT TNSVSEGQIA ALGMAVYFND MNMDERIALT 
TAMRDSGTVL NWQSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT
GGTLDKFDAI PGYQTEPSSE LFRKVVKDVG VAIIGQTGDL VPADKRFYSI RDNTATVESI
SLITASILSK KLACNLDALA MDVKVGSGAF MPTYEASEEL ARSIAAVANG AGTKTTALLT
DMNQVLASCA GNAVEVKEAI DFLTGAYRNP RLYEVTMGLC AEMLLLGGLA SNEADARAKL
NRVLDNGRAA ELFGKMVSGL GGPVDFVENY SKYLPQSQII RPVFADMQGY AYSMDTRELG
LAVVTLGGGR RKPGDALDYS VGLTQVCALG DKVDSSTPIA VIHAQSEAAF AEAELAVKKA
IHIGETAPEK TPEIYAYIRA SDL