Gene Shewana3_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_1042 
SymboldeoA 
ID4479387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp1218945 
End bp1220276 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content50% 
IMG OID639725585 
Productthymidine phosphorylase 
Protein accessionYP_868683 
Protein GI117919491 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.429173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000248089 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTCTAG CTCAGGAAAT TATACGTAAG AAACGCAATG GGTTAGCCTT AAGTACAGAA 
GAGATCCAGT TCTTCGTTAA GGGCATAACC ACTAATGCAG TGTCGGAAGG TCAGATCGCC
GCACTAGGCA TGGCTGTGTA TTTTAATGAC ATGAATATGG ATGAAAGAAT CGCTTTGACC
ACGGCAATGC GCGATTCTGG CACTGTACTC AACTGGCAAT CACTTGGTCT GAATGGCCCT
GTCATCGATA AACACAGTAC AGGTGGTGTC GGTGATGTGA TTAGTCTCAT GCTCGGCCCC
ATGGCTGCGG CTTGCGGTGG TTATGTGCCG ATGATTTCGG GTCGCGGACT CGGACACACA
GGCGGTACGC TCGATAAGTT CGACGCTATT CCCGGTTATC AAACCGAACC TTCGAGTGAA
TTGTTCCGCA AAGTAGTTAA AGACGTTGGT GTGGCGATTA TCGGCCAAAC TGGCGATCTG
GTTCCCGCCG ATAAACGTTT TTATTCCATC CGTGACAACA CTGCGACCGT CGAATCCATC
TCCCTCATCA CCGCCTCAAT TCTCTCTAAG AAATTAGCTT GTAGTCTCGA TGCATTGGCG
ATGGACGTCA AAGTCGGTAG CGGCGCATTT ATGCCAACTT ACGAAGCCTC TGAAGAGCTT
GCTCGCAGCA TTGCGGCGGT AGCCAATGGC GCAGGTACTA AAACGACGGC CTTACTCACC
GACATGAACC AAGTGTTAGC CTCATGTGCG GGTAATGCGG TTGAAGTGAA AGAAGCCATC
GATTTTTTAA CCGGTGCTTA CCGTAATCCT CGCCTCTACG CAGTGACTAT GGGGCTATGT
GCCGAGATGT TACTCCTGGG TGGTCTGGCG AGCGATGAAG CCGATGCCCG TGCCAAGTTG
AACCGCGTGC TAGACAACGG CCGTGCTGCC GAGATCTTTG GCAAGATGGT TTCAGGCCTC
GGTGGCCCCG TCGATTTTGT CGAAAACTAC AGTAAGTACT TACCGCAATC ACAAATTATT
CGCCCTGTCT TTGCGGATAC CCAAGGTTAT GCTTACAGCA TGGATACCCG CGAACTCGGT
TTAGCCGTGG TTACCTTAGG TGGTGGTCGT CGCAAACCTG GTGATGCACT CGACTACAGT
GTTGGTTTGA CGCAAGTCTG TGCCCTTGGC GATAAAATTG ATGCTTCTAC GCCGATTGCT
GTGATCCACG CGCAATCTGA AGAAGCCTTT GCGCAGGCAG AAGAAGCGGT GAAAAAAGCG
ATTCATATCG ATGAAGTCGC TCCAGAAAAA ACACCTGAGA TCTATGCTTA TATTCGAGCT
TCGGATCTTT AA
 
Protein sequence
MFLAQEIIRK KRNGLALSTE EIQFFVKGIT TNAVSEGQIA ALGMAVYFND MNMDERIALT 
TAMRDSGTVL NWQSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT
GGTLDKFDAI PGYQTEPSSE LFRKVVKDVG VAIIGQTGDL VPADKRFYSI RDNTATVESI
SLITASILSK KLACSLDALA MDVKVGSGAF MPTYEASEEL ARSIAAVANG AGTKTTALLT
DMNQVLASCA GNAVEVKEAI DFLTGAYRNP RLYAVTMGLC AEMLLLGGLA SDEADARAKL
NRVLDNGRAA EIFGKMVSGL GGPVDFVENY SKYLPQSQII RPVFADTQGY AYSMDTRELG
LAVVTLGGGR RKPGDALDYS VGLTQVCALG DKIDASTPIA VIHAQSEEAF AQAEEAVKKA
IHIDEVAPEK TPEIYAYIRA SDL