Gene Sama_0974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0974 
SymboldeoA 
ID4603226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1177666 
End bp1178997 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content56% 
IMG OID639780313 
Productthymidine phosphorylase 
Protein accessionYP_926851 
Protein GI119774111 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.290566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00118059 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTCTGG CACAAGAGAT TATTCGCAAA AAACGCAACG CAGAGGCCCT CAGCAAAGAA 
GAAATTCAAT TCTTTGTGAA GGGGATTACC GACAACAGCG TATCCGAAGG ACAGATTGCC
GCGCTGGGTA TGGCGGTATA TTTCAATGAC ATGACCATGG ACGAGCGTAT TGCGCTGACC
ACGGCCATGC GCGATTCAGG TACCGTGCTC AACTGGGATA GCCTGGGGTT GAACGGCCCT
GTTATTGATA AGCACAGCAC AGGCGGTGTG GGCGATGTGA TTTCGCTGAT GCTCGGCCCC
ATGGCCGCAG CCTGTGGTGG TTATGTGCCC ATGATTTCGG GCCGGGGGCT GGGTCATACC
GGTGGCACAC TGGATAAGTT TGACGCCATT CCCGGTTACC AAACCGAGCC CTCCAGCGAG
CTGTTCCGCA AAGTGGTAAA AGAAGCCGGT GTTGCCATTA TTGGCCAAAC CGGGGATCTG
GTGCCCGCCG ACAAGCGTTT CTATTCCATT CGAGACAACA CCGCCACAGT TGAGTCCATT
TCACTTATCA CCGCATCGAT TCTGTCCAAG AAGCTGGCCG CAGGCCTGGA TGCGCTGGCA
ATGGACGTCA AGGTAGGCAG CGGCGCCTTT ATGCCGACCT ACGAAGCCTC GTTGGAACTC
GCGCGCTCCA TTACCGCCGT GGCTAACGGC GCCGGCACCA AAACCACAGC GCTGCTCACC
GACATGAATC AGGTGTTGGC TTCCTGCGCC GGTAACGCGC TGGAAGTGAA AGAAGCCGTG
GATTTCCTGA CCGGAAAATA CCGTAATCCT CGCCTTTACG AAGTCACCAT GGGCCTGTGC
GCCGAGATGC TGGTGCTGGG TGGTCTGGCC GCCAATGACG CCGATGCCCG TACCAAGCTC
AACACAGTGC TGGATAACGG CCGTGCTGCC GAGATTTTTG GCAAGATGGT GTCCGGCCTG
GGCGGCCCTG CTGATTTCGT TGAAAGTTAC GATAAGTATC TGCCCAAGGC ATCCATAATA
CGCCCCGTGT ACGCAGAACG TGACGGCTTT GCCTATAGTA TGGTGACCCG TGAGCTGGGT
CTTGCCGTGG TCACTCTGGG TGGTGGCCGT CGCAAGCCCG GTGATGCACT GGATTACAGT
GTAGGCTTGT CCAACGTGTG TGCCCTTGGT CAGCCAATAA ACAAAGACAC GCCGCTTGCC
GTAATCCATG CCCAGTCTGA GGCCGCTTTT GAAGAAGCCG CCAGGGCCGT TCGTGGGGCT
ATCACTGTCA GCGACAAGCA ACCCGAAAAA ACACCTGAGA TCTATCAGTA CGTACGTGCT
GAAGATCTGT AA
 
Protein sequence
MFLAQEIIRK KRNAEALSKE EIQFFVKGIT DNSVSEGQIA ALGMAVYFND MTMDERIALT 
TAMRDSGTVL NWDSLGLNGP VIDKHSTGGV GDVISLMLGP MAAACGGYVP MISGRGLGHT
GGTLDKFDAI PGYQTEPSSE LFRKVVKEAG VAIIGQTGDL VPADKRFYSI RDNTATVESI
SLITASILSK KLAAGLDALA MDVKVGSGAF MPTYEASLEL ARSITAVANG AGTKTTALLT
DMNQVLASCA GNALEVKEAV DFLTGKYRNP RLYEVTMGLC AEMLVLGGLA ANDADARTKL
NTVLDNGRAA EIFGKMVSGL GGPADFVESY DKYLPKASII RPVYAERDGF AYSMVTRELG
LAVVTLGGGR RKPGDALDYS VGLSNVCALG QPINKDTPLA VIHAQSEAAF EEAARAVRGA
ITVSDKQPEK TPEIYQYVRA EDL