Gene Meso_3657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMeso_3657 
SymboldeoA 
ID4179534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChelativorans sp. BNC1 
KingdomBacteria 
Replicon accessionNC_008254 
Strand
Start bp3948455 
End bp3949771 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content67% 
IMG OID638069550 
Productthymidine phosphorylase 
Protein accessionYP_676190 
Protein GI110635982 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCTCC CCCAGGAAAT CATCCGCAGG AAACGCGACG GCAAGGCTCT GTCGCCGGCC 
GATATCGCTG ATTTTGCGGC CGGTCTTGCC TCCGGCGCCT GGAGCGAAGG GCAGGTGGCC
GCCCTCGCCA TGGCGGTCCT GCTCAACGGC ATGGAGCGCG ATGAAACCGT CGCGCTCACT
CTTGCAATGC GCGATTCCGG CGACGTGCTC GACTGGTTGT CACTTCCCGC CCCGGTCACC
GACAAGCATT CCACGGGCGG CGTGGGCGAT AATGTCTCGC TGATGCTGGC GCCCATCGTC
GCCGCCTGCG GCGCCTATGT GCCCATGATA TCGGGGCGCG GGCTCGGGCA TACGGGCGGC
ACGCTCGACA AGATGGACTC CATTCCCGGC TATCGCAGCC AGCCGACGCG GGAAGAGTTC
GAGCGAACCG TGCGCGCCGC CGGCTGCGCC ATCATCGGCC AGACGGGAGA CCTTGCGCCC
GCGGACCGCC GCTTCTACGC CATCAGAGAC GTGACCGGCA CGGTGGAATC CATCCCTCTC
ATCACGGCGT CGATTCTTTC CAAAAAACTT GCCGCCGGCC TGCAGTCGTT GGTCCTCGAC
GTAAAATCGG GCAACGGCGC CTTCATGGCC GAGAGGAGCG AGGCTCAGGC GCTCGCGACC
AGCCTCGTGC AGGTGGCGCA GGGCGCGGGG CTATCTGCCA GCGCGCTTAT CACGGACATG
AACGAGCCGC TCGCCACGGC CGCGGGCAAT GCTGTCGAGG TGCGAAACGC CGTCGATTTT
CTCACGGGAG AAACACGGGA TGAGCGCCTT GAGGAGGTGA CGCTGGCGCT GGCGGCCGAG
ATGCTCGTAG CCACGGGCAT CGCATCCGGT GGGGAGAATG CGTTGAAACG CGCCCGCACG
GCGCTCGAAA GCGGCGGTGC GGCCGAGCGC TTTAGCCGGA TGGTCGCCGC GCTTGGCGGC
CCCGCCGATT TCGTTGAGAA GATGGACCAA TATCTCCCTC GCGCGCCCGA AACGCGTGCT
GTGAAAGCCG AGCGGGCGGG ATTCGTCGCC AGCATAGACA CGCGGGCGCT GGGAATTGTC
GTGGTCGAAC TCGGCGGCGG CCGGCGCAGG CCGGAAGATC CAATCGACCA CGCCGTCGGC
CTCACAGCCA CATTGCCCAT CGGCAGGGAA GTCCGCCGCG ACGATCCGCT GGCCGTGGTC
CATGCCCGCA CGCCCGCCGA TGCCGAACGG GCAGCCGAAG CGGTCCGCCG CGCCTACCGC
ATCGCCGATG AGCGGCTTTC CACGCAATCG CCTGTGCTGG AGAAGATCGC GGGCTGA
 
Protein sequence
MPLPQEIIRR KRDGKALSPA DIADFAAGLA SGAWSEGQVA ALAMAVLLNG MERDETVALT 
LAMRDSGDVL DWLSLPAPVT DKHSTGGVGD NVSLMLAPIV AACGAYVPMI SGRGLGHTGG
TLDKMDSIPG YRSQPTREEF ERTVRAAGCA IIGQTGDLAP ADRRFYAIRD VTGTVESIPL
ITASILSKKL AAGLQSLVLD VKSGNGAFMA ERSEAQALAT SLVQVAQGAG LSASALITDM
NEPLATAAGN AVEVRNAVDF LTGETRDERL EEVTLALAAE MLVATGIASG GENALKRART
ALESGGAAER FSRMVAALGG PADFVEKMDQ YLPRAPETRA VKAERAGFVA SIDTRALGIV
VVELGGGRRR PEDPIDHAVG LTATLPIGRE VRRDDPLAVV HARTPADAER AAEAVRRAYR
IADERLSTQS PVLEKIAG