Gene TM1040_1571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1571 
SymboldeoA 
ID4078380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1678773 
End bp1680077 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content66% 
IMG OID638006884 
Productthymidine phosphorylase 
Protein accessionYP_613566 
Protein GI99081412 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCGC GCGCGGTCAT TGCCCGGTTA AGGCAGAAAC ACACCCCAAG CACCGAGGAG 
CTGCGCTGGT TTGCCGAGGG GCTTGCCAGT GGCGCCGTCA GCGACGCGCA GGCGGGCGCC
TTTGCTATGG CGATCTGTCT CAACGGGTTG CCAGCCGCGG CGCGCTCTGA CCTGACACTT
GCGATGCGCG ACAGCGGCGA TGTTCTGACG TGGGATCTGC CGGGGCCGGT GGTGGACAAA
CACTCGACCG GTGGGGTCGG CGATTGCGTG TCGCTTCTTC TGGCGCCTGC GCTTGCGGAA
TGTGGCGCTT ATGTCCCGAT GATCTCGGGA CGCGGTCTTG GGCACACCGG TGGCACGCTC
GACAAGCTGG AGGCGATCCC CGGTCTAAGC ACCGAGGTCA CGCAAGATCG GCTGGCGGGA
ATTGTGGCCG ACGTTGGCTG CGCCATCGTG GGGGCAACCG CGCGGATTGC CCCGGCGGAC
AAGCGGCTTT ATGCGGTGCG CGATGTGACG GCCACGGTGG AGAGCCTCGA TCTGATTACA
GCATCGATCC TTTCGAAAAA GCTGGCTGCC AGCCCCGAGG CATTGGTGCT GGATGTCAAA
ATCGGCTCGG GCGCCTTTAT GAAAACTGTG GAGGAGGCGC GCGCTTTGGC GACCTCTCTG
GTGGAAACCT CAAAGGCGGC GGGGTGTCCG ACGCAGGCGC TGATCACCGA CATGAACCAG
CCGCTTGTTC CAGCCTTGGG CAATGCGCTT GAGGTTGCCG AAGTGGTGCG GGCGCTCACC
GGTCAGTCGA GCGGGCAGAT CATCGAGATC ACCGTGGCAC TTGGTGGCGC GCTGTTGCAG
CAGGCGGGAC TTGCCCCCAA CCAAGAGGCG GGCGAGACGC AAATTGCCGC CGCAATCGCC
GAAGGTCGCG CGGCAGAGCG GTTTGCCCGA ATGATTGCCG CGCAGGGTGG TCCGTCCACA
GAGCTTGAGA CATGGGCGCG CGCGCTGCCG CAAGCACCGG TCTGCGCAGA GGTCACGGCC
GAGGACGCAG GCTATGTTGC GGCGATCGAC GGCGAGGCCC TTGGTCTGCT GGTGGTTCGG
CTGGGCGGCG GGCGTATGGT TGAAAGCGAC CGTATCGACC CTGCGGTCGG GATCTCGGAC
CTGCTGCACT TGGGGGCCAA AGTGGCCAGG GGCGATGTCA TTGCGCGCGT TCATGCCGCC
CACGCAGAGG CCGCGCAAGA TGCGATCTCG GCCTTGCGGG CGGCGGTGAG GCTTGCACCT
GCCGCACCCG ACCTGCCGCC GCTGTTGCAT GAGAGGATCA GCTGA
 
Protein sequence
MDARAVIARL RQKHTPSTEE LRWFAEGLAS GAVSDAQAGA FAMAICLNGL PAAARSDLTL 
AMRDSGDVLT WDLPGPVVDK HSTGGVGDCV SLLLAPALAE CGAYVPMISG RGLGHTGGTL
DKLEAIPGLS TEVTQDRLAG IVADVGCAIV GATARIAPAD KRLYAVRDVT ATVESLDLIT
ASILSKKLAA SPEALVLDVK IGSGAFMKTV EEARALATSL VETSKAAGCP TQALITDMNQ
PLVPALGNAL EVAEVVRALT GQSSGQIIEI TVALGGALLQ QAGLAPNQEA GETQIAAAIA
EGRAAERFAR MIAAQGGPST ELETWARALP QAPVCAEVTA EDAGYVAAID GEALGLLVVR
LGGGRMVESD RIDPAVGISD LLHLGAKVAR GDVIARVHAA HAEAAQDAIS ALRAAVRLAP
AAPDLPPLLH ERIS