Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1571 |
Symbol | deoA |
ID | 4078380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1678773 |
End bp | 1680077 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638006884 |
Product | thymidine phosphorylase |
Protein accession | YP_613566 |
Protein GI | 99081412 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCGC GCGCGGTCAT TGCCCGGTTA AGGCAGAAAC ACACCCCAAG CACCGAGGAG CTGCGCTGGT TTGCCGAGGG GCTTGCCAGT GGCGCCGTCA GCGACGCGCA GGCGGGCGCC TTTGCTATGG CGATCTGTCT CAACGGGTTG CCAGCCGCGG CGCGCTCTGA CCTGACACTT GCGATGCGCG ACAGCGGCGA TGTTCTGACG TGGGATCTGC CGGGGCCGGT GGTGGACAAA CACTCGACCG GTGGGGTCGG CGATTGCGTG TCGCTTCTTC TGGCGCCTGC GCTTGCGGAA TGTGGCGCTT ATGTCCCGAT GATCTCGGGA CGCGGTCTTG GGCACACCGG TGGCACGCTC GACAAGCTGG AGGCGATCCC CGGTCTAAGC ACCGAGGTCA CGCAAGATCG GCTGGCGGGA ATTGTGGCCG ACGTTGGCTG CGCCATCGTG GGGGCAACCG CGCGGATTGC CCCGGCGGAC AAGCGGCTTT ATGCGGTGCG CGATGTGACG GCCACGGTGG AGAGCCTCGA TCTGATTACA GCATCGATCC TTTCGAAAAA GCTGGCTGCC AGCCCCGAGG CATTGGTGCT GGATGTCAAA ATCGGCTCGG GCGCCTTTAT GAAAACTGTG GAGGAGGCGC GCGCTTTGGC GACCTCTCTG GTGGAAACCT CAAAGGCGGC GGGGTGTCCG ACGCAGGCGC TGATCACCGA CATGAACCAG CCGCTTGTTC CAGCCTTGGG CAATGCGCTT GAGGTTGCCG AAGTGGTGCG GGCGCTCACC GGTCAGTCGA GCGGGCAGAT CATCGAGATC ACCGTGGCAC TTGGTGGCGC GCTGTTGCAG CAGGCGGGAC TTGCCCCCAA CCAAGAGGCG GGCGAGACGC AAATTGCCGC CGCAATCGCC GAAGGTCGCG CGGCAGAGCG GTTTGCCCGA ATGATTGCCG CGCAGGGTGG TCCGTCCACA GAGCTTGAGA CATGGGCGCG CGCGCTGCCG CAAGCACCGG TCTGCGCAGA GGTCACGGCC GAGGACGCAG GCTATGTTGC GGCGATCGAC GGCGAGGCCC TTGGTCTGCT GGTGGTTCGG CTGGGCGGCG GGCGTATGGT TGAAAGCGAC CGTATCGACC CTGCGGTCGG GATCTCGGAC CTGCTGCACT TGGGGGCCAA AGTGGCCAGG GGCGATGTCA TTGCGCGCGT TCATGCCGCC CACGCAGAGG CCGCGCAAGA TGCGATCTCG GCCTTGCGGG CGGCGGTGAG GCTTGCACCT GCCGCACCCG ACCTGCCGCC GCTGTTGCAT GAGAGGATCA GCTGA
|
Protein sequence | MDARAVIARL RQKHTPSTEE LRWFAEGLAS GAVSDAQAGA FAMAICLNGL PAAARSDLTL AMRDSGDVLT WDLPGPVVDK HSTGGVGDCV SLLLAPALAE CGAYVPMISG RGLGHTGGTL DKLEAIPGLS TEVTQDRLAG IVADVGCAIV GATARIAPAD KRLYAVRDVT ATVESLDLIT ASILSKKLAA SPEALVLDVK IGSGAFMKTV EEARALATSL VETSKAAGCP TQALITDMNQ PLVPALGNAL EVAEVVRALT GQSSGQIIEI TVALGGALLQ QAGLAPNQEA GETQIAAAIA EGRAAERFAR MIAAQGGPST ELETWARALP QAPVCAEVTA EDAGYVAAID GEALGLLVVR LGGGRMVESD RIDPAVGISD LLHLGAKVAR GDVIARVHAA HAEAAQDAIS ALRAAVRLAP AAPDLPPLLH ERIS
|
| |