Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4465 |
Symbol | deoA |
ID | 8015230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4597873 |
End bp | 4599180 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644827041 |
Product | thymidine phosphorylase |
Protein accession | YP_002978242 |
Protein GI | 241207146 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.173869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.54283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCCGC AGGAGATCAT TCGGCGTAAG CGCGATGGCG ACGAACTCGC CGCCGCCGAT ATCAGCTCCT TCATCGCGGC ACTCGCTGCC GGTCGATTGT CGGAAGGCCA GATCGGCGCA TTCGCCATGG CCGTTTGGTT CAAGGGCATG TCGCGGGCCG AAATCGTGGC CTTGACGCTG GCGATGGCCG ATTCCGGTGA CAGGCTGCAA TGGGCCGATA TCGACCGCCC GATCGCCGAC AAGCATTCGA CCGGCGGCGT CGGCGACAAT GTTTCGCTGA TGCTGGCGCC GATCGCCGCT GCCTGCGGCC TCGCCGTTCC GATGATCTCC GGGCGCGGCC TCGGCCATAC CGGCGGCACG CTGGATAAGC TCGAATCCAT TCCCGGCTAT CTGATCACCC CGGATGCTGA CCTGTTCCAC AAGGTCGTGA AGGAGGCGGG ATGCGCCATC ATCGGCCAGA CCGGGACCTT GGCGCCCGCC GACGGCAGGC TCTATGCCGT GCGCGACGTA ACCGCCACGG TCGATTCCAT TCCCCTCATC ACCGCCTCGA TCCTCTCGAA GAAACTTGCG GCGGGGCTCG AGACGCTGGT GCTCGACGTC AAGGTCGGCA ATGGCGCCTT CATGGCCGAT CGCGGCCAGG CGGAGATCCT CGCGCAGTCG CTGGTCGAGG TGGCCAATGG TGCAGGCGTG AAGACCTCGG CCCTGATCAC CGACATGAAC CAGCCGCTCG CCGACAGTGC CGGCAACGCG GTCGAGATGC GCAACTGCCT GGACTTCCTG GCAGGCAGGA AAAGAGACAC GCGGCTTGAT ATCGTCGTTT TTGCCTTCGC CGCCGAGATG CTGGTGAAAT CCGGTATCGC CGCTTCGCCT GATGAAGCTG AAGGAATGGC GCGGCGGGCC TTGTCGTCGG GAAAGGCGGC GGAAGTCTTC GCGCGTATGG TATCGATGCT CGGCGGCCCG GCCGATCTCA TCGAAAATCC CGACCGATAT CTAGTCAGGG CGCCTGTGGA AAAGCCTGTC CCGGCCGCCC GGTCCGGCTG GCTTGCCGGC TGCGATGCGC GCGGTGTCGG CATCAGTGTC ATCGACCTTG GCGGCGGAAG ACGCCATCCG GCGGCCCGGA TCGACCATCG CGTCGGCTTT TCCGAACTCC TGCCGCTTGG CACCCGCGTA AACGCGGGCG AACCGATCGC GCTGGTTCAT GCTGCTGACG AAGCCGCGGC GGAGCGGGCG GCTGCGGCAC TTGCCATGCA TTACCGCATC ACCGAGGACA AGCCGGAGCT GACACCGGTG ATTGCGGGCC TGATCTGA
|
Protein sequence | MIPQEIIRRK RDGDELAAAD ISSFIAALAA GRLSEGQIGA FAMAVWFKGM SRAEIVALTL AMADSGDRLQ WADIDRPIAD KHSTGGVGDN VSLMLAPIAA ACGLAVPMIS GRGLGHTGGT LDKLESIPGY LITPDADLFH KVVKEAGCAI IGQTGTLAPA DGRLYAVRDV TATVDSIPLI TASILSKKLA AGLETLVLDV KVGNGAFMAD RGQAEILAQS LVEVANGAGV KTSALITDMN QPLADSAGNA VEMRNCLDFL AGRKRDTRLD IVVFAFAAEM LVKSGIAASP DEAEGMARRA LSSGKAAEVF ARMVSMLGGP ADLIENPDRY LVRAPVEKPV PAARSGWLAG CDARGVGISV IDLGGGRRHP AARIDHRVGF SELLPLGTRV NAGEPIALVH AADEAAAERA AAALAMHYRI TEDKPELTPV IAGLI
|
| |