Gene Information |
Locus tag | Rleg2_4177 |
Symbol | deoA |
ID | 6982950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4352599 |
End bp | 4353906 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398908 |
Product | thymidine phosphorylase |
Protein accession | YP_002283665 |
Protein GI | 209551748 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase; [TIGR02644] pyrimidine-nucleoside phosphorylase |
| |
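The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but the reported lengths fit it) and that the CDS ends in a single stop codon. The function names `span_length` and `protein_length` are illustrative only and do not come from the source database.

```python
# Values copied from the Gene Information table.
START_BP = 4352599
END_BP = 4353906
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under a 0-based or half-open coordinate convention the first check would be off by one, so the inclusive assumption is the one the reported 1308 bp supports.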
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCGC AGGAGATCAT CCGGCGCAAA CGCGATGGCG ACGAACTCGA CCCCGCCGAT ATCCGGTCTT TCATCGCAGC CCTCGCGGCC GGCCAGCTGT CGGAAGGCCA GATCGGCGCC TTCGCCATGG CCGTCTGGTT CAAGGGCATG TCACGCGAGG AAATCGTTGC TCTGACGCTG GCGATGGCCG ATTCCGGCGA TCGGCTGCAA TGGACGGATA TCGACCGCCC GATCGCCGAC AAACATTCGA CCGGCGGTGT CGGCGACAAT GTTTCGCTGA TGCTGGCGCC GATTGCTGCC GCCTGCGGCC TCGCCGTTCC GATGATATCG GGGCGCGGCC TCGGTCATAC CGGCGGCACG CTCGATAAGC TCGAATCCAT TCCCGGCTAT AGGATCACCC CGGATGCCGA CCTGTTCCAC ACAGTCGTGA AGCAGGTGGG ATGCGCCATT ATCGGTCAGA CCGGCGCCCT GGCGCCGGCC GATGGGAGGC TCTATGCCGT GCGCGATGTG ACCGCAACCG TCGATTCCAT TCCGCTCATC ACCGCCTCCA TCCTGTCGAA GAAACTTGCG GCCGGGCTTC AGACGCTGGT GCTCGACGTC AAGGTCGGCA ACGGCGCCTT CATGGCCGAC CGTGCCCAGG CGGAGATCCT GGCGCAGTCG CTGGTCGAGG TTGCCAATGG GGCAGGGGTG AAGACCTCGG CGCTGATCAC CGACATGAAC CAGCCGCTCG CCGATGCGGC CGGCAATGCC GTTGAAATGC GCAATTGCCT GGATTTCCTG GCGGGTGGGA AGGCGGACAC GCGGCTCGAG GCTGTCGTTC TTGCCTTTGC CGCCGAGATG CTGGTGAAGT CGGGTATTGC CGCGTCCTCT GAAGAAGGCG AGGGGATGGC GCGTCGAGCG CTGTCATCGG GAAAGGCAGC TGAGGTTTTC GCGCGCATGG TATCCATGCT TGGCGGCCCG GCCGATCTCA TGGAAAACCC CGACAGGTAT CTCGCCAGGG CGGCTGTGGA AAAACCTGTC CTTGCCGCCC GGTCAGGCTG GCTTGCCGCA TGCGACGCGC GCGGCATCGG CGTCAGCGTC ATCGATCTTG GCGGCGGCAG ACGCCATCCA GCTGACCGGA TCGATCACCG CGTCGGCTTC TCAGAATTGC TGCCGCTTGG CAGCCGGGTC AGTGCGGGCG AGCCGATCGC GCTGGTTCAT GCCGCCGACG ACGCGGCCGC GGAAAGGGCA GCGGCCGCCC TTGCCGCGCA CTACCGTATC ACCGAAGACA GTCCGAAGCT GACGCCGGTG ATTTCGAGCC GGATCTGA
Protein sequence | MIPQEIIRRK RDGDELDPAD IRSFIAALAA GQLSEGQIGA FAMAVWFKGM SREEIVALTL AMADSGDRLQ WTDIDRPIAD KHSTGGVGDN VSLMLAPIAA ACGLAVPMIS GRGLGHTGGT LDKLESIPGY RITPDADLFH TVVKQVGCAI IGQTGALAPA DGRLYAVRDV TATVDSIPLI TASILSKKLA AGLQTLVLDV KVGNGAFMAD RAQAEILAQS LVEVANGAGV KTSALITDMN QPLADAAGNA VEMRNCLDFL AGGKADTRLE AVVLAFAAEM LVKSGIAASS EEGEGMARRA LSSGKAAEVF ARMVSMLGGP ADLMENPDRY LARAAVEKPV LAARSGWLAA CDARGIGVSV IDLGGGRRHP ADRIDHRVGF SELLPLGSRV SAGEPIALVH AADDAAAERA AAALAAHYRI TEDSPKLTPV ISSRI
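The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the spacing, compute GC content, and translate the gene. It assumes the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two differ in permitted start codons, not handled here). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 nucleotides of the gene are used as a demonstration input.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same assignments.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(raw):
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence, copied from the record.
    head = clean_sequence("ATGATCCCGC AGGAGATCAT CCGGCGCAAA")
    print(translate(head))         # MIPQEIIRRK
    print(gc_fraction(head[:10]))  # 0.6
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence through the same functions would allow the reported 65% GC content and the 435 aa protein to be checked against the record.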