Gene Rleg_4465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4465 
SymboldeoA 
ID8015230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4597873 
End bp4599180 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content65% 
IMG OID644827041 
Productthymidine phosphorylase 
Protein accessionYP_002978242 
Protein GI241207146 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.173869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.54283 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCGC AGGAGATCAT TCGGCGTAAG CGCGATGGCG ACGAACTCGC CGCCGCCGAT 
ATCAGCTCCT TCATCGCGGC ACTCGCTGCC GGTCGATTGT CGGAAGGCCA GATCGGCGCA
TTCGCCATGG CCGTTTGGTT CAAGGGCATG TCGCGGGCCG AAATCGTGGC CTTGACGCTG
GCGATGGCCG ATTCCGGTGA CAGGCTGCAA TGGGCCGATA TCGACCGCCC GATCGCCGAC
AAGCATTCGA CCGGCGGCGT CGGCGACAAT GTTTCGCTGA TGCTGGCGCC GATCGCCGCT
GCCTGCGGCC TCGCCGTTCC GATGATCTCC GGGCGCGGCC TCGGCCATAC CGGCGGCACG
CTGGATAAGC TCGAATCCAT TCCCGGCTAT CTGATCACCC CGGATGCTGA CCTGTTCCAC
AAGGTCGTGA AGGAGGCGGG ATGCGCCATC ATCGGCCAGA CCGGGACCTT GGCGCCCGCC
GACGGCAGGC TCTATGCCGT GCGCGACGTA ACCGCCACGG TCGATTCCAT TCCCCTCATC
ACCGCCTCGA TCCTCTCGAA GAAACTTGCG GCGGGGCTCG AGACGCTGGT GCTCGACGTC
AAGGTCGGCA ATGGCGCCTT CATGGCCGAT CGCGGCCAGG CGGAGATCCT CGCGCAGTCG
CTGGTCGAGG TGGCCAATGG TGCAGGCGTG AAGACCTCGG CCCTGATCAC CGACATGAAC
CAGCCGCTCG CCGACAGTGC CGGCAACGCG GTCGAGATGC GCAACTGCCT GGACTTCCTG
GCAGGCAGGA AAAGAGACAC GCGGCTTGAT ATCGTCGTTT TTGCCTTCGC CGCCGAGATG
CTGGTGAAAT CCGGTATCGC CGCTTCGCCT GATGAAGCTG AAGGAATGGC GCGGCGGGCC
TTGTCGTCGG GAAAGGCGGC GGAAGTCTTC GCGCGTATGG TATCGATGCT CGGCGGCCCG
GCCGATCTCA TCGAAAATCC CGACCGATAT CTAGTCAGGG CGCCTGTGGA AAAGCCTGTC
CCGGCCGCCC GGTCCGGCTG GCTTGCCGGC TGCGATGCGC GCGGTGTCGG CATCAGTGTC
ATCGACCTTG GCGGCGGAAG ACGCCATCCG GCGGCCCGGA TCGACCATCG CGTCGGCTTT
TCCGAACTCC TGCCGCTTGG CACCCGCGTA AACGCGGGCG AACCGATCGC GCTGGTTCAT
GCTGCTGACG AAGCCGCGGC GGAGCGGGCG GCTGCGGCAC TTGCCATGCA TTACCGCATC
ACCGAGGACA AGCCGGAGCT GACACCGGTG ATTGCGGGCC TGATCTGA
 
Protein sequence
MIPQEIIRRK RDGDELAAAD ISSFIAALAA GRLSEGQIGA FAMAVWFKGM SRAEIVALTL 
AMADSGDRLQ WADIDRPIAD KHSTGGVGDN VSLMLAPIAA ACGLAVPMIS GRGLGHTGGT
LDKLESIPGY LITPDADLFH KVVKEAGCAI IGQTGTLAPA DGRLYAVRDV TATVDSIPLI
TASILSKKLA AGLETLVLDV KVGNGAFMAD RGQAEILAQS LVEVANGAGV KTSALITDMN
QPLADSAGNA VEMRNCLDFL AGRKRDTRLD IVVFAFAAEM LVKSGIAASP DEAEGMARRA
LSSGKAAEVF ARMVSMLGGP ADLIENPDRY LVRAPVEKPV PAARSGWLAG CDARGVGISV
IDLGGGRRHP AARIDHRVGF SELLPLGTRV NAGEPIALVH AADEAAAERA AAALAMHYRI
TEDKPELTPV IAGLI