Gene Rleg2_4177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4177 
SymboldeoA 
ID6982950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4352599 
End bp4353906 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content65% 
IMG OID643398908 
Productthymidine phosphorylase 
Protein accessionYP_002283665 
Protein GI209551748 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCGC AGGAGATCAT CCGGCGCAAA CGCGATGGCG ACGAACTCGA CCCCGCCGAT 
ATCCGGTCTT TCATCGCAGC CCTCGCGGCC GGCCAGCTGT CGGAAGGCCA GATCGGCGCC
TTCGCCATGG CCGTCTGGTT CAAGGGCATG TCACGCGAGG AAATCGTTGC TCTGACGCTG
GCGATGGCCG ATTCCGGCGA TCGGCTGCAA TGGACGGATA TCGACCGCCC GATCGCCGAC
AAACATTCGA CCGGCGGTGT CGGCGACAAT GTTTCGCTGA TGCTGGCGCC GATTGCTGCC
GCCTGCGGCC TCGCCGTTCC GATGATATCG GGGCGCGGCC TCGGTCATAC CGGCGGCACG
CTCGATAAGC TCGAATCCAT TCCCGGCTAT AGGATCACCC CGGATGCCGA CCTGTTCCAC
ACAGTCGTGA AGCAGGTGGG ATGCGCCATT ATCGGTCAGA CCGGCGCCCT GGCGCCGGCC
GATGGGAGGC TCTATGCCGT GCGCGATGTG ACCGCAACCG TCGATTCCAT TCCGCTCATC
ACCGCCTCCA TCCTGTCGAA GAAACTTGCG GCCGGGCTTC AGACGCTGGT GCTCGACGTC
AAGGTCGGCA ACGGCGCCTT CATGGCCGAC CGTGCCCAGG CGGAGATCCT GGCGCAGTCG
CTGGTCGAGG TTGCCAATGG GGCAGGGGTG AAGACCTCGG CGCTGATCAC CGACATGAAC
CAGCCGCTCG CCGATGCGGC CGGCAATGCC GTTGAAATGC GCAATTGCCT GGATTTCCTG
GCGGGTGGGA AGGCGGACAC GCGGCTCGAG GCTGTCGTTC TTGCCTTTGC CGCCGAGATG
CTGGTGAAGT CGGGTATTGC CGCGTCCTCT GAAGAAGGCG AGGGGATGGC GCGTCGAGCG
CTGTCATCGG GAAAGGCAGC TGAGGTTTTC GCGCGCATGG TATCCATGCT TGGCGGCCCG
GCCGATCTCA TGGAAAACCC CGACAGGTAT CTCGCCAGGG CGGCTGTGGA AAAACCTGTC
CTTGCCGCCC GGTCAGGCTG GCTTGCCGCA TGCGACGCGC GCGGCATCGG CGTCAGCGTC
ATCGATCTTG GCGGCGGCAG ACGCCATCCA GCTGACCGGA TCGATCACCG CGTCGGCTTC
TCAGAATTGC TGCCGCTTGG CAGCCGGGTC AGTGCGGGCG AGCCGATCGC GCTGGTTCAT
GCCGCCGACG ACGCGGCCGC GGAAAGGGCA GCGGCCGCCC TTGCCGCGCA CTACCGTATC
ACCGAAGACA GTCCGAAGCT GACGCCGGTG ATTTCGAGCC GGATCTGA
 
Protein sequence
MIPQEIIRRK RDGDELDPAD IRSFIAALAA GQLSEGQIGA FAMAVWFKGM SREEIVALTL 
AMADSGDRLQ WTDIDRPIAD KHSTGGVGDN VSLMLAPIAA ACGLAVPMIS GRGLGHTGGT
LDKLESIPGY RITPDADLFH TVVKQVGCAI IGQTGALAPA DGRLYAVRDV TATVDSIPLI
TASILSKKLA AGLQTLVLDV KVGNGAFMAD RAQAEILAQS LVEVANGAGV KTSALITDMN
QPLADAAGNA VEMRNCLDFL AGGKADTRLE AVVLAFAAEM LVKSGIAASS EEGEGMARRA
LSSGKAAEVF ARMVSMLGGP ADLMENPDRY LARAAVEKPV LAARSGWLAA CDARGIGVSV
IDLGGGRRHP ADRIDHRVGF SELLPLGSRV SAGEPIALVH AADDAAAERA AAALAAHYRI
TEDSPKLTPV ISSRI