Gene Rleg_4469 details

Gene Information

Locus tag: Rleg_4469
Symbol:
ID: 8015233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4601570
End bp: 4602790
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 63%
IMG OID: 644827045
Product: phosphopentomutase
Protein accession: YP_002978246
Protein GI: 241207150
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1015] Phosphopentomutase
TIGRFAM ID: [TIGR01696] phosphopentomutase
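
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4601570
END_BP = 4602790
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints: 1221 406
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed for Rleg_4469.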


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.614507
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.393425
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTG CCTTTCTTTT CGTTCTGGAT TCCTTCGGCA TTGGCGGGGC GCCGGATGCG 
GCGGCCTATG GCGACGAGGG CGCCGATACG CTCGGCCATA TCGCCGAGTT CTGCGCAGCC
GGAGCCGGAG ACCGCGCCGG ATTGCGCGAA GGGCCGCTTT CCCTGCCCAA CATGTCGGAA
CTCGGGCTCA TGCAAATCGC GCGATCCGCC TCCGGCCGAT TTCCGGCCGG CATGCCCGTC
CCGGAGAAGG TTTATGGCAT TTATGGCGCT GCGACCGAAA TCTCCCGAGG CAAGGATACG
CCGTCGGGTC ATTGGGAAAT CGCGGGAACA CCGGTCAGTT TCGATTGGGG TTATTTCCCG
ATAGAGGGCG ACGCCTTTCC TGCCGAATTC ATCGAGGCGC TATGCAGAGA GGCTGACGTG
CCCGGCATCC TCGGCAACTG CCATGCTTCG GGAACGGAGA TCATCGCCCG GCTCGGCGAG
GACCATATCC GCACCGGCAA GCCAATCTGC TACACCTCTT CGGATTCCGT CTTTCAGGTC
GCGGCGCACG AGGAGCATTT CGGCCTCGAT CGTCTGCTCG CCTTCTGCCG TTTGGCCCGG
GGGCTGCTCG ATCCCTACAA TATCGGCCGT GTCATCGCCC GGCCCTTTAT CGGCCAGTCC
GCCTCTACTT TCCAGCGCAC GGGAAACCGG CGCGACTTCT CCGTGGTGCC GCCGGAGCCG
ACGCTACTCG ACCGGCTGAT CGAGCACGGC CGGCATGTGC ATGCTGTGGG AAAGATCGAC
GACATCTTCG CGCATCAGGG CATTTCCAAG GTCATCAAGG CGAACGGAAA CGAGGCGCTG
ATGGATGCGT CCCTCGCGGC GCTCGACGAG GCTGGGGACG GCGATCTCGT TTTCACCAAT
TTCGTCGATT TCGACATGAT CTACGGTCAT CGCCGCGACG TGCCGGGTTA TGCAGCCGCA
CTCGAAGCCT TCGATGCGCG CTTGCCTGAA GTCCACAAGA AACTGAAGCC CGGCGATCTC
GTCGTGCTCA CCGCCGATCA TGGCTGCGAT CCGACCTGGC GCGGCACGGA CCATACGCGC
GAGCGTGTGC CTGTCATCGC TTATGGCCCC GGCATCCGGT CGCGTTCGAT CGGCGTGCGC
CGCAGCTATG CCGATATCGG CGAGAGCATC GCCCGGCATC TCGGCATCCC GGCCGGGCCG
CACGGAAGGA GTTTTCTGTG A
 
Protein sequence
MARAFLFVLD SFGIGGAPDA AAYGDEGADT LGHIAEFCAA GAGDRAGLRE GPLSLPNMSE 
LGLMQIARSA SGRFPAGMPV PEKVYGIYGA ATEISRGKDT PSGHWEIAGT PVSFDWGYFP
IEGDAFPAEF IEALCREADV PGILGNCHAS GTEIIARLGE DHIRTGKPIC YTSSDSVFQV
AAHEEHFGLD RLLAFCRLAR GLLDPYNIGR VIARPFIGQS ASTFQRTGNR RDFSVVPPEP
TLLDRLIEHG RHVHAVGKID DIFAHQGISK VIKANGNEAL MDASLAALDE AGDGDLVFTN
FVDFDMIYGH RRDVPGYAAA LEAFDARLPE VHKKLKPGDL VVLTADHGCD PTWRGTDHTR
ERVPVIAYGP GIRSRSIGVR RSYADIGESI ARHLGIPAGP HGRSFL
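
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be cleaned, translated, and checked for GC content. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard genetic code (the two differ in their permitted start codons, which this sketch does not model).

```python
# Standard codon-to-amino-acid assignments in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence listed in this record.
FIRST_ROW = "ATGGCGCGTG CCTTTCTTTT CGTTCTGGAT TCCTTCGGCA TTGGCGGGGC GCCGGATGCG"
LAST_ROW = "CACGGAAGGA GTTTTCTGTG A"

print(translate(clean_sequence(FIRST_ROW)))  # prints: MARAFLFVLDSFGIGGAPDA
print(translate(clean_sequence(LAST_ROW)))   # prints: HGRSFL
```

The two printed fragments correspond to the first twenty and last six residues of the protein sequence shown above. Passing the full cleaned gene sequence to gc_content would give a value to compare with the GC content field.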