Gene Rleg_4047 details


Gene Information

Locus tag: Rleg_4047
Symbol:
ID: 8014852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4125433
End bp: 4126662
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 61%
IMG OID: 644826616
Product: radical SAM enzyme, Cfr family
Protein accession: YP_002977827
Protein GI: 241206731
COG category: [R] General function prediction only
COG ID: [COG0820] Predicted Fe-S-cluster redox enzyme
TIGRFAM ID: [TIGR00048] radical SAM enzyme, Cfr family
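
The Start bp, End bp, Gene Length, and Protein Length values listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state) and that the last codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4125433
END_BP = 4126662
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1230
print(expected_protein_length(GENE_LENGTH_BP))  # 409
```

Under these assumptions, both results agree with the listed gene length (1230 bp) and protein length (409 aa).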


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTCA TGGATGCGAT TGATGTCATC ACGTCTCAGG CGCCTCGTGC CGCTTCCGGC 
GTCGAGAAGC CGTCCCTGAT CGGGCTGTCA CGCGAGGAGA TGGGGGCGGC ACTCCGGGAA
AAGGGTGTGG CCGAGAAGCA GATCAAGATG CGCGTCTCGC AGCTCTGGAA CTGGATCTAT
GTGCGCGGGG TCTCTGACTT CGACCATATG ACGAATGTCG CCAAGGACAT GCGGGAGATG
CTGAAGCAGC ATTTCACCAT CGAACGTCCC GAGATCGTCG AGGAGCAGGT CTCCAACGAC
GGCACGCGCA AATGGCTGCT GCGCTTTCCC GCCCGCGGCG CCGGGCGTCC AGTCGAGATC
GAGGCCGTCT ACATTCCGGA AGAGGGCCGC GGCACGCTCT GCCTTTCCAG CCAGGTCGGC
TGCACGCTCA CCTGTTCCTT CTGTCATACC GGGACACAGC GTCTGGTGCG CAACCTGACG
GCGGAGGAAA TTCTTTCGCA GCTGCTGCTT GCCCGCGACC GGCTTGGGGA TTTCCCGGAC
CGTGAAGCGC CGCAGGGCAC GATCATGCCT GCCGAGGGCC GCAAGGTCAG CAACATCGTC
ATGATGGGCA TGGGTGAGCC GCTTTATAAC TTCGATGCCG TCAAACAGGC ATTGCTGATC
GCCACGGATG GTGACGGCCT GTCGCTGTCC AGGCGCCGCG TGACGCTTTC TACTTCTGGC
GTTGTGCCGG AGATCTTCCG CACCGGCGAG GAAATCGGCG TCATGCTGGC GATTTCGCTG
CATGCGGTGC GCGACGATCT GCGCGACCTT CTGGTGCCGA TCAACAAGAA GTATCCGCTG
AAGGAGCTGA TCGAAGCCTG CCGGACCTAT CCTGGCCTTT CGAACGCACG GCGCATCACC
TTCGAGTATG TGATGCTGAA GGATGTCAAC GACAGCCTGG AAGACGCCAA GGGGCTGATC
AAGCTCCTGA AAGGCGTGCC GGCGAAGATC AACCTCATTC CGTTCAATCC GTGGCCCGGC
ACCAATTACC AGTGTTCGGA CTGGGAGCAG ATCGAGAAGT TCGCCGATTT CATCAATTCG
GCAGGCTATG CCTCGCCGAT CCGCACACCC CGCGGTCGCG ACATTCTTGC CGCCTGCGGC
CAGCTGAAAT CGGAGTCGGA ACGCATGCGC AAGACCGATC GTTTGGCCTT CGAGGCGATG
ATGATCGCCA ATCACGGCGC CGACGACTGA
 
Protein sequence
MSVMDAIDVI TSQAPRAASG VEKPSLIGLS REEMGAALRE KGVAEKQIKM RVSQLWNWIY 
VRGVSDFDHM TNVAKDMREM LKQHFTIERP EIVEEQVSND GTRKWLLRFP ARGAGRPVEI
EAVYIPEEGR GTLCLSSQVG CTLTCSFCHT GTQRLVRNLT AEEILSQLLL ARDRLGDFPD
REAPQGTIMP AEGRKVSNIV MMGMGEPLYN FDAVKQALLI ATDGDGLSLS RRRVTLSTSG
VVPEIFRTGE EIGVMLAISL HAVRDDLRDL LVPINKKYPL KELIEACRTY PGLSNARRIT
FEYVMLKDVN DSLEDAKGLI KLLKGVPAKI NLIPFNPWPG TNYQCSDWEQ IEKFADFINS
AGYASPIRTP RGRDILAACG QLKSESERMR KTDRLAFEAM MIANHGADD
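
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last displayed lines of the gene sequence are used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First and last lines of the gene sequence as displayed above.
first_line = "ATGTCCGTCA TGGATGCGAT TGATGTCATC ACGTCTCAGG CGCCTCGTGC CGCTTCCGGC"
last_line = "ATGATCGCCA ATCACGGCGC CGACGACTGA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))              # 60
print(first.startswith("ATG")) # True
print(last.endswith("TGA"))    # True
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the listed GC content of 61%; that figure is not recomputed here.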