Gene Rleg_4101 details


Gene Information

Locus tag: Rleg_4101
Symbol:
ID: 8015889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4174848
End bp: 4176290
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 64%
IMG OID: 644826671
Product: Microcystin LR degradation protein MlrC-like protein
Protein accession: YP_002977881
Protein GI: 241206785
COG category: [S] Function unknown
COG ID: [COG5476] Uncharacterized conserved protein
TIGRFAM ID:
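
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (neither assumption is stated in the record itself; the variable names are illustrative):

```python
# Coordinates as listed in the record.
start_bp = 4174848
end_bp = 4176290

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1443
print(protein_length)  # 480
```

Both values agree with the Gene Length and Protein Length fields.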


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATCG CCGTCGGTGG CATTCATATC GAATGCAGCA CATACAACCC CGTCCTGAAC 
GAGGAGAAGG ATTTCCGCGT GCTGCGCGGC GCGGCGCTGC TGGAAGCGCC GTATTTCGCC
TTCCTCAGGG ATTACGCTGC CGAATTTCTG CCGACGATCC ATGCCCGCGC CATCGCCGGC
GGCCCGGTTT CGCGTGCCAC CTACGAGGCC TTCAAGGGTG AATTCCTCGA GCGGCTGAAG
CCGATGCTGC CGCTTGACGG TCTCTATCTC GCCATGCACG GCGCCATGTA TGTCGAGGGC
ATGGAGGATG CGGAAGGCGA CTGGATCAGC GCGGCCCGTG CGCTGGTCGG CAAGGATTGC
ACGGTCTCGG CAAGCTACGA TCTGCACGGC AACGTCACGC AACGCATCAT CGATGCGCTC
GATATCTATT CCACCTACCG CACTGCGCCG CATATTGATG TCGAAGAGAC GATGCGCCGC
TCCGTTTCCA TGCTGGTAAA GAGCCTGAAA ACCGGCGAGA GGCCGGTCGT GCTCTGGGTG
CCGATCCCGG TCGTGCTGCC CGGCGAGCGC ACCAGCACCG TCGATGAGCC GGCAAAGAGC
CTCTATGACA TGCTGCCCGG GATCGATGCG ATCGACGGCG TCTGGGATGC ATCGCTGATG
GTCGGCTATG TCTGGGCCGA CGAACCGCGC GCCACCGCCG CCGCGATCAT GACCGGCACC
GACCGCACCG TGCTGGAGCG CGAGGCCAAA CGCCTCGCGA GGGCTTATTG GGATGCGCGC
GAAGACTTTG TCTTCGGCTG CAAGACCGGC ACGCTCGAGG AATGTGTCGA AAGGGCGATC
GCAAGCCCGA CCGCTCCTGT GGTGCTTGCC GAATCCGGCG ACAACCCGAC CGGCGGCGGC
GTTGGGGACC GGGCTGATGT GCTGGCAGAG CTGATTGCCA GGGGCGCCAC CGGCGTCGTC
TTTGCCGGCA TCGCCGACAA GGCGGCGACC GAGGCCTGTT ATGCCGCTGG CATCGGTGCG
GAACTGGAGC TCAGTGTCGG CGCCTCGCTC GACACCCAGG GTAGCAAGCC CGTTCACGGC
CGCTTCACGG TCAAGTTCCT GCATGAGACA TCAGATCCCA CAGACCGCCA GGCGGTAGTT
TCGGTCAGTG GTATCGATCT CGTGCTCTCC GCCAAGCGTC GGCCCTATCA CAACATCGTC
GACTTCACCC GGCTCGGCCT CGACCCACAC AAGGCCAGCA TCATCGTCGT CAAATCGGGC
TATCTCTCGC CGGAACTGGC GCCGATCGCC AATCCGAACC TGATGGCGCT ATCAACAGGG
GTCGTCGATC AGTTCGTCGA GCGCCTGCCG CGGCTGCGCA AGCAGCGTCC GACCTATCCT
TTCGACAAGG ATTTTGCCTT CGAGCCGCAG GTTTTTCTCT CCGCACGCTC GACGCTGGCC
TGA
 
Protein sequence
MRIAVGGIHI ECSTYNPVLN EEKDFRVLRG AALLEAPYFA FLRDYAAEFL PTIHARAIAG 
GPVSRATYEA FKGEFLERLK PMLPLDGLYL AMHGAMYVEG MEDAEGDWIS AARALVGKDC
TVSASYDLHG NVTQRIIDAL DIYSTYRTAP HIDVEETMRR SVSMLVKSLK TGERPVVLWV
PIPVVLPGER TSTVDEPAKS LYDMLPGIDA IDGVWDASLM VGYVWADEPR ATAAAIMTGT
DRTVLEREAK RLARAYWDAR EDFVFGCKTG TLEECVERAI ASPTAPVVLA ESGDNPTGGG
VGDRADVLAE LIARGATGVV FAGIADKAAT EACYAAGIGA ELELSVGASL DTQGSKPVHG
RFTVKFLHET SDPTDRQAVV SVSGIDLVLS AKRRPYHNIV DFTRLGLDPH KASIIVVKSG
YLSPELAPIA NPNLMALSTG VVDQFVERLP RLRKQRPTYP FDKDFAFEPQ VFLSARSTLA
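
The protein sequence can be checked against the gene sequence by translating it codon by codon, and the GC content field can be recomputed from the same string. The following is a rough sketch in plain Python, not the tool that produced this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon-to-amino-acid mapping is written in the usual TCAG ordering; translation table 11 is assumed to share the standard elongation codons, differing only in permitted start codons.

```python
from itertools import product

# Codon table in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same codon-to-amino-acid mapping as the
# standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_60 = "ATGCGCATCG CCGTCGGTGG CATTCATATC GAATGCAGCA CATACAACCC CGTCCTGAAC"
print(translate(first_60))  # MRIAVGGIHIECSTYNPVLN
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the whole protein and the GC content field.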