Gene Rleg_1569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1569 
Symbol 
ID8012647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1554653 
End bp1556023 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content56% 
IMG OID644824155 
ProductFRG domain protein 
Protein accessionYP_002975397 
Protein GI241204301 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0788051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.000961497 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTACAACC TGCTGATGAT GGGAAACGAC GACGAGTGGA ACGTTGCACT CGGCAGCGTG 
AGCGAGGCCT CGTTTCCTCT TTCCCGGTAC CTTGAATACA CTGAAGAGAG TATCGAGCAA
CGCCTGAAAC CCATCACGAC CGAAGTCGTG ACCTTTCTCT CGCAACTGCC AACGCTCTTC
ATGAGCGAAC TGCAGAACGA CGCGGACGGG AGACAGTTCG TCAGGATTCG TCTCGGTCAA
GTGTGGAATA TCCGCATAGT CGGCAAGGAC ATTCTGTACC TGTTCCGTAT AGATCAGCAC
CTTGGTGAAC ATGCCGTGAC CGACAGGCAG GCTTTCGAGA GAACCTTCTC CTTCGGCAAG
TGGGAGCTGA GTCGGACTCA TTGGGCGGTC AAGGATAGAG ATCTGCCCGC GACGCTCGTC
AGCGCGGGTG TAATCCAGCC ACCGGTCGCA GCCAACGACG GGGCGCCTCC GCCAGTTCCA
CCGCCGCCGA ACCCACTGTT GCCCGACGAC CAGCCAGCTC TGCCCATCAT CGAGAGCTTG
GTAGGCTTCA TGAATCATGT GCTGAACCTG CCGTCGGCGC CGGACGATGA GATCTTCTAT
CGTGGTCATT CGGACAGGCT CTATAAGCTT ATCCCGTCAC TTTTCCGAAA GAACGACGGC
GGAGATTGGC GCTACCGGCA CAAGGAGGAA ACGATCGTCC GCGAGCTTCT CACAGCCCAG
GCGACAGCGT TCTCCAGCGA CGAATACATG CTGGACAAGT TGGTGAGGAT GCAACACTAC
GGCCTGCCGA CGAGATTGCT CGATGTGACG TCGAATCCTT TGGTCGCGCT GTACTTCTGC
TGCGCGGACG ACAGGCGCGA CCACAACGGC AACGAAGTCG ACGGAGAGGT GATCGTAATG
AGGACGAAGT CGTCCGACGT CCGATTCTTC GATTCAGACA CGGTTAGCTG CGTTGCCAAC
ATATGTCTTC TGACCGACGC CGAGAAGGAA AAGATGGAAA CGTCCGGGGA GTCGATTGCA
TTCAACGAAA CGCCTGAATG CAAGAAGCTC CTACATTTCA TTCGGCGTGA GAAGCCCTAT
TTCGAAGGCC GGATCAACCC GAGCGACCTC GAGAGGATCA TGTTCGTCCG CGGCCGAAAC
ACCAACGAGC GCATCACCTC GCAGTCGGGC GCCTTTCTCC TGTTCGGCAA GGATTCGGTG
CTTCCGGAGA CGGGTTTCAG TTCCCTCGAT GTCCAACGGA TGACGATCAG AAATAAGGCC
GGCATTTTGC GTGACCTTGC CAAACTCAAT ATCAAGTCGA GCACGATCTA TCCCGGCATC
GAGAAGACGA CCGCCGAAAT TGCCAAGAAG CACGAACTCG CGGCGGGTTA A
 
Protein sequence
MYNLLMMGND DEWNVALGSV SEASFPLSRY LEYTEESIEQ RLKPITTEVV TFLSQLPTLF 
MSELQNDADG RQFVRIRLGQ VWNIRIVGKD ILYLFRIDQH LGEHAVTDRQ AFERTFSFGK
WELSRTHWAV KDRDLPATLV SAGVIQPPVA ANDGAPPPVP PPPNPLLPDD QPALPIIESL
VGFMNHVLNL PSAPDDEIFY RGHSDRLYKL IPSLFRKNDG GDWRYRHKEE TIVRELLTAQ
ATAFSSDEYM LDKLVRMQHY GLPTRLLDVT SNPLVALYFC CADDRRDHNG NEVDGEVIVM
RTKSSDVRFF DSDTVSCVAN ICLLTDAEKE KMETSGESIA FNETPECKKL LHFIRREKPY
FEGRINPSDL ERIMFVRGRN TNERITSQSG AFLLFGKDSV LPETGFSSLD VQRMTIRNKA
GILRDLAKLN IKSSTIYPGI EKTTAEIAKK HELAAG