Gene Rleg2_5291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5291 
Symbol 
ID6978385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp914537 
End bp916189 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content61% 
IMG OID643394395 
Producttransferase hexapeptide repeat containing protein 
Protein accessionYP_002279213 
Protein GI209547295 
COG category[R] General function prediction only 
COG ID[COG0110] Acetyltransferase (isoleucine patch superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.111996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGG CCAACAGTCC GCAAGCATCA GCGAGCGAAC AGAGGGAAGC GCGCAGACTG 
CAATATCTTC CCTGGGAACG TATTGCGTCG GACCTCGATC ATCCCACTCA CCTCGGCCGC
AAGGCGGAGC TTAGGCGGAC CTGCGGTGCT GAATTGGCCG ACACGTCCTA TATCGCCGAA
AATGCTGCGA TTTTTACCGA GAGCCTGACG ATGGGCGAGC GGTCGTGGAT TGCGGGGCAC
GCGCTCGTTC GGGGCAATGT GATGCTTGGT GACGACTGCA CAATCAATCC CTATGCCTGC
GTGTCCGGCA AGGTGACGTG CGGCAATGGG GTGCGGATTG CTTCACATGC CTCGGTCGTC
GGCTTCAATC ATGGATTCGA CGATCCCGAT CGGCCCATCC ACCGCCAGGG CGTCATCAGC
CTCGGCATCA CGATCGGTGA CGATGTCTGG ATCGGCGCCA ATTGCGTGAT CCTTGATGGC
GTCATTATTG GAAACGGCGC GGTGATTGCC GCCGGGGCTG TGGTCACGCA GGACATTCCC
GCTATGGCGA TCGCCGGTGG CGTACCAGCA AAGGTGCTGC GGAGCCGAGG CACGGCAACC
AGGAAATCCG GCACGGGAGA TATCGAAGAA AGATTGCTCA GGCTTGGCCA GAAGGCGAAA
GAGCAATGGC CTGACATTCT TGCACGATGG AAAACGTCGG AAGCCTATGA ATCGCTGGAG
GCTGACGGCG TCCGGAGACC GGCGATCCGG CATCTGTGTG ATGCCATCGA GATCGCGGCC
GGCTTCTGGC ATCTGCCGCC CGGGCTCGAT CCGTCGCAGA CCGTCGAGCT TCTCCAAGGC
CTTCAGGATC GGGAGACCGG GCTTTTCCCC GAAGAGCATT CACGTGTGCA TGGCCGCGCG
TTGCGGGAGG ATCCGAAGGC ACTCTACAAT GTGCTTTCGG TCGGCTATGC GCTTGAACTG
CTTGGCTCCG GGCCACGGCA GCCCATCCAG GCGGTGCAGC TAAGTGCTGA AGAGCTGGAG
GAATGGCTGA GCGGCCTGCC CTGGTCGAAC CGAGCGTGGC ACGCCGGCAG CGTGGTCGAT
GCCATCGGAA CTGCGATGTA CTTCAATGCG CGGTATTTCG GCATCAGAGG CCCACGCCAG
GCGCTTTTCG AGTGGCTGAG CCGTAATGCC AACACCGTCT CGGGCCTATG GGGCGAGCCG
ACGGCGCTGG AAGGATGGCT GCAGCCGGTG AACGGTTTTT ATCGCCTGAC GCGCGGCACC
TACGCGCAGT TCGGCGTTCC GCTTCCCCAT TCGCACGCTT CGCTCGAAAC GGTTCACCTC
AACTATCGCA ATCACAAGGG TTTCGTCGGC GCGCAATACA ATGCGTGCAA CCTACTCGAT
ACGATCCATC CGCCGCTGCT GATTGCCCGG CAAACCGATT ACCGGCGGGC CGATGGCCAG
GCGATCGCCA GCAATCTCAT ATCGAGAGCG CTCGATCGAT GGCGGGATGG CGAGGGTTTT
GCCTTCGCCG ATGGCGGTGA ACCGAGCTTG CAGGGAACGG AAATGTGGCT TTCCGTCATT
CACCTGGCGG CGGATTTCCT GGATCTGTCG GATCGGTTCG CCTTCGTCTC GAAAGGCGCG
CACCGGACGG CAACGCCCGG GCTCGGCTTG TGA
 
Protein sequence
MDQANSPQAS ASEQREARRL QYLPWERIAS DLDHPTHLGR KAELRRTCGA ELADTSYIAE 
NAAIFTESLT MGERSWIAGH ALVRGNVMLG DDCTINPYAC VSGKVTCGNG VRIASHASVV
GFNHGFDDPD RPIHRQGVIS LGITIGDDVW IGANCVILDG VIIGNGAVIA AGAVVTQDIP
AMAIAGGVPA KVLRSRGTAT RKSGTGDIEE RLLRLGQKAK EQWPDILARW KTSEAYESLE
ADGVRRPAIR HLCDAIEIAA GFWHLPPGLD PSQTVELLQG LQDRETGLFP EEHSRVHGRA
LREDPKALYN VLSVGYALEL LGSGPRQPIQ AVQLSAEELE EWLSGLPWSN RAWHAGSVVD
AIGTAMYFNA RYFGIRGPRQ ALFEWLSRNA NTVSGLWGEP TALEGWLQPV NGFYRLTRGT
YAQFGVPLPH SHASLETVHL NYRNHKGFVG AQYNACNLLD TIHPPLLIAR QTDYRRADGQ
AIASNLISRA LDRWRDGEGF AFADGGEPSL QGTEMWLSVI HLAADFLDLS DRFAFVSKGA
HRTATPGLGL