Gene Rleg2_5522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5522 
Symbol 
ID6978616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1170839 
End bp1171846 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content63% 
IMG OID643394621 
Productectoine utilization protein EutE 
Protein accessionYP_002279439 
Protein GI209547521 
COG category[R] General function prediction only 
COG ID[COG3608] Predicted deacylase 
TIGRFAM ID[TIGR02994] ectoine utilization protein EutE 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00516127 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGAGA TTGACTTGCG GCCGTCGCCG ATCAGCGCGA CGGTGGATTT CGCCGCCGAG 
GGCGTCCAGC ACGGTTTCCT GAGGCTGCCT TACAGCCGCG ACGATTCCGC CTGGGGTTCG
GTGATGATCC CGATAACGGT CGTCAGGAAC GGCAAGGGAC CGACGGCGCT GCTGACCGGC
GGCAATCATG GCGACGAGTA TGAAGGACCG ATCGCGCTTT TCGACCTTGC CCGTTCGCTG
AAGGGCGAGG AGGTGAGCGG CGCTGTCATT GTCGTGCCGG CGATGAATTA TCCGGCATTC
CTGGCGGGAA CCCGAACCTC GCCGATCGAC AGGGGCAATA TGAACCGCAG CTTTCCGGGC
CAGCCGGACG GCACGGTGAC GCAAAAGATC GCCGACTATT TCCAGCGCGT GCTCCTGCCG
ATGGCTGATC TGGTTCTCGA TTTCCATTCC GGCGGCAAGA CGCTCGATTT TCTCCCGTTC
TGCGCAGCCC ATATCCTGTC GAACAAGCAA CAGGAAGCGA AGGCTTTCGA TTTCGTCACG
GCCTTTGCCG CACCCTATTC GATGAAGATG CTGGAGATCG ATGCAGTGGG CATGTACGAC
ACTGCCGCCG AGGAGATGGG CAAGGTCTTC ATCACCACGG AACTCGGCGG CGGCGGGACG
GCTACGGCCA AGAGTGCGGC GATTGCCAAG CGCGGCACCA TGAACGTGCT GCGCCACGCC
GGGATCGTTG CGGGCGCCGC CGATATCGGT CCGACCACCT GGCTCGACAT GCCGGACGGC
CGGTGTTTTT CCTTCGCTGA GGAGGGCGGG TTGATCGAGC CCGTCATCGA TCTCGGTGAA
GCCGTCGGTA AGGATGCGGT CATCGCTCGC ATCTATCCGA CCGGGCGGAC CGGAGTGGCC
CCCCACGAGG TCCGCGCCGG CATGGATGGC ATCCTCTGCG CCCGGCATTT TCCCGGACTG
GTCAAGTCAG GCGATTGCGT CGCCGTGGTC GCGATCGTTA CCGGCTGA
 
Protein sequence
MTEIDLRPSP ISATVDFAAE GVQHGFLRLP YSRDDSAWGS VMIPITVVRN GKGPTALLTG 
GNHGDEYEGP IALFDLARSL KGEEVSGAVI VVPAMNYPAF LAGTRTSPID RGNMNRSFPG
QPDGTVTQKI ADYFQRVLLP MADLVLDFHS GGKTLDFLPF CAAHILSNKQ QEAKAFDFVT
AFAAPYSMKM LEIDAVGMYD TAAEEMGKVF ITTELGGGGT ATAKSAAIAK RGTMNVLRHA
GIVAGAADIG PTTWLDMPDG RCFSFAEEGG LIEPVIDLGE AVGKDAVIAR IYPTGRTGVA
PHEVRAGMDG ILCARHFPGL VKSGDCVAVV AIVTG