Gene Rleg_5243 details

Gene Information

Locus tag: Rleg_5243
Symbol:
ID: 8007417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 654350
End bp: 655357
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 64%
IMG OID: 644822151
Product: ectoine utilization protein EutE
Protein accession: YP_002973411
Protein GI: 241113576
COG category: [R] General function prediction only
COG ID: [COG3608] Predicted deacylase
TIGRFAM ID: [TIGR02994] ectoine utilization protein EutE
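
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(654350, 655357)
print(length, protein_length_aa(length))  # prints: 1008 335
```

Under these assumptions the computed values match the Gene Length (1008 bp) and Protein Length (335 aa) fields of the record.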


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.472707
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAGA TTGGCTTGCG GCCGTCGCCG ATCAGCGCGA CGGTCAATTT CGCCGCCGAG 
GGCATCCAGC ACGGCTTCCT GCGTCTGCCT TACAGCCGCG ACGATTCCGC CTGGGGATCG
GTGATGGTGC CGATAACGGT CGTCCGGAAC GGCACGGGCC CCACGGCCCT TCTGACCGGC
GGCAACCACG GCGACGAGTA TGAAGGACCG ATCGCGCTCT TCGATCTTGC CCGCACGCTG
AAGGCCGAAG AGGTGAGGGG CGCAGTCATC ATCGTGCCGG CCATGAATTA TCCGGCATTC
CAAGCGGGAA CCAGAACCTC GCCGATCGAC AGGGGCAACA TGAACCGCAG CTTCCCGGGC
CGGCCGGACG GCACGGTGAC CGAGAAGATC GCCGACTATT TCAGCCGTGT GCTTCTGCCG
ATGGCGGATC TCGTCCTCGA CTTCCATTCC GGCGGCAAGA CGCTCGATTT TCTGCCGTTC
TGCGCAGCCC ACGTCCTGCC GAACAAGCAG CAGCAGGAAA AAGCTTTCGA ATTCGTGAGA
GCGTTTGCGG CACCCTATTC GATGAAGATG CTGGAGATCG ATGCGGTCGG CATGTACGAC
ACCGCGGCCG AGGAGATGGG CAAGATCTTC ATCACCACGG AACTCGGCGG CGGCGGCACC
GCCACGGCCA AAAGTGCGGC GATTGCCAAG CGCGGCACCA CGAACGTGCT ACGCCACGCA
GGGATTGTCG CCGGCGCCGT CGATCCCGGT CCGACGACTT GGCTCAACAT GCCGGACGGC
CGCTGCTTCT CCTTCGCGGA AGAGGGAGGT CTGATCGAAC CTGTCATCGA TCTCGGTGAA
GCCGTCACTG ACGATGCGGT GATCGCGCGC ATCTATCCGA CCGGGCGGAC CGGGGTGGCG
CCGCGCGAGA TTCGCGCCGG CATGAACGGT ATTCTCTGTG CCCGGCATTT CCCGGGGCTG
GTCAAGGCTG GTGACTGTGT TGCCGTCGTG GCGATCGTCG ACGACTAA
 
Protein sequence
MTEIGLRPSP ISATVNFAAE GIQHGFLRLP YSRDDSAWGS VMVPITVVRN GTGPTALLTG 
GNHGDEYEGP IALFDLARTL KAEEVRGAVI IVPAMNYPAF QAGTRTSPID RGNMNRSFPG
RPDGTVTEKI ADYFSRVLLP MADLVLDFHS GGKTLDFLPF CAAHVLPNKQ QQEKAFEFVR
AFAAPYSMKM LEIDAVGMYD TAAEEMGKIF ITTELGGGGT ATAKSAAIAK RGTTNVLRHA
GIVAGAVDPG PTTWLNMPDG RCFSFAEEGG LIEPVIDLGE AVTDDAVIAR IYPTGRTGVA
PREIRAGMNG ILCARHFPGL VKAGDCVAVV AIVDD
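
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any analysis. The following is a minimal Python sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code, and it does not model alternative start codons. All function names are illustrative and are not part of the database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code and table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Demonstration on the first three blocks of the gene sequence only.
first_blocks = clean_sequence("ATGACAGAGA TTGGCTTGCG GCCGTCGCCG")
print(translate(first_blocks))  # prints: MTEIGLRPSP
print(round(gc_percent(first_blocks), 1))
```

The translated fragment matches the first ten residues of the protein sequence. Pasting the full gene sequence into `clean_sequence` would allow the same check against the complete record.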