Gene Rleg_5241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5241 
Symbol 
ID8007415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp652136 
End bp653128 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content65% 
IMG OID644822149 
Productectoine utilization protein EutC 
Protein accessionYP_002973409 
Protein GI241113574 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2423] Predicted ornithine cyclodeaminase, mu-crystallin homolog 
TIGRFAM ID[TIGR02992] ectoine utilization protein EutC 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.326235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGGA TGATCATTCT GACGGAAGCG GAACTGCGGA AAGTCATCGC GCTTGATCGC 
GATGCGGTTG ATTGCGTCGA GGCCGCTTTC GCAGCGCTTG CGACCAAGGC TGTCGCCATG
CCGCCGATCC TGCGGCTCGA CATTCCGGAA TATCGGGGCG AAGTCGACGT AAAGACCGCC
TATGTGCCCG GCATCGAGGG CTTCGCAATC AAGATCAGCC CCGGCTTCTT CGACAACCCC
AAGATCGGCC TGCCGAGCAC CAACGGCATG ATGGTGCTGC TGTCGAGCCG AACCGGACTG
GTGCAGGCGC TGCTCTTGGA CAACGGCTAT CTCACCGACG TGCGCACCGC AGCGGCCGGC
GCCGTCGCGG CAAAACATCT GTCGCGGGAA AATGCGTCCG TGGCCGCGAT CTTCGGCGCC
GGCATGCAGG CGCGGCTGCA GCTCGAGGCA CTGACGCTGG TCCGGCCGAT CCGCGAAGCG
AGGATATGGG CGCGCGATTC TGCCAAGGCG CAAAGCGTGG CAGCGGAACT GGCCGCAAAG
CTCGGCTTTT CCGTCACCGC CACACCGGAC GCCAGAGGCG CAGTGACCGG CGCCGATCTC
ATCGTTACCA CCACGCCTTC CGAAACCCCG ATCATCGAGG CCGGGTGGCT GGAACCCGGA
CAGCATCTGA CGGCCATGGG CTCGGACACC GAACACAAGA ACGAGATCGA TCCGGCCGCC
ATTGCGGTTG CTGACCTCTA CGTCGCCGAC AGCCTGAAGC AGACGCGCCG TCTCGGCGAG
TTGCATCACG CAATCGATGG CGGCCTGGTC GCAGATGACG CGATCTTTGC CGAGCTCGGC
CAGATCGTTG CCGGCCGGAC GCGGGGACGG ACGCGCAACG ACCAGATCAC CATTGCGGAC
CTGACCGGAA CCGGCATCCA GGACACCGCC ATCGCCACGC TCGCCTTTAC CCGCGCCGGC
GCGGCCAATG CCGGGACCAC ATTCGAAAGC TGA
 
Protein sequence
MSRMIILTEA ELRKVIALDR DAVDCVEAAF AALATKAVAM PPILRLDIPE YRGEVDVKTA 
YVPGIEGFAI KISPGFFDNP KIGLPSTNGM MVLLSSRTGL VQALLLDNGY LTDVRTAAAG
AVAAKHLSRE NASVAAIFGA GMQARLQLEA LTLVRPIREA RIWARDSAKA QSVAAELAAK
LGFSVTATPD ARGAVTGADL IVTTTPSETP IIEAGWLEPG QHLTAMGSDT EHKNEIDPAA
IAVADLYVAD SLKQTRRLGE LHHAIDGGLV ADDAIFAELG QIVAGRTRGR TRNDQITIAD
LTGTGIQDTA IATLAFTRAG AANAGTTFES