Gene Rleg_5242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5242 
Symbol 
ID8007416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp653161 
End bp654342 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content62% 
IMG OID644822150 
Productectoine utilization protein EutD 
Protein accessionYP_002973410 
Protein GI241113575 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID[TIGR02993] ectoine utilization protein EutD 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.490883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGC CCAAACTCAA ATTCTCGCTC GGCGAATATG CCGCGCGGCT GGAAAAGACA 
CGGCGTGCCA TGGAGGCGAA GGGTGTCGAC CTGCTGATCG TCAGCGATCC GTCGAATATG
GCCTGGCTGA CCGGCTATGA CGGCTGGTCC TTCTACGTGC ACCAGGCGGT GATCGTGCCG
CCGCAGGGCG AGCCGATCTG GTTCGGCCGC GGCCAGGATG CCAACGGCGC CAAATTCACT
GCCTATCTGA AGCACGACAA CATCGTCGGT TATCCCGATC ACTACGTGCA GTCGACCGAG
CGCCACCCGA TGGACTACCT CTCGGGCATC CTGACCGAGC GCGGCTTCGG CAAGCTGACG
ATCGGTGTCG AGATGGACAA TTACTGGTTT TCGGCGGCGG CCTTTGCGGC GCTGCAAAAA
CATTTGCCGA ACGCGCGCTT TGTCGACGCG ACCGCCCTCG TCAACTGGCA GCGAGCCGTC
AAGAGCGACA CCGAGATCGG CTATATGCGC AATGCCGCCC GAATCGTCGA GGCGATGCAC
GCCCGCATCT TCGACAAGAT CGAAGTCGGC ATGCGCAAGT GCGATCTGGT CGCGGAAATC
TATGATGCCG GCACCCGCGG CGTCGACGGC ATCGGCGGTG ATTATCCGGC GATCGTGCCG
CTGCTGCCGT CCGGCGTCGA GGCATCGGCA CCGCACCTGA CCTGGGACGA CCGGCCGCTG
AAGAAGGGCG AGGGCACCTT CTTCGAGATC GCCGGCTGCT ACAACCGCTA TCACCTGCCG
CTGTCGCGCA CCGTCTTCCT CGGCAAGCCG ACGCAGGCCT TTCTTGATGC CGAAAAGGCG
ACGCTGGAAG GCATGGAAGC CGGTCTTGCA GTCGCCAGAC CCGGCAATAC CTGCGAGGAT
ATTGCCAACG CCTTCTTCGC GGTGCTGAAG AAATACGGGA TCGTCAAGGA TAACCGCACC
GGTTATCCGA TCGGCCTTTC CTATCCGCCG GACTGGGGCG AGCGCACTAT GAGCCTGCGA
CCGGGCGATC GGACGGAGCT GAAGCCCGGC ATGACCTTCC ATTTCATGAC CGGTCTCTGG
CTCGACGACA TGGGTTTCGA AACGACCGAG AGCATCCTCA TCACCGACAG CGGCGTCGAG
TGCTTCGCCA AAGTGCCGCG CAGGCTGATG GTCAAGGATT GA
 
Protein sequence
MTKPKLKFSL GEYAARLEKT RRAMEAKGVD LLIVSDPSNM AWLTGYDGWS FYVHQAVIVP 
PQGEPIWFGR GQDANGAKFT AYLKHDNIVG YPDHYVQSTE RHPMDYLSGI LTERGFGKLT
IGVEMDNYWF SAAAFAALQK HLPNARFVDA TALVNWQRAV KSDTEIGYMR NAARIVEAMH
ARIFDKIEVG MRKCDLVAEI YDAGTRGVDG IGGDYPAIVP LLPSGVEASA PHLTWDDRPL
KKGEGTFFEI AGCYNRYHLP LSRTVFLGKP TQAFLDAEKA TLEGMEAGLA VARPGNTCED
IANAFFAVLK KYGIVKDNRT GYPIGLSYPP DWGERTMSLR PGDRTELKPG MTFHFMTGLW
LDDMGFETTE SILITDSGVE CFAKVPRRLM VKD