Gene Rleg_5236 details


Gene Information

Locus tag: Rleg_5236
Symbol:
ID: 8007410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 647999
End bp: 648847
Gene Length: 849 bp
Protein Length: 282 aa
Translation table: 11
GC content: 64%
IMG OID: 644822144
Product: ectoine/hydroxyectoine ABC transporter solute-binding protein
Protein accession: YP_002973404
Protein GI: 241113569
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID: [TIGR02995] ectoine/hydroxyectoine ABC transporter solute-binding protein
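
The coordinate and length fields in this record are related by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names gene_length and protein_length are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(647999, 648847)
print(length)                  # 849
print(protein_length(length))  # 282
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.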


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.897169
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.367584
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAC GATATCTTTT GAGCGCCGCC AGCCTGTCAG TGCTTCTGAT CACGGCGGCT 
TCGCCTGCCT CCGCCGCCGA TGACAAGCTC GAGCAGTTGA AGGAGCAAGG CTTTGCGCGT
ATCGCCATCG CCAACGAGCC GCCGTTCACC GCCGTCGGCG CCGACGGCAA GGTTTCCGGC
GCGGCCCCCG ATGTGGCGCG CGCAATATTC GAAAAGCTGG GCGTCAAGGA AGTGGTCGCC
TCGATCTCGG AATATGGCGC AATGATCCCC GGCCTGCAGG CCGGCCGCCA CGACGCGATC
ACCGCAGGCC TCTTCATGAA GCCCGAGCGC TGCAACGCCG TCGCCTATTC CGAACCGATC
CTTTGCGACG CCGAAGCTTT CGCGCTCAAG AAGGGCAACC CGCTGAAGCT GACGAGCTAC
AAGGACATCG CCGACAATCC GGACGCCAAG ATCGGCGCGC CGGGCGGCGG TACCGAGGAG
AAGCTGGCGC TTGAGGCCGG CGTGCCGCGC GATCGCGTCA TCGTCGTTCC GGATGGCCAG
AGCGGCATCA AGATGCTGCA GGACGGCCGC ATCGACGTCT ACTCGCTGCC GGTTCTGTCG
ATCCACGATC TGATGGCCAA GGCGAACGAT CCGAACCTCG AGACCGTCGC ACCCGTCGTC
AATGCGCCGG TCTATTGCGA TGGCGCGGCC TTCCGCAAGC AGGACGTTGC GCTCCGCGAC
GCCTTCGATG TCGAGCTGAA GAAGCTGAAG GAATCCGGCG AATTCGCCAA GATCATCGAG
CCCTACGGTT TCTCGGCCAA GGCGGCGATG TCGACGAGCC GCGAAAAGCT TTGCGCCGCG
GCGAAGTAA
 
Protein sequence
MKTRYLLSAA SLSVLLITAA SPASAADDKL EQLKEQGFAR IAIANEPPFT AVGADGKVSG 
AAPDVARAIF EKLGVKEVVA SISEYGAMIP GLQAGRHDAI TAGLFMKPER CNAVAYSEPI
LCDAEAFALK KGNPLKLTSY KDIADNPDAK IGAPGGGTEE KLALEAGVPR DRVIVVPDGQ
SGIKMLQDGR IDVYSLPVLS IHDLMAKAND PNLETVAPVV NAPVYCDGAA FRKQDVALRD
AFDVELKKLK ESGEFAKIIE PYGFSAKAAM STSREKLCAA AK
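
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any computation. As a minimal sketch, the helpers below join the blocks and compute a GC percentage of the kind reported in the GC content field. The names clean_sequence and gc_percent are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence, copied as displayed (six blocks of ten).
first_row = "ATGAAAACAC GATATCTTTT GAGCGCCGCC AGCCTGTCAG TGCTTCTGAT CACGGCGGCT"
dna = clean_sequence(first_row)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence through the same two helpers gives a value that can be compared with the reported GC content.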