Gene Rleg_4838 details

Gene Information

Locus tag: Rleg_4838
Symbol:
ID: 8007226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 213912
End bp: 215018
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 61%
IMG OID: 644821768
Product: glycine betaine/L-proline ABC transporter, ATPase subunit
Protein accession: YP_002973028
Protein GI: 241113193
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
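
The coordinates and lengths listed above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(213912, 215018)
print(length)                  # 1107
print(protein_length(length))  # 368
```

Both results agree with the Gene Length and Protein Length fields of this record.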


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.218023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.119301
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCG TAAATGCCGA TATCAATGAC GTCCTGATCG ACTGCCGATC CCTTTGGAAA 
GTCTTCGGTG ACAAGTCCGC TGCGGCAATG AAGTCGATCA AGGAGCGCGG CCTCGGCAAA
AAAGAGGTCC TGAAGGAGTT CAACTGCGTC GTCGGCGTGT CCGACGCGAG CATTGAGGTC
CGGCGCGGCG AAATCTTCTG CATCATGGGC TTGTCGGGAA GCGGAAAATC GACACTCATC
CGCCTTCTCA ACAAGCTGAT CACGCCGAGC TCGGGCAAGG TCCTCGTCAA GGGACGCGAC
CTCGCCGCCC TGTCGCCGGT CGATCTCCGG CAGATGCGCG CCAGGAACAT CGGCATGGTG
TTTCAAAGCG TCGCTCTGTT GCCGCATCGG ACGGTCCTCG AAAATGCGGC CTTCGGCCTC
GAGGTTCAGG GAATTGCCAA GCCGGAGCGA AACAAGACCG CCGTTGCAGC GCTCGAGAAG
GTCGGCCTCG CCGACTGGGT GAGCCGATAT CCGAACGAGC TTTCCGGCGG CATGCAGCAA
CGCGTCGGGC TGGCCCGCGC GCTTGCTTCC GACCCCGAGA TCATCCTGAT GGATGAGCCG
TTCAGCGCTC TTGATCCACT CATACGGCGT CAACTTCAGG ACGAATTCCG GCAGCTGACG
AAAGCCTTGG GCAAGTCCGC GGTCTTCATC ACCCATGATC TCGACGAGGC GATCCGGATC
GGCGATCGCA TTGCGATCAT GAAGGACGGC GTCATCATCC AGACCGGCAC GGCCGAGGAA
ATCATCCTCA ACCCGGCGGA TGCCTATGTC GCCGAATTCG TCGCCGGCAT ATCCCGCCTT
CATCTGATCA AGGCGCATTC CGTCATGCGC AGCGTCGCAG AATTCCAGCA GAGCGCGCCG
CATTCCGACA TCGCGTCGCT GGCGCGCACG ACGCCGGGCG CTGATATCGA CGAACTCATC
ACATTGACGA TGCAGTCGGA GCGCGATGCC ATCGCGGTCG TCGACAATGA TCAGATCGTC
GGCGTGGTGA CGCCACGCAG CCTGCTGATG GGCGTCAAGG GAACCTCCAC CCACGATCTC
ACGCCGGCGT CCCACAACTG GAGCTGA
 
Protein sequence
MKTVNADIND VLIDCRSLWK VFGDKSAAAM KSIKERGLGK KEVLKEFNCV VGVSDASIEV 
RRGEIFCIMG LSGSGKSTLI RLLNKLITPS SGKVLVKGRD LAALSPVDLR QMRARNIGMV
FQSVALLPHR TVLENAAFGL EVQGIAKPER NKTAVAALEK VGLADWVSRY PNELSGGMQQ
RVGLARALAS DPEIILMDEP FSALDPLIRR QLQDEFRQLT KALGKSAVFI THDLDEAIRI
GDRIAIMKDG VIIQTGTAEE IILNPADAYV AEFVAGISRL HLIKAHSVMR SVAEFQQSAP
HSDIASLART TPGADIDELI TLTMQSERDA IAVVDNDQIV GVVTPRSLLM GVKGTSTHDL
TPASHNWS
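
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute the G+C fraction of a nucleotide string. The helper names are hypothetical, and only the first row of the gene sequence is used as sample input, so the printed fraction describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of ten bases).
first_row = "ATGAAAACCG TAAATGCCGA TATCAATGAC GTCCTGATCG ACTGCCGATC CCTTTGGAAA"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # G+C fraction of this row only
```

Passing the full 1107 bp gene sequence through the same two functions would give a value to compare against the GC content field of the record.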