Gene Rleg2_3159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3159 
Symbol 
ID6981910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3242175 
End bp3243197 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content61% 
IMG OID643397875 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002282652 
Protein GI209550735 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG CGCTCATTGC CTCGGCAGCA CTCGCCGCCC TGTCTCCGCT CGGAGCTACA 
GCAGCGGACC GCACATTGAC CATTTCGGTC TATGCTTTTG CCCAGGACGA TTTCAAGACG
CTGGTCTATG ATCCCTTCGA AGCCAAATGC GGCTGCAAGC TGGTGGTCGA GACCGGCAAC
AGCGTCGAAC GCCTGGCCAA GATGGAAGCG AACAAGGCGA ACCCCGTCGT CGACCTCGCC
GCTGTTTCCA TGGCCGATGC GCTGGCCGCC TCCCGTGCCG GCCTGATCGA CAAGGTCGAC
ACCACCAAGC TCGCCAATTT CACCAAGCTC TACGACGTCG CCAAGGATCC GAACGGCGAC
GGCATGAGCG TCGGTTACAC CTTCTACGCC ACCTCGATCG CCTATCGCTC CGACAAGATG
AAGATCGACT CCTGGGCCGA TCTCCTGAAG CCGGAATATG TCGGCCACGT CGCCTTCCCG
AACGTGACGA CCAACCAGGG GCCGCCGGCG CTCTATATGC TGGGCCAGGC GCTCGGCAAG
GACACCCCCG ATCTGAAGGG GCCGATCGAG GCGCTGGGCG AGAAGAAGGA CGACATCGTC
ACCTTCTACG AAAAATCCTC GCAGCTCGTG CAACTGATGC AGCAGGAGGA AATCTGGGCC
GCGCCGATCG GCCGTTTCTC CTGGGCTGGT TTTACCAAGC TCGATGTTCC GGTCGCCTGG
GCGACACCGA AAGAGGGTCA GACCGGCGGC ATGAATGTGC TGGTGCTGAC CAAGGGTTCG
AAGAACCAGG ATCTCGCCCT GCAGTTCATG GATTTCTGGC TCTCGACCGA CATCCAGACC
AAACTCGCCG AAAAGCTGGT CGACAGCCCG GCCAACAGCG AGGTCAAGCT TTCCGAAGCC
GCTGCCAACA ACCTCACCTA TGGCGAGGAA ACCGCCAAGA GCCTCAAGCT GATCCCTTCG
GCCGTCGCCC TCGACAATCG CGCCGGCTGG CTGAAGACCT GGAACGAAAA GGTCGGCCAG
TAA
 
Protein sequence
MKKALIASAA LAALSPLGAT AADRTLTISV YAFAQDDFKT LVYDPFEAKC GCKLVVETGN 
SVERLAKMEA NKANPVVDLA AVSMADALAA SRAGLIDKVD TTKLANFTKL YDVAKDPNGD
GMSVGYTFYA TSIAYRSDKM KIDSWADLLK PEYVGHVAFP NVTTNQGPPA LYMLGQALGK
DTPDLKGPIE ALGEKKDDIV TFYEKSSQLV QLMQQEEIWA APIGRFSWAG FTKLDVPVAW
ATPKEGQTGG MNVLVLTKGS KNQDLALQFM DFWLSTDIQT KLAEKLVDSP ANSEVKLSEA
AANNLTYGEE TAKSLKLIPS AVALDNRAGW LKTWNEKVGQ