Gene Rleg2_4695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4695 
Symbol 
ID6977789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp336422 
End bp337720 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content59% 
IMG OID643393868 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002278686 
Protein GI209546768 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.52565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGCG CCACCGTCAG CGCATTTGCG CTGAGCACAA TGTTATTTTC AGCATCCGGT 
TCGTATGCCC AGGAACTTGC AACCAAGGAC AGGATCGGCC TTGCCGATGC GCCAAAATCC
CTCGTCGTCC GTCTGACCAA CGACAGCCCG AACAATGCGG ATCCGGCGAT CGCCGAGGGC
TATCAAAAGC TCTTCGTCGA CTTCATCAAA AAGCATCCCG ACTGGAAATT GCAGATGCAA
TTCATGTCGT CTGATATCGG CACCGAACAG GCCAAGATGC TGGAGCAGGC CAAGGCCGGC
AATGCGCCCG ATTGCGCCGC CGTCGACTCC TTCGTGCTCT CGCAGTTCAT GGTCAATCAT
GTGCTGGCGG ACTTCACGCC CTATTTCTCG AAGGAAGAAG TAGACGACCT CTTCCCCTTC
ATCCGCAACG GCATCACCGA CAAGGACAAG ACGGTGCGCG CCTGGTGGTG GGATACCGAC
CTTCGTGTGC TCTACCGCAA CAAGTCTGTG GTCGCAGATG CGCCGCAGAC CTGGGATGAT
CTGAAAAAGG CCGCGCTTGC CTCCACCAAG GAGGGCATGG AAGGCGTGCT CTTCAACGGC
GGGCGCTGGG AAGGCACGAC CTTCGACTGG CTGGCGAACT ATTGGGCGCT CGGCGGCAAG
CTCGTCGACG ACTCGGGCAA ACCGGTCTTC GGCGAAGGCG AGAACAAGGA GAAATTCCTG
AAGGCGCTGA ACTATTTCAA GGATCTCGTC GATTCCGGCG CGGCCCCCAA GCGCGTCAGC
ACGATCGCCA ATTACGACGA CATGAATGCC GCGGCAGCTG CCGCGACCAC AGCCCTCTTC
ATCGGCGGCA ATTGGCAATA TGCCCAGCTG AAGTCCACGC TTGACGAAGA CGAGTTCAAG
AACTGGACCT TCTCGCCGAT CCCCGGCCCG ACGGCCGATC AACGTTCGAC AGGCACCGGC
GGCTGGACGA TCGCCTCGTT CAGCAAGGAC AAGGACAAGG TCGAGATGTG CGCCAACCTC
GCACGCGAGG TCTATATGGG GCCGGCCAAC GCGCTGCAGC AGCAGCTGCC GACCCGCAAA
TCGCTGTTCG ACAAGTACGA GGTCTTCTCG ACGGAAGCCA ATAAGACCTT CGCCAAGGCT
CTGGTCGACG GACAGGCGCG CCCCGGCGTG CCGATCTATC CAGAGATCTC GAACCAGATC
CAGATCATGA TGGGTGACGT GCTCTCTGGG ACTAAGAAAC CGGAAGAAGC GCTGGATGCC
GCCTTCAATG CGGCGATGGA GGCCTACAAG CGTCTGTGA
 
Protein sequence
MKSATVSAFA LSTMLFSASG SYAQELATKD RIGLADAPKS LVVRLTNDSP NNADPAIAEG 
YQKLFVDFIK KHPDWKLQMQ FMSSDIGTEQ AKMLEQAKAG NAPDCAAVDS FVLSQFMVNH
VLADFTPYFS KEEVDDLFPF IRNGITDKDK TVRAWWWDTD LRVLYRNKSV VADAPQTWDD
LKKAALASTK EGMEGVLFNG GRWEGTTFDW LANYWALGGK LVDDSGKPVF GEGENKEKFL
KALNYFKDLV DSGAAPKRVS TIANYDDMNA AAAAATTALF IGGNWQYAQL KSTLDEDEFK
NWTFSPIPGP TADQRSTGTG GWTIASFSKD KDKVEMCANL AREVYMGPAN ALQQQLPTRK
SLFDKYEVFS TEANKTFAKA LVDGQARPGV PIYPEISNQI QIMMGDVLSG TKKPEEALDA
AFNAAMEAYK RL