Gene Rleg2_1218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1218 
Symbol 
ID6979939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1229401 
End bp1230735 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content58% 
IMG OID643395932 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002280738 
Protein GI209548821 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAA CCGTGAAATC GATCAGCATA CTGCCGGCAT ACCGATTGAA AGCAGCTGTC 
GCGCTTGCCG GCGCGGCGCT TTTGAGTTTC GGCCTTTCCG CCGCCCGGGC CGACGATCTG
GCCGTGTGGG ATGACCAGAC CTTCGAAGGC CAGAGCGCGG TCATCGAGCA ACTGAATAAG
GATTTCGAAG CCGCGCATCC CGGTGTCACG ATCAAACGCA CCGCGCGCAC TTTCGATGAC
ATGAAGCTGA CGCTGAAGCT TGCAGTTTCG GCAGGTGATG GCCCCGTCAT CACCAAGGTC
AACCAGGGCG CCGGCGACAT GGGCGCGATG GTCAAGGAAG GCTTGCTCCT GCCGGTCGAC
GAATACATCA AGAAATATGG TTGGGATAAG CGGCAGTCGG ATTCCGTGCT GGCCAGAGAC
CGCTGGGAGG GCGCGAAATT CGGGGTCGGC AAGACCTACG GCATATCGGG TCTCGGCGAG
ATCGTCGGCC TCTACTACAA TAAGAAGATC CTCGACGACG CGGGCGTGGC GCTGCCGCAG
ACCTTCGAGG AACTGTTGGC CGATCTCGAC AAGCTGAAGG AAAAAGGCGT TGCGCCCTTC
ATGATGGGCT CCGCCAAGCA GCATCTTGCC CTGCATATGA TCGGCGCTAT CGATCAAGCG
CATATCGACG CGGCCAATCG CGCCGAGCTT GACGACCTGA TCTACGGCAA GGGCGGTTCC
TGGAACACCA AAGGCAACAT CGAATCAGCC AAACTCGTGC AGAAATGGGC ACAGGGCGGC
TATTTCTACC CCGGTTTCGA GGGCATCTCG GGTGACGACG CCGTCCAGCT GTTCATATCA
GGGCAGGGCG CATTTCTGAT CTCCGGAACC TGGTACTTTG GCGACATGCA AAACAATCCG
GATATCGGCT TCATGGCCAT TCCCGCGCCG AAGGGTGTCG CCAAACCCAT GAGCGTCGGC
GGTGTGGATC TTGCCTGGGC GATAACCAGC CTTGCCAAAG ACAAGGCGAA GCAGGATCTG
GCCGGCGAGT ACATCGACTA TATGGTGTCC GAAAAGGCCG CTGAAAGCTG GGCCGCTGCA
GGCTATCTTC CTGCAACGTC GCTCCCGGCG GATGCAAAGC CCAAGCTGAC GCCGCTCCTG
ACTTCCGGCA TCGAGATGTG GAAGACACTC AACGCCAACG ATGCGCTCGG CCATTACCCC
GATTGGTCGA GCCCGACGAT GCTGAAGACA ATCGACGACA ACACGCCACT TCTCCTGTCC
GGCAAGATCA CGCCCGAAGC CTTTGTCGAT GCCATGGACA AGGATTATCA GGCCTATTTG
AAGGATCAGA AATAA
 
Protein sequence
MNRTVKSISI LPAYRLKAAV ALAGAALLSF GLSAARADDL AVWDDQTFEG QSAVIEQLNK 
DFEAAHPGVT IKRTARTFDD MKLTLKLAVS AGDGPVITKV NQGAGDMGAM VKEGLLLPVD
EYIKKYGWDK RQSDSVLARD RWEGAKFGVG KTYGISGLGE IVGLYYNKKI LDDAGVALPQ
TFEELLADLD KLKEKGVAPF MMGSAKQHLA LHMIGAIDQA HIDAANRAEL DDLIYGKGGS
WNTKGNIESA KLVQKWAQGG YFYPGFEGIS GDDAVQLFIS GQGAFLISGT WYFGDMQNNP
DIGFMAIPAP KGVAKPMSVG GVDLAWAITS LAKDKAKQDL AGEYIDYMVS EKAAESWAAA
GYLPATSLPA DAKPKLTPLL TSGIEMWKTL NANDALGHYP DWSSPTMLKT IDDNTPLLLS
GKITPEAFVD AMDKDYQAYL KDQK