Gene Rleg_6156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6156 
Symbol 
ID8016169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp202415 
End bp203647 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content61% 
IMG OID644827462 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002978662 
Protein GI241258778 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.889741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000305646 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTATCC GCAAATATGC AATTCTGGGC GCTCTCGCAC TTGCAGGCGT CTCGCTGTTC 
GGTCTTTCGG CCAAGGCCGA AGACGTCACA CTGACGCTCT GGTCGCTGGA TAAGGACACC
CAGCCGGCGC CGAACCTCGT CAAGGAGTTC AACGCCCAGA ACAACGGCAT CAAGATCGAA
TATCGGCTGA TCCAGTTCGA CGACGTCGTC ACCGAGGCGA TGCGTGCCTA TGCGACCGGC
CAGGCGCCCG ACATCATCGC CGTCGACAAT CCGGAGCATG CGCTGTTTTC GTCGCGCGGC
GCCTTCCTGG ATCTTACCGA CATGATCGCC AAGTCGACCG TCATCAAGCC GGAGAATTAT
TTCCCCGGCC CGCTGAAATC GGTCGAGTGG GACGGCAAGT ATTTTGGCGT GCCGAAGGCG
ACCAATACGA TCGCGCTTTA CTATAACAAG GACATGTTCA AGGCCAAGGG CCTCGACCCG
AACAAGCCAC CGCAGACTTG GGACGAGCTC GTCGAGGACG CGCGTAAGCT GACCGACCCC
GCCAAGAACG TCTATGGTCT CGCCTTCTCG GCCAAGGCCA ACGAGGAGGG CACCTTCCAG
TTCCTTCCCT GGGCTCAGAT GGGCGGCGGC AGCTATGAGA ACATCAATGC CGAAGGCGCG
GTGAAGGCGC TCGGGATCTG GAAGACGATC ATGGACGAGA AGCTCGCTTC TCCCGACACC
TTGACGCGCG GCCAGTGGGA TTCGACCGGC ACCTTCAATT CCGGCAATGC GGCAATGGCG
ATCTCGGGCC CGTGGGAGCT CGACCGCATG ACGCAGGAAG CGAAGTTCGA CTGGGGCGTC
ACGCTGCTCC CGGTTCCCAA GGAAGGGGCT GAACGCTCCT CGGCCATGGG CGACTTCAAC
TGGGCGATCT TCGCCACCAG CAAACATCCG GCCGAAGCCT TCAAGGCGCT CGAATATTTC
GCCTCGCAGG ACGACAAGAT GTTCAAGAAC TTCGGCCAGC TTCCGGCCCG TTCCGACATC
TCGATCCCCG AGACCGGCCA GCCGCTGAAG GATGCAGCCC TCAAGGTCTT CCTCGAACAG
CTGAAATACG CCAAGCCGCG CGGCCCGCAT CCGCAATGGC CGAAGATCTC CAAGGCGATC
CAGGACGCTA TCCAGGCAGC ACTCACCGGC CAGATGAGCC CGAAAGACGC GCTCGACCAG
GCAGCCGACA AGATCAAGGC AGTACTAGGC TGA
 
Protein sequence
MAIRKYAILG ALALAGVSLF GLSAKAEDVT LTLWSLDKDT QPAPNLVKEF NAQNNGIKIE 
YRLIQFDDVV TEAMRAYATG QAPDIIAVDN PEHALFSSRG AFLDLTDMIA KSTVIKPENY
FPGPLKSVEW DGKYFGVPKA TNTIALYYNK DMFKAKGLDP NKPPQTWDEL VEDARKLTDP
AKNVYGLAFS AKANEEGTFQ FLPWAQMGGG SYENINAEGA VKALGIWKTI MDEKLASPDT
LTRGQWDSTG TFNSGNAAMA ISGPWELDRM TQEAKFDWGV TLLPVPKEGA ERSSAMGDFN
WAIFATSKHP AEAFKALEYF ASQDDKMFKN FGQLPARSDI SIPETGQPLK DAALKVFLEQ
LKYAKPRGPH PQWPKISKAI QDAIQAALTG QMSPKDALDQ AADKIKAVLG