Gene Rleg_4751 details


Gene Information

Locus tag: Rleg_4751
Symbol:
ID: 8007004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 119746
End bp: 120999
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 58%
IMG OID: 644821681
Product: extracellular solute-binding protein family 1
Protein accession: YP_002972941
Protein GI: 241113106
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
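
The coordinate and length fields in this record are related by simple arithmetic. The sketch below assumes inclusive, 1-based coordinates (the record does not state its convention) and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(119746, 120999)
print(length, protein_length(length))  # 1254 417
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of the record.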


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.137656
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.378465
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTGACCC TTTCTGCGAA ATTGAAGACT GCGAGCATCG TTGCGATCGC CGTGGCGTCA 
CTATCGGCCA CGCCGGTTCT TGCGGAAGAC ATCACGCTTT GGACCCTCAA CTTCGACAAC
AATGCTGCCA ACACGGCTCT GAAAAAGGTG GCGACGGACT TCGAAGCGGC AAACCCCGGA
ACGCATGTCG AGATCGTTCA GCGCGCCGTC GACGAGCATA AGACCGCCTT GCGCGTCGCT
GCTGGCTCCG ACAAGGGACC TGACATTTAT TTCAGCTGGG CGGGCCTCGG CCTCGGCGGC
GAGTATGTGA AGGCCGGTCT GTCCCTGCCC CTCGACAAAT ACTATGCCGA GTATAAGTGG
AGCGACGAAT TGCTGCCCTC GGCAGCGGCT TTTGCCGACC TCTATCCCGG CGGCAAGCAC
GGCGTCCCCT TCACCTTCAA GGGTGAGGCC GTCTATTACA ACAAGAAGCT TTTCGAACAG
GCCGGCATCA AGGAAGAGCC GAAGACCTAC GAGGAATTCC TTGCAGCGGC CGATAAGCTG
AAGGCTGCCG GCATTCCCGC CTTCACCTTC GGCGGCACGG TCAACTGGCA CGTCATGCGT
CTCATGGACG TCATCCTTGA AACGAAGTGC GGTGCTGAAA AGCACGATGC GCTGAAGGCG
ATGACGCTGG ATTGGACCAA GGAACCCTGC GCGACGGATT CATTCGCGGA GTTTGCGAAG
TGGACGAAGG ACTATACGCT GCAGCCGTTC ATGGGCATCG ACAACAAACA GTCCTACAGC
CTCTTCACCG CGGGTCGTGC AGCGATGATG CTCGAAGGCG ACTGGCTGGT CAGCCAGCTT
AACGGCTCCG GCGCCAATCT CGACGACTAC GGGATTTTCC CCTTCCCGAC CAACACCGAT
CGTCTCTACG GTTTCGCCGA GTACAACTAC ATCAGCACCA AGAGCAAGAG CCCTGATGTA
GCGGCGAAGT TCCTCGACTA CTTCCTCTCG ACGAAGGTCC AGCAGGACCT GCTCGGCCAG
CTGAGTTCAA CCTCCGTCAA CAAGAACGTC CAATATGCCA ACCAGAAGCC GCTCGAGGCG
GAATGGCTGG GGATCTTCCA GAAATACGGC AAGGTCTACA TGAACGGCGA CCAGGCGTTC
CCGCTCGACG TCACGACGGA GTACTTCCGG GTCATCAACG ATGTTGCTTC CGGCAACACC
GAGCCGGCCG ATGCGGCCAA GCAGTTGCAG AGCTTTATCG CAAGCCGAAC CTGA
 
Protein sequence
MLTLSAKLKT ASIVAIAVAS LSATPVLAED ITLWTLNFDN NAANTALKKV ATDFEAANPG 
THVEIVQRAV DEHKTALRVA AGSDKGPDIY FSWAGLGLGG EYVKAGLSLP LDKYYAEYKW
SDELLPSAAA FADLYPGGKH GVPFTFKGEA VYYNKKLFEQ AGIKEEPKTY EEFLAAADKL
KAAGIPAFTF GGTVNWHVMR LMDVILETKC GAEKHDALKA MTLDWTKEPC ATDSFAEFAK
WTKDYTLQPF MGIDNKQSYS LFTAGRAAMM LEGDWLVSQL NGSGANLDDY GIFPFPTNTD
RLYGFAEYNY ISTKSKSPDV AAKFLDYFLS TKVQQDLLGQ LSSTSVNKNV QYANQKPLEA
EWLGIFQKYG KVYMNGDQAF PLDVTTEYFR VINDVASGNT EPADAAKQLQ SFIASRT
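
The gene and protein sequences are printed in blocks of ten characters, so whitespace has to be stripped before they can be compared. The following is a minimal sketch of how the gene sequence could be translated and its GC content computed. It assumes that the in-frame codon assignments of translation table 11 match the standard genetic code (table 11 differs mainly in which codons may act as start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order.
# Assumption: table 11 shares these in-frame assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence from this record.
first_row = "ATGCTGACCC TTTCTGCGAA ATTGAAGACT GCGAGCATCG TTGCGATCGC CGTGGCGTCA"
print(translate(first_row))  # MLTLSAKLKTASIVAIAVAS
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.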