Gene Rleg2_4662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4662 
Symbol 
ID6977756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp298660 
End bp299913 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content60% 
IMG OID643393836 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002278654 
Protein GI209546736 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.917353 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACCC TATCCGCGAA ATTGAAGACT GCGAGCATCG TTGCGATCGC CATGGCGTCT 
TTCGCCGCGA CGCCGGTTCT TGCCGAAGAC ATCACGCTCT GGACGCTCAA CTTCGACAAC
AACGCCGCCA ACGGCGCGCT GAAGAAGGTC GCGACGGATT TCGAAGCGGC GAACCCGGGA
ACGCATGTCG AGATCGTTCA GCGCGCCGTC GACGAGCACA AGACGGCGCT GCGCGTCGCA
GCCGGCTCAG ACAAGGGGCC CGACATTTAT TTCAGCTGGG CGGGCCTTGG CCTCGGCGGC
GAATATGTGA AGGCCGGCCT CTCCCTGCCG CTCGACAAAT ATTACAGCGA ATATAAATGG
AACGACGAAT TGCTGCCCTC GGCTGCGGCT TTCGCCGACC TTTATCCCGG CGGCAAACAC
GGCGTTCCCT TCACCTTCAA GGGCGAGGCC GTCTATTACA ACAAGAAGCT TTTCGAGCAG
GCCGGCATCA AGGAAGAGCC AAAGACCTAT GAGGAACTTC TGGCCGCGGC CGACAAGCTG
AAAGCCGCCG GCATTCCCGC CTTCACCTTC GGCGGCACGG TCAACTGGCA CGTCATGCGC
CTGATGGATG TCATCCTCGA GACCAAGTGC GGCGCCGAGA AGCACGACGC GCTGAAGGCG
ATGACGCTGG ACTGGACCAA GGAGCCTTGC GCGACGGACG CCTTTGCGGA ATTTGCCAAG
TGGACGAAGG ACTATACGCT GCAGCCCTTC ATGGGCATCG ACAACAAGCA GTCCTACAGC
CTCTTCACGG CAGGCCGCGC GGCGATGATG CTCGAAGGCG ACTGGCTGGT CAGCCAGCTC
AACGGCTCGG GCGCCAATCT CGACGATTAC GGCATCTTCC CCTTCCCGAC CAATACCGAG
CGTCTCTATG GTTTCGCCGA GTACAATTAC ATCAGCACCA AGAGCAAGAA TCCCGACACA
GCCGCGAAAT TCCTCGACTA TTTCCTTTCG ACCAAGGTGC AGCAGGATCT GCTCGGCCAG
CTGAGCTCGA CCTCCGTCAA CAAGAATGTC CAATACGCCA ACCAGAAGCC GCTCGAGGCG
GAATGGCTGG GGATCTTCCA GAAATACGGC AAGGTCTACA TGAACGGCGA CCAGGCCTTC
CCGCTCGATG TGACGACGGA ATATTTCCGC GTCATCAATG ACGTCGCCTC CGGCAACACC
GAGCCGGCCG AGGCGGCCAA GCAGCTGCAG ACCTTCATCG CAAGCAGAAC CTGA
 
Protein sequence
MLTLSAKLKT ASIVAIAMAS FAATPVLAED ITLWTLNFDN NAANGALKKV ATDFEAANPG 
THVEIVQRAV DEHKTALRVA AGSDKGPDIY FSWAGLGLGG EYVKAGLSLP LDKYYSEYKW
NDELLPSAAA FADLYPGGKH GVPFTFKGEA VYYNKKLFEQ AGIKEEPKTY EELLAAADKL
KAAGIPAFTF GGTVNWHVMR LMDVILETKC GAEKHDALKA MTLDWTKEPC ATDAFAEFAK
WTKDYTLQPF MGIDNKQSYS LFTAGRAAMM LEGDWLVSQL NGSGANLDDY GIFPFPTNTE
RLYGFAEYNY ISTKSKNPDT AAKFLDYFLS TKVQQDLLGQ LSSTSVNKNV QYANQKPLEA
EWLGIFQKYG KVYMNGDQAF PLDVTTEYFR VINDVASGNT EPAEAAKQLQ TFIASRT