Gene Rleg2_1685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1685 
Symbol 
ID6980422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1715010 
End bp1716314 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content62% 
IMG OID643396409 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002281199 
Protein GI209549282 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTTA TGCTGAAGAA CGCCGCTTTC GGCGGCAGGC GCATTGTATC CATGGCCGCG 
GCAGCCGGCC TGTTGCTGGC GGGAGCGGGT GCCGCCTCGG CGACGACGGT GGTGAAATGG
CTGCATCTCG AGCTCGACCC GAAAAACGTT GCGGCCTGGG AAGATATCGT CAAGAAATAC
GAAGCCCAGC ATCCCGACGT CGACATCCAG ATGCAGTTCC TCGAAAACGA GGCCTTCAAG
GCCAAGCTTC CGACATTGCT GCAGTCCGAC GACGTGCCGG ATTTCTTTTT CAGCTGGGGC
GGCGGCGTGT TGAAGCAGCA GTCCGAGACC GGCGCGCTCC AGGATGTGAC GGCAGCGCTT
GATGCCGATG GCGGCAAGCT GCGCAGCGCC TATACCCCGG CTTCGGTCGA TGGCCTGACT
TTCGAGGGCA AGACCTGGGC CATTCCCTAC AAGGTCGGTC TCGTCAGCTT CTTCTACAAC
AAGGCGCTGT TTGCCAAGGC CGGCGTCAAG GCCGAAGACA TCAAGACCTG GAGCGATTTT
CTCGCCACGG TGAAGAAGAT CAAGGCGGCC GGCATCGTGC CGATCGCCGG CGGCGGCGGT
GAGAAATGGC CGATCCATTT CTACTGGAGC TATCTCGTCA TGCGCGAGGG CGGCCAGAAG
GTCTTCGAAG CGGCCAAGAA CGGCGAGGGC GAAGGCTTCC TCGATCCCAC TATCATCAAG
GCCGGCGACG ACCTCGCCGA ACTCGGCAAG CTCGAACCGT TCCAGCCCGG CTATCTCGGT
GCGACCTGGC CGCAGACGCT CGGCGTTTTC GGCGACGGCA AGGCGGCGAT GATCCTCGGC
TTTGAAGCGA CAGAGGCCAA CCAGCGCAAG AATGCCGGCG ACGGCAAGGG GCTTTCCTCA
GACAATATCG GCCGTTTCGT CTTCCCGACG GTCGAAGGCG GCGCCGGCAA GCCGACCGAT
ACGCTCGGCG GCTTGAACGG CTGGGCCGTC ACCAAGAAGG CCTCCAAGGA AGCGCTCAAT
TTCCTCGCTT TCCTGACGAG CGCGGAGAAT GAACGGGCGA TGGCCAAATC AGGCATGTTG
CTTCCCGTTG CCGTCGGCGC CGATGACGGC GTCGTCAATC CGTTGCTGGC CGAATCGGCC
AAACAGCTTG CCGGTTCGAC CTGGCATCAG AACTTCTTCG ACCAGGATCT CGGCGCTGCC
GTCGGCCGCG TCGTCAACGA CGTCTCCGTG GAAATCGTCT CCGGCCAGAT GAATTCCAAG
GACGGCGCCC AGATGATCCA GGACGCTTTC GAGCTGGAAC AATAA
 
Protein sequence
MNFMLKNAAF GGRRIVSMAA AAGLLLAGAG AASATTVVKW LHLELDPKNV AAWEDIVKKY 
EAQHPDVDIQ MQFLENEAFK AKLPTLLQSD DVPDFFFSWG GGVLKQQSET GALQDVTAAL
DADGGKLRSA YTPASVDGLT FEGKTWAIPY KVGLVSFFYN KALFAKAGVK AEDIKTWSDF
LATVKKIKAA GIVPIAGGGG EKWPIHFYWS YLVMREGGQK VFEAAKNGEG EGFLDPTIIK
AGDDLAELGK LEPFQPGYLG ATWPQTLGVF GDGKAAMILG FEATEANQRK NAGDGKGLSS
DNIGRFVFPT VEGGAGKPTD TLGGLNGWAV TKKASKEALN FLAFLTSAEN ERAMAKSGML
LPVAVGADDG VVNPLLAESA KQLAGSTWHQ NFFDQDLGAA VGRVVNDVSV EIVSGQMNSK
DGAQMIQDAF ELEQ