Gene Rleg2_1699 details

Gene Information

Locus tag: Rleg2_1699
Symbol:
ID: 6980436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1731520
End bp: 1732779
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 63%
IMG OID: 643396423
Product: extracellular solute-binding protein family 1
Protein accession: YP_002281213
Protein GI: 209549296
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
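
The length fields above can be cross-checked against the coordinates with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record; helper names are illustrative only.
START_BP = 1731520
END_BP = 1732779


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1260 419"
```

Under those assumptions the result agrees with the reported gene length of 1260 bp and protein length of 419 aa.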


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAA TACAATCGAT CGGTGCGGCT TTTGCTGCGG TTCTCTTGAG TTCCGTTGCC 
GCCCATGCCG GCGACGTGCG CATCATGTGG TATTCCGATG GCGGCGAAGG CGCTGTCATC
AAGGATCTGC TGGCGCGCTT CTCGAAGGCC AATCCCGATG TCAACGTCAT TCTCGACGAG
GTCTCCTATG ACGTCGTCAA GGAACAGCTG CCGGTGCAGC TCGAAGCCGG GAAGGGGCCG
GATATCGCCC GCGTCACCAA TCTGAAGGCG CTGGCCCAGC ACTGGCTCGA TCTTCGCCCG
CTCCTTGCCG ATGCGAAATA TTGGGACGAC AATTTCGGCG CCCAGGCCGA CTGGATGCGC
CCCGACGGCT CGAACGCCAT CACCGGCTTC ATGACGCAGC TGACGCTGAC CGGCGGCTTC
GCCAACAAGA CGCTGTTCGA TCAGGCCGGC GTCGAAATTC CCGGCCCGAA AGCCACCTGG
GATGATTGGG CGGCGGCCGC CAAGAAGGTC GCCGACAGTC AGAAGGTCTT CGCCATGGCG
ATTGACCGCT CCGGTCATCG CGTCTCCGGC CCGAACATCT CCTACGGCGC CAATTATATC
GCCGCCGACG GCAAGCCGGC GCCGATCGAT CAGGGCGCCA AGGACTTCCT CAGCCGCTTC
GTCAAATGGA ACGAGGAGGG CATCGTCAAC AAGGATGTCT GGGTCAGCGC CGCCGGCACC
ACCTATCGCG CTGCCGCCGA AGACTTCATC AATGGCGGCC TTGCCTATTA TTATTCCGGC
AGCTGGCAGG TCCCGGGCTT TGCCCAGAAG ATCGGCGATA ATTTCGATTG GGTCATGACC
GGAAGCCCCT GCGGCACGGC CAGCTGCACC GGCATACAGG GCGGCGCCGC TCTTGTCGCC
GTCAAATACA CCAAGAACCC CAAGGACGTC GCCAAGGTGA TGGATTACCT GGCAGGTGCC
GACGTGCAGA AGGAATTCGC CGAGCGCAGC CTGTTCATTC CAGCGCATAA GGGTGTCGCC
GCCGGCCAGG TGGACTTCAA GACCGACAAT CCGCATGTGC AGGCGGCGCT GAAGGCCTTC
GTCGAAGCGG CCGGCCAGAC GGCGGCACCG GCCATGAAGC TGCCGGGCTG GAAGTGGTCG
GATGCCTATT ACAGCGCCAT CGTCGCCCGC ATCAGCCAGG TGATCGCCGG CGAGATGAAG
CTCGACGACG CCTATGCCCG CATCGACGAG GACATCAAGG CCAAGGTCGG CGCCAACTAA
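
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one possible way to do that, assuming the sequence is pasted exactly as displayed (ten-base blocks separated by spaces and line breaks); `clean_sequence` and `gc_percent` are illustrative names, not functions of the database.

```python
def clean_sequence(text):
    """Drop the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the first two ten-base blocks of the gene sequence only.
print(gc_percent("ATGACCAGAA TACAATCGAT"))  # prints 35.0
```

Passing the full gene sequence to `gc_percent` would give a figure to compare with the reported value.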
 
Protein sequence
MTRIQSIGAA FAAVLLSSVA AHAGDVRIMW YSDGGEGAVI KDLLARFSKA NPDVNVILDE 
VSYDVVKEQL PVQLEAGKGP DIARVTNLKA LAQHWLDLRP LLADAKYWDD NFGAQADWMR
PDGSNAITGF MTQLTLTGGF ANKTLFDQAG VEIPGPKATW DDWAAAAKKV ADSQKVFAMA
IDRSGHRVSG PNISYGANYI AADGKPAPID QGAKDFLSRF VKWNEEGIVN KDVWVSAAGT
TYRAAAEDFI NGGLAYYYSG SWQVPGFAQK IGDNFDWVMT GSPCGTASCT GIQGGAALVA
VKYTKNPKDV AKVMDYLAGA DVQKEFAERS LFIPAHKGVA AGQVDFKTDN PHVQAALKAF
VEAAGQTAAP AMKLPGWKWS DAYYSAIVAR ISQVIAGEMK LDDAYARIDE DIKAKVGAN
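
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage on the first two ten-base blocks of the gene sequence.
print(translate("ATGACCAGAA TACAATCGAT"))  # prints "MTRIQS"
```

The six residues returned match the start of the protein sequence listed above (MTRIQS...).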