Gene Rleg2_5331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5331 
Symbol 
ID6978425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp956041 
End bp957315 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content60% 
IMG OID643394433 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002279251 
Protein GI209547333 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTTGG AAAAATTCGG GAGGACGGTG AAACTTGCCG TGGCAGGTTT TACCTTGGCA 
GCAACGACAT CAGGCGCAGC GTTTGCCCAA GACGCCGTGA CGCTCAAATG GGCTTTGTGG
GACTGGGATA AGACCGCCTA TTACAAGCCG CTGATCGAGG CCTATCAGGC CAAGCATCCG
AACGTGAAGT TCGAGCCGAT GGATCTCGGC TCGCAGGACT ACCAGCAGAT GATTTCGACG
CAGCTGACCG GCGGCTCGAA AGACATCGAC ATCGTCACCA TCAAAGACGT GCCGGGCTAT
ACCAATCTGG TGCGCGCCGG CAATATCGCC GATCTGAGCG GCTTCGTGAA GGATCAGAAA
ATCGATCCGG CCCCCTATGG CGGCTTGATC GAGGAATTGA CCATCGACGG CAAGGTCTAT
TCTCTGCCGT TCCGCTCCGA CTTCTGGATC GTCTATTACA ACAAGGACAT CTTCGACAAG
GCTGGCGTCC CCTACCCCAC CAATGACATG ACCTGGGCGC AGTTCGACGC GACCGCCGAG
AAGCTGACCG GCGGCATGGG CACCAACAAG ACCTATGGCG CGCTGCTGCA CACCTGGCGT
TCGACCGTCC AGCTGCCTGG TATCCTCGAC GGACAACACA CGCTGGTCGA CGGCGACTAC
GCCTTCCTGA AGCCGTGGTA CGAGCGGGCG CTCACCCTGC AGAAGGATGG CGCAATTCCC
TCCTATGCCT TCCTGAAGAC GTCGAACACG CATTATTCGG CGCTGTTCTT CAACGGCACG
ATCGGCATGC TGCCGATGGG AACCTGGTTC GTCGGCACCC AGATCGCCAA GGTGAAATCG
GGTGAATCGA AGAGCAAGAA CTGGGGCATC GTGAAGTTCC CGCATCCGGA CGGCGTGGCA
GCCGGCACGA CGGCTGCGCA GATCTCGGGC CTCGCCGTCA ACGCCAACTC AGAGCACAAG
GATGCGGCCC TCGACTTCAT CAAGTTCGTC ACCGGTCCGG AGGGCGCTGC CGTCATCGCA
TCGACGGGCA CCTTCCCGGC GCTCAAGACC GCCGATGTCA GCGCAAAGAT CGCCGCAACG
CCCGGCTTCC CGGAAGACGC GGCCAGCAAG GAGGCGCTGA TACCGTCGAA GGCCTATCTG
GAGATGGCGG TCAACCCGAA CGCGGCCAAG ATCGAGGTCG TGCTCAACCG CGTCCATGAC
GCGATCATGA CCGACAATAC CCCGATCGAC GACGGCCTGA AGGAAATGAC CGAAGGCGTC
AAGGCCATCA AGTAG
 
Protein sequence
MYLEKFGRTV KLAVAGFTLA ATTSGAAFAQ DAVTLKWALW DWDKTAYYKP LIEAYQAKHP 
NVKFEPMDLG SQDYQQMIST QLTGGSKDID IVTIKDVPGY TNLVRAGNIA DLSGFVKDQK
IDPAPYGGLI EELTIDGKVY SLPFRSDFWI VYYNKDIFDK AGVPYPTNDM TWAQFDATAE
KLTGGMGTNK TYGALLHTWR STVQLPGILD GQHTLVDGDY AFLKPWYERA LTLQKDGAIP
SYAFLKTSNT HYSALFFNGT IGMLPMGTWF VGTQIAKVKS GESKSKNWGI VKFPHPDGVA
AGTTAAQISG LAVNANSEHK DAALDFIKFV TGPEGAAVIA STGTFPALKT ADVSAKIAAT
PGFPEDAASK EALIPSKAYL EMAVNPNAAK IEVVLNRVHD AIMTDNTPID DGLKEMTEGV
KAIK