Gene Rleg_0157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0157 
Symbol 
ID8015405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp154908 
End bp155897 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content62% 
IMG OID644822748 
Productsugar ABC transporter, substrate-binding protein 
Protein accessionYP_002974007 
Protein GI241202911 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000969227 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTCG TGAAATCCCT TCTGTCCCGC CGCGCCTTTA CCGCGCTTGC GGGCGCCGCA 
GTTATCGCCT CGGCGATGCC GGCACCGTCG TTTGCGGCCG ACGTGACGAT CCCGATCATC
GTCAAGGACA CGACGTCCTT CTACTGGCAG ATCGTTCTGG CCGGCGCCCG CAAGGCCGGC
AAGGATCTCG GCGTCAACGT GCCGGAACTC GGCGCTCAGG CCGAATCCGA CGTCAACGGC
CAGATCAGCA TTCTTGAGAA CGCCGTTGCC GGCAAGCCGG CGGCCGTCGT CATTTCGCCG
ACCGAATTCA AGGCGCTCGG CAAGCCGATC GATGAAGCGG CCAAGTCGGT TCCGATCATC
GGCATCGACT CGGGCGCCGA CTCCAAGGCG TTCAAGTCGT TCCTGACGAC CGACAACGTT
CAGGGCGGCC GCATCGCCGC TGACGGTCTG GCCGCCGCCA TCAAGGGCGC CACCGGCAAG
GAAGAGGGCG AAATCGTCAT CCTCACCAAC CTTCCGGGCG TCGGCTCGCT GGAACAGCGC
CGCGAAGGCT TCCTGGATCA GGTGAAGACC AAGTATCCCG GCCTGAAGGT CATTGCCGAC
AAGTACGGCG ACGGCCAGGC AACGACCGGC CTCAACATGA TGACCGACCT GATCACGGCA
AATCCGAACC TCGTCGGCAT CTTCGCCTCG AACCTGATCA TGGCGCAGGG CGTTGGCCAG
GCGATCGCCG AAAACAAGCT CGGCGAGAAG ATCAAGGTCA TCGGCTTTGA CAGCGACGAC
AAGACGGTCG GCTTCCTCAA GGATGGTGCG ATTGCCGGCC TCGTCGTTCA GGACCCCTAC
CGCATGGGTT ATGACGGCGT GAAGACCGCG CTTGCCGTCT CCAAGGGCGA GAAGGTCGAA
GAGAATGTCG ACACCGGTGC AAACCTCGTC ACCAAGGCGA ATATGGCCGA CCCGAAGATC
GACGCGCTGC TGAACCCGAA GATCAAGTAA
 
Protein sequence
MSFVKSLLSR RAFTALAGAA VIASAMPAPS FAADVTIPII VKDTTSFYWQ IVLAGARKAG 
KDLGVNVPEL GAQAESDVNG QISILENAVA GKPAAVVISP TEFKALGKPI DEAAKSVPII
GIDSGADSKA FKSFLTTDNV QGGRIAADGL AAAIKGATGK EEGEIVILTN LPGVGSLEQR
REGFLDQVKT KYPGLKVIAD KYGDGQATTG LNMMTDLITA NPNLVGIFAS NLIMAQGVGQ
AIAENKLGEK IKVIGFDSDD KTVGFLKDGA IAGLVVQDPY RMGYDGVKTA LAVSKGEKVE
ENVDTGANLV TKANMADPKI DALLNPKIK