Gene Rleg_5048 details


Gene Information

Locus tag: Rleg_5048
Symbol:
ID: 8007641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 431192
End bp: 432466
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 59%
IMG OID: 644821963
Product: extracellular solute-binding protein family 1
Protein accession: YP_002973223
Protein GI: 241113388
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
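
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the database. Under those assumptions it reproduces the listed 1275 bp and 424 aa.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(431192, 432466)
print(length, protein_length_aa(length))
```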


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.133709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATTTGG ATAAATTCGG GGGGACGGTG AAACTTGCCG TGGCAGGATT CACCTTGGCG 
GCGATGACGG CCGGGGCAGC GTTTGCCCAG GACGCCGTGA CGCTGAAATG GGCTTTGTGG
GACTGGGACA AGACCGCCTA TTACAAACCG CTGATCGAGG CCTATCAGGC CAAGCACCCC
AACGTGAAGT TCGAGCCGAT GGATCTCGGC TCGCAAGACT ATCAGCAGAT GATCTCAACG
CAGTTGACCG GCGGCTCCAA GGACATCGAC ATCGTCACTA TCAAGGATGT GCCGGGCTAC
ACCAATCTGG TGCGCGCCGG CAACATCGCC GATCTCAGCG GCTTCGTGAA GGATCAGAAG
ATCGACCCGG CTCCCTTTGG CGGCCTGATC GAGGAACTGA CCATCGATGG CAAGATCTAC
TCCCTGCCGT TCCGCTCCGA CTTCTGGGTT GTCTATTACA ATAAGGATAT ATTCGACAAA
GCAGGCGTCC CCTACCCCAC CAATGACATG ACCTGGGCGC AGTTCGACGA GACCGCCGAG
AAGCTTTCAG GCGGCATGGG CACCAACAAG ACCTATGGCG CGCTTCTGCA TACCTGGCGG
TCAACCGTTC AATTGCCTGC CATCCTCGAC GGAAAACACA CGCTTGTCGA CGGCGACTAC
GGCTTCCTGA AGCCCTGGTA CGAGAGGGCG CTGACCCTGC AGAAGGATGG CGCGATTCCC
TCCTATGCCT TCCTGAAGAC GTCGAACACA CATTATTCGG CGCTCTTCTT CAATGGCACG
ATCGGCATGC TGCCGATGGG AACCTGGTTC GTCGGCACCC AGATCACCAA GGTGAAATCG
GGTGAATCGA AGAGCAGGAA CTGGGGCATC GTCAAGTTCC CGCATCCCGA CGGTGTGGCA
ACCGGCACGA CCGCTGCGCA GATTTCCGGC CTGGCGGTCA ACGCCAATTC CGACCACAAG
GATGCCGCGC TCGATTTCAT CAAGTTTGTC ACCGGTCCTG AGGGCGCAGC AGTTATCGCG
TCGACGGGCA CCTTCCCTGC GCTCAAGACG GATGATGTCA GCGCCAAGAT CGCGGCAACA
CCCGGATTTC CTGAGGATGC GGCCAGCAAG GAGGCGCTGA AGCCGTCGAA AGCCTACCTG
GAGATGGCGG TCAACCCGAA CGCCGCCAAG ATCGAGGTCG TACTCAACCG GGTGCATGAC
GCGATCATGA CTGACAGCAC CTCCGTCGAT GACGGGCTGA AGGAAATGAC CGAAGGCGTG
AAGGCCATCA AGTAG
 
Protein sequence
MYLDKFGGTV KLAVAGFTLA AMTAGAAFAQ DAVTLKWALW DWDKTAYYKP LIEAYQAKHP 
NVKFEPMDLG SQDYQQMIST QLTGGSKDID IVTIKDVPGY TNLVRAGNIA DLSGFVKDQK
IDPAPFGGLI EELTIDGKIY SLPFRSDFWV VYYNKDIFDK AGVPYPTNDM TWAQFDETAE
KLSGGMGTNK TYGALLHTWR STVQLPAILD GKHTLVDGDY GFLKPWYERA LTLQKDGAIP
SYAFLKTSNT HYSALFFNGT IGMLPMGTWF VGTQITKVKS GESKSRNWGI VKFPHPDGVA
TGTTAAQISG LAVNANSDHK DAALDFIKFV TGPEGAAVIA STGTFPALKT DDVSAKIAAT
PGFPEDAASK EALKPSKAYL EMAVNPNAAK IEVVLNRVHD AIMTDSTSVD DGLKEMTEGV
KAIK
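
The gene and protein sequences above can be cross-checked against each other and against the listed GC content. The following is a rough sketch, not the database's own method: the helper names `clean`, `gc_fraction` and `translate` are hypothetical, and the codon assignments are those of NCBI translation table 11, which match the standard code for every codon. Alternative start codons (for example GTG or TTG read as M at position 1) are not handled; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon-to-residue assignments for NCBI translation table 11,
# with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above; paste the full sequence to check all 424 residues.
first_row = "ATGTATTTGG ATAAATTCGG GGGGACGGTG AAACTTGCCG TGGCAGGATT CACCTTGGCG"
print(translate(first_row))
```

Running `gc_fraction` on the full gene sequence gives a value that can be compared with the 59% listed in the record, and `translate` on the full sequence can be compared with the protein sequence after removing its spaces.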