Gene Rleg2_4015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4015 
Symbol 
ID6982785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4185170 
End bp4186456 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content61% 
IMG OID643398744 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002283503 
Protein GI209551586 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.946152 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTAC GTGTAAGCAG ACGTAATTTC ATAGCGGGAG GCGCAACGCT TCTTTCGCTC 
TCGGCGCTCG GTACCAGCGC TTTTGCACAG GAAAAGCGCT TGCGCCTCCT GTGGTGGGGC
TCGCAGCCGC GCGCCGACCG CACCAACAAG GTTTCGCAAC TCTATCAGAC GAAGAATGCC
GGTACGTCGA TCACCGGCGA ATTCCTCGGC TGGGGCGATT ACTGGCCGCG CCTTGCGACC
CAGGTCGCCG GCCGCAATGC GCCCGACGTC ATCCAGATGG ATTATCGCTA TATCGTTCAG
TATGCACGGC GTGGCGCACT CGCACCGCTT GAATCCTATA TGCCGGCCAA GCTCAACCTT
GACGATTTCG ACAAGGCGCA GATCGAAGGC GGCAGCGTCG ACGGCCATCT CTATGGCGTC
AGCCTGGGTG CAAACTCGGC CGCGACGGTG CTGAACACCA CCGCCTTCAA GGAAGCCGGG
GTCGATCTGC CGACACAGGC GACCACCTGG GAAGAATTCG GCCGTATGGG TGCGGAGATC
ACCAAGGCAG GCAAACGCAA GGGCATGTTC GGCCTCGCCG ACGGCAGTGG CGGTGAACCG
CTGTTCGAAA ACTGGCTGCG CCAGCGCGGC AAGGGCCTCT ATACCGCCGA CGGCAAGATC
GCCTTCGACG TCGACGATGC ATCGGAATGG TATGACATGT GGGCGAAGTT CCGTGAGGCC
GGCGCTTGCG TTCCCGCCGA TATCCAGGCT CTCGACAAGA ACGATATCGA AACCAACACG
GTGTCGCTCG GCAAGGCAGC CGCCGGTTTT GCACATTCAA ACCAGTTCGT CGCCTATCAG
GCCATGAACA AGGACAAGCT GGCGCTCACC AATTACATGC GCATCAAGGC GGATTCGAAG
GGCGGCCACT ACCGCAAGCC TTCGATGTTC TTCTCGGTCT CGGCCCAGTC GAAAGCGATC
GACTTGGCAG TGGATTACAT CAACTTCTTC GTCAAGAACC CCGAAGCAGT GCTGCTCTTG
GATGTCGAAC GCGGCATTCC GGAATCGGCT GCCATGCGCG AGGTTGTTGC GGCGAAACTC
GATGAGAACG GCAAGGTCGC GCTGGCCTAT GTCAGCGGCC TTGGCGACCT CGCTGGCAAA
TTGCCGCCGC CGCCGCCGGC CGGCGCCGGT GAAGGCGAGC TGATGCTGCG CAACATCGCC
GAACAGGTCG GCTTCGGCCA GCTGTCTCCT TCCGATGGCG GCAAACAGCT TGTCACCGAA
ATCACGCAGA TTCTCGCACG AGGCTGA
 
Protein sequence
MSLRVSRRNF IAGGATLLSL SALGTSAFAQ EKRLRLLWWG SQPRADRTNK VSQLYQTKNA 
GTSITGEFLG WGDYWPRLAT QVAGRNAPDV IQMDYRYIVQ YARRGALAPL ESYMPAKLNL
DDFDKAQIEG GSVDGHLYGV SLGANSAATV LNTTAFKEAG VDLPTQATTW EEFGRMGAEI
TKAGKRKGMF GLADGSGGEP LFENWLRQRG KGLYTADGKI AFDVDDASEW YDMWAKFREA
GACVPADIQA LDKNDIETNT VSLGKAAAGF AHSNQFVAYQ AMNKDKLALT NYMRIKADSK
GGHYRKPSMF FSVSAQSKAI DLAVDYINFF VKNPEAVLLL DVERGIPESA AMREVVAAKL
DENGKVALAY VSGLGDLAGK LPPPPPAGAG EGELMLRNIA EQVGFGQLSP SDGGKQLVTE
ITQILARG