Gene Rleg2_1378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1378 
Symbol 
ID6980106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1398742 
End bp1400004 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content62% 
IMG OID643396099 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002280898 
Protein GI209548981 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGGT TTTTGACCAC GACGGCGATG ATCGCATTGG CTCTGGCCGG CGCGCATGTC 
TCCGCCCGCG GAGCCGACGT GAAGGAAGTT CAGATGCTGC ATTGGTGGAC GTCTGGCGGC
GAGGCGGCGG CTTTGAACGT GCTGAAGCAG GATCTGTCGA AGGAAGGTTT TGCCTGGAAG
GACGTTCCAG TGGCCGGCGG CGGCGGTGAT GCGGCGATGA CGGCACTGAA GGCGATGGTT
GCGGCCGGCA CCTATCCGAC GGCCTCGCAG ATGCTGGGCT ATACCGTGCT CGATTATGCC
CAGGCCGGCG TCATGGGCGA CCTGACCGAG ACGGCGAAGA AGGAAGGCTG GGACAAGTCG
GTGCCGGCGG CGCTGCAGAA GTTCTCGGTC TATGACGGCA AGTGGGTCGC AGCCCCTGTT
AACGTGCACT CGGTCAACTG GCTGTGGATC AACAAGGCGG TGATGGACAA GATCGGCGGC
ACCCAGCCGA AGACCTTCGA CGATCTGATC GCGCTGCTCG ACAAGGCCAA GGCCGCAGGT
GTCATCCCCT TGGCGCTCGG CGGTCAGAAC TGGCAGGAAG CGACGATGTT CGATTCCATC
GTGCTGTCGA CCGGCGGGCC GGAATTCTAC AAGAAGGCCT TCAACGATCT CGATGAGGAG
TCGCTGAAGT CGGACACGAT GAAGAAGTCC TTCGACAATC TGGCGACGAT CATCAAATAT
GTCGATCCGA ACTTCTCCGG CCGCGACTGG AACCTGGCGA CCGCCATGGT CATCAAGGGT
GATGCGCTGG TGCAGGTGAT GGGCGACTGG GCCAAGGGCG AATTCGTCGC CGCCAAGAAG
ACCCCGGATA CCGACTTCCT CTGCTACCGC TTCCCCGGCA CCGAAGGCAG CGTCGTCTAT
AACTCCGACA TGTTCGGCAT GTTCAACGTT CCCGATGACC GCAAGGCCGC TCAGGTGGCG
CTGGCAACCG CGACGCTGTC GAAGAGCTTC CAGTCGGCCT TCAACGTCGT CAAGGGTTCG
GTGCCGGCCC GCACCGACGT TCCCGACACC GACTTCGATG CCTGCGGCAA GAAGGGCATC
GCCGATCTGA AGGCGGCCAA TGAGGGCGGC ACGCTGTTCG GCTCGCTGGC CCAGGGCTAT
GGCGCGCCTC CGGCCATCGC CAATGCCTAT AAGGACGTGG TCTCGAAGTT CGTCCACGGC
CAGATCAAGA GCTCCGACGA AGCCGTCAAG CAGCTCGTCC AGGCGATCGA CGACGCTCGC
TGA
 
Protein sequence
MNRFLTTTAM IALALAGAHV SARGADVKEV QMLHWWTSGG EAAALNVLKQ DLSKEGFAWK 
DVPVAGGGGD AAMTALKAMV AAGTYPTASQ MLGYTVLDYA QAGVMGDLTE TAKKEGWDKS
VPAALQKFSV YDGKWVAAPV NVHSVNWLWI NKAVMDKIGG TQPKTFDDLI ALLDKAKAAG
VIPLALGGQN WQEATMFDSI VLSTGGPEFY KKAFNDLDEE SLKSDTMKKS FDNLATIIKY
VDPNFSGRDW NLATAMVIKG DALVQVMGDW AKGEFVAAKK TPDTDFLCYR FPGTEGSVVY
NSDMFGMFNV PDDRKAAQVA LATATLSKSF QSAFNVVKGS VPARTDVPDT DFDACGKKGI
ADLKAANEGG TLFGSLAQGY GAPPAIANAY KDVVSKFVHG QIKSSDEAVK QLVQAIDDAR