Gene Rleg_1482 details


Gene Information

Locus tag: Rleg_1482
Symbol:
ID: 8012568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1468098
End bp: 1469360
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 63%
IMG OID: 644824071
Product: extracellular solute-binding protein family 1
Protein accession: YP_002975313
Protein GI: 241204217
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
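
The coordinate and length fields in this record are related by simple arithmetic. The following minimal sketch checks that relationship, assuming the start and end coordinates are inclusive and that the protein length excludes the stop codon; both assumptions are consistent with the reported values but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1468098
end_bp = 1469360

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1263 420
```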


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.340124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGT TTTTGACCAC GACCGCAATG ATCGCGTTGG CTCTGGCCGG CGCTGGCGTG 
TCCGCCCACG CCGCCGACGT GAAGGAAGTG CAGATGCTGC ACTGGTGGAC GTCTGGTGGC
GAGGCGGCGG CGCTGAACGT CTTGAAGCAG GATCTTTCGA AGGAAGGTTT TGCCTGGAAG
GACGTGCCTG TTGCTGGCGG TGGCGGCGAT GCGGCGATGA CGGCGCTGAA GGCGATGGTT
GCGGCCGGCA CCTATCCGAC GGCCTCGCAG ATGCTGGGCT ATACCGTGCT CGATTATGCT
CAGGCCGGCG TCATGGGCGA TCTGACGGAG ACGGCCAAGA AGGAAGGCTG GGACAAGTCG
GTTCCGGCAG CGCTGCAGAA GTTCTCGGTC TATGACGGCA AGTGGGTCGC AGCCCCCGTC
AACGTCCACT CGGTCAACTG GCTGTGGATC AACAAGGCTG TGATGGACAA GATCGGCGGC
ACCCAGCCGA AGACCTTCGA CGAGCTGATC GCCCTGCTCG ACAAGGCGAA GGCCGCAGGC
GTCATCCCGC TGGCTCTCGG CGGCCAGAAC TGGCAGGAGG CGACGATGTT CGATTCCATC
GTGCTGTCGA CCGGCGGGCC GGAGTTCTAC AAGAAGGCCT TCAACGACCT CGACGAGGAA
TCGCTGAAGT CCGACACGAT GAAGAAGTCG TTCGACAATC TGGCGACGAT CATCAAATAT
GTCGACCCGA ACTTCTCGGG CCGCGACTGG AACCTGGCAA CCGCCATGGT CATCAAGGGT
GACGCGCTAG TGCAGGTGAT GGGCGACTGG GCCAAGGGCG AATTCGTCGC CGCCAAGAAG
ACGCCGGATA CCGACTTCCT GTGCTACCGC TTCCCCGGCA CCGACGGCAG CGTCGTCTAC
AACTCCGACA TGTTCGGCAT GTTCAACGTT CCCGACGACC GCAAGGCGGC CCAGGTGGCG
CTGGCGACCG CAACGCTGTC GAAGAGCTTC CAGTCGGCCT TCAACGTCGT CAAGGGTTCG
GTGCCGGCTC GTACCGACGT TCCCGATACC GACTTCGATG CTTGCGGCAA GAAGGGCATC
GCCGACCTGA AGGCAGCCAA CGAAGGCGGC ACGCTGTTCG GCTCGCTGGC ACAGGGCTAC
GGCGCTCCTC CGGCGATCGC CAATGCCTAC AAGGATGTCG TCTCGAAGTT CGTCCACGGC
CAGATCAAGA CCTCCGACGA AGCCGTCAAG CAGCTCGTCC AGGCGATCGA CGACGCCCGC
TGA
 
Protein sequence
MNKFLTTTAM IALALAGAGV SAHAADVKEV QMLHWWTSGG EAAALNVLKQ DLSKEGFAWK 
DVPVAGGGGD AAMTALKAMV AAGTYPTASQ MLGYTVLDYA QAGVMGDLTE TAKKEGWDKS
VPAALQKFSV YDGKWVAAPV NVHSVNWLWI NKAVMDKIGG TQPKTFDELI ALLDKAKAAG
VIPLALGGQN WQEATMFDSI VLSTGGPEFY KKAFNDLDEE SLKSDTMKKS FDNLATIIKY
VDPNFSGRDW NLATAMVIKG DALVQVMGDW AKGEFVAAKK TPDTDFLCYR FPGTDGSVVY
NSDMFGMFNV PDDRKAAQVA LATATLSKSF QSAFNVVKGS VPARTDVPDT DFDACGKKGI
ADLKAANEGG TLFGSLAQGY GAPPAIANAY KDVVSKFVHG QIKTSDEAVK QLVQAIDDAR
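
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first row of the gene sequence, and also shows one way a GC percentage could be computed. It is not the database's own method. The function names `clean`, `translate`, and `gc_percent` are hypothetical. The codon-to-amino-acid mapping used is the standard one, which translation table 11 shares; table 11 differs in which codons may act as start codons, and that is not handled here because this gene begins with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence display."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence shown in this record.
first_row = "ATGAACAAGT TTTTGACCAC GACCGCAATG ATCGCGTTGG CTCTGGCCGG CGCTGGCGTG"
print(translate(first_row))  # MNKFLTTTAMIALALAGAGV
```

The printed result matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the protein sequence and the GC content field to be checked in the same way.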