Gene Rleg_2337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2337 
Symbol 
ID8013330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2341145 
End bp2342380 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content62% 
IMG OID644824920 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002976150 
Protein GI241205054 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.836128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCC GTATATTGGC GGCCGTACTT GCCGCTTCGG TCGCGCTTCC GTTCGGCGCC 
GCAAATGCCA CCGATCTCGA AGTCACGCAT TGGTGGACTT CGGGCGGCGA ATCAGCTGCC
GTCGCCGAAC TGGCAAAGGC ATTCGACGCT ACCGGCAACC ACTGGGTCGA CGGTGCTATC
GCCGGTTCCG GCGGCACTGC CCGTCCGATC ATGATCAGCC GCATCACCGG CGGCGACCCA
ATGGGCGCCA CCCAGTTCAA CCATGGCCGC CAGGCCGAGG AACTCGTTCA GGCCGGGCTG
ATGCGCGATC TGACGGATGT GGCGACTGCC GAGCACTGGA AGGACATCGT TCGTCCGTCG
AGCCTGCTCG ATTCCTGCAC CATCGACGGC AAGATCTATT GCGCTCCCGT CAACATCCAC
TCCTGGCAGT GGTTGTGGCT GTCGAACGCC GCCTTCAAGA AGGCCGGCGT CGAAGTTCCG
AAGAACTGGG ACGAGTTCGT CGCCGCCGCT CCGGCGCTCG AAAAGGCCGG CATCATTCCG
CTCGCCGTCG GCGGTCAGCC GTGGCAGGCG ACTGGCGCCT TCGACGTGCT GATGGTTGCC
GTCGCCGGCA AGGATACCTT CAACAAGGTC TTCAAGGACA AGGATGCGGA AGTTGCCGCC
GGTCCCGAAA TCGCCAAGGT GTTCAAGGCG GCCGACGACG CTCGGCGCAT GGCCAAAGGC
AGCAACGTCC AGGATTGGAA CCAAGCCACC AACCTTGTCA TCACAGGCAA GGCCGGCGGT
CAGATCATGG GCGACTGGGC GCAGGGTGAG TTCGCGCTCG CCGGTCAGAA GGCCGGTACC
GACTATACCT GCCTGCCGGG CCTCGGCGTG AACGAGATCA TCTCGACTGG CGGCGACGCC
TTCTACTTCC CGCTGCTGAA GGACGAGGAA AAGTCCAAGG CGCAGGCCGT GCTTGCCAAG
ACCCTGCTCG ATCCCAAGAC CCAGGTTGCC TTCAACCTGA AGAAGGGTTC TCTGCCGGTT
CGCGGCGACG TCGATCTCGC CGCCGCCAAC GATTGCATGA AGAAGGGTCT CGAAATCCTG
GCCAAGGGCA ACGTGATCCA AGGTACCGAC CAGCTGCTTT CGGCCGACAG CCAGAAGCAG
AAGGAAGACC TCTTCTCCGA ATTCTTCGCC AACCCATCGA TGACGCCGGA AGACGCTCAG
AAGCGTTTCG CCGGGATCAT CGCTTCTGCT GACTGA
 
Protein sequence
MKIRILAAVL AASVALPFGA ANATDLEVTH WWTSGGESAA VAELAKAFDA TGNHWVDGAI 
AGSGGTARPI MISRITGGDP MGATQFNHGR QAEELVQAGL MRDLTDVATA EHWKDIVRPS
SLLDSCTIDG KIYCAPVNIH SWQWLWLSNA AFKKAGVEVP KNWDEFVAAA PALEKAGIIP
LAVGGQPWQA TGAFDVLMVA VAGKDTFNKV FKDKDAEVAA GPEIAKVFKA ADDARRMAKG
SNVQDWNQAT NLVITGKAGG QIMGDWAQGE FALAGQKAGT DYTCLPGLGV NEIISTGGDA
FYFPLLKDEE KSKAQAVLAK TLLDPKTQVA FNLKKGSLPV RGDVDLAAAN DCMKKGLEIL
AKGNVIQGTD QLLSADSQKQ KEDLFSEFFA NPSMTPEDAQ KRFAGIIASA D