Gene Rleg2_2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2098 
Symbol 
ID6980837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2159437 
End bp2160672 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content63% 
IMG OID643396820 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002281608 
Protein GI209549691 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.072788 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC GTATTATGGC GGCCCTGCTT GCCGCTTCGG TCGCGCTTCC GTTCGGCGCC 
GCCAATGCTA CCGATCTCGA AGTCACGCAT TGGTGGACTT CAGGCGGCGA ATCGGCTGCT
GTCGCCGAGT TGGCAAAGGC ATTCGACGCC ACCGGCAACC ACTGGGTCGA CGGCGCGATC
GCCGGTTCCG GCGGCACCGC TCGTCCGATC ATGATCAGCC GCATCACCGG CGGCGACCCG
ATGGGCGCCA CCCAGTTCAA TCACGGCCGC CAGGCTGAGG AGCTCGTTCA GGCTGGCCTG
ATGCGCGACC TGACCGACGT GGCGACTGCC GAGCACTGGA AGGACATCAT CCGCCCGGCG
AGCCTGCTCG ATTCCTGCAC GATCGACGGC AAGATCTATT GCGCTCCAGT CAACATCCAC
TCCTGGCAGT GGCTGTGGCT GTCGAATGCC GCCTTCAAGA AGGCCGGCGT CGAGGTTCCG
AAGAACTGGG ACGAGTTCGT CGCTGCCGCT CCGGCTCTCG AAAAGGCCGG CATCATTCCG
CTCGCCGTCG GCGGTCAGCC GTGGCAGGCG ACAGGCGCCT TCGACGTGCT GATGGTCGCG
GTTGCCGGCA AGGATACCTT CAACAAGGTT TTCAAGGACA AGGATGCGGA AGTTGCCGCC
GGTCCTGAAA TCGCCAAGGT GTTCAAGGCC GCGGACGATG CGCGGCGCAT GGCCAAAGGC
AGCAACGTAC AGGACTGGAA CCAGGCCACC AACCTCGTCA TCACAGGCAA GGCCGGCGGT
CAGATCATGG GCGACTGGGC GCAGGGTGAA TTCGCGCTCG CCGGTCAGAA GGCCGGCACC
GACTACACCT GCCTGCCGGG CCTCGGCGTG AATGAGATCA TCTCGACCGG CGGCGATGCC
TTCTACTTCC CGCTGCTGAA GGACGAGGAA AAATCCAAGG CGCAGGCCGT GCTTGCCAAG
ACCCTGCTCG ATCCCAAGAC CCAGGTTGCC TTCAACCTGA AAAAGGGCTC GCTGCCGGTT
CGCGGCGATG TCGATCTCGC CGCCGCCAAC GACTGCATGA AGAAGGGTCT CGACATCCTC
GCCAAGGGCA ACGTGATCCA GGGTACCGAC CAGTTGCTTT CGGCCGACAG CCAGAAGCAA
AAGGAAGACC TCTTCTCCGA ATTTTTCGCC AACCCGTCGA TGACGCCGGA GGACGCTCAG
AAGCGTTTCG CCAAAATCAT CGCTTCGGCT GACTGA
 
Protein sequence
MKIRIMAALL AASVALPFGA ANATDLEVTH WWTSGGESAA VAELAKAFDA TGNHWVDGAI 
AGSGGTARPI MISRITGGDP MGATQFNHGR QAEELVQAGL MRDLTDVATA EHWKDIIRPA
SLLDSCTIDG KIYCAPVNIH SWQWLWLSNA AFKKAGVEVP KNWDEFVAAA PALEKAGIIP
LAVGGQPWQA TGAFDVLMVA VAGKDTFNKV FKDKDAEVAA GPEIAKVFKA ADDARRMAKG
SNVQDWNQAT NLVITGKAGG QIMGDWAQGE FALAGQKAGT DYTCLPGLGV NEIISTGGDA
FYFPLLKDEE KSKAQAVLAK TLLDPKTQVA FNLKKGSLPV RGDVDLAAAN DCMKKGLDIL
AKGNVIQGTD QLLSADSQKQ KEDLFSEFFA NPSMTPEDAQ KRFAKIIASA D