Gene Rleg2_3119 details


Gene Information

Locus tag: Rleg2_3119
Symbol:
ID: 6981864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3187665
End bp: 3189020
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 60%
IMG OID: 643397829
Product: extracellular solute-binding protein family 1
Protein accession: YP_002282612
Protein GI: 209550695
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
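
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function name `check_cds` is illustrative and not part of the record.

```python
# Values are copied from the record above; check_cds is a hypothetical helper.
def check_cds(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if coordinates, gene length and protein length agree."""
    span = end_bp - start_bp + 1   # assumes 1-based, inclusive coordinates
    codons = gene_len_bp // 3      # assumes the stop codon is counted in the gene length
    return (span == gene_len_bp
            and gene_len_bp % 3 == 0
            and codons - 1 == protein_len_aa)

print(check_cds(3187665, 3189020, 1356, 451))  # True
```

Under these assumptions the span is 3189020 - 3187665 + 1 = 1356 bp, which is 452 codons: 451 amino acids plus one stop codon.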


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCCA AGCTTCTCGG CGCCGTTGGC GCCCTTCTCG CTACAGCGTT TCTTGCCGGT 
CCCGCTGCCG CCGCCGATAA GACCAAGATT GACTTCTGGT TCGGCAATTC CGGTGACATC
GCAAAGCGTG TCCAGGAACA GTGCGATCGC TTCAACCAGT CGCAGGCCGA TTACGAAGTC
GTCTGCACCA GCCAGGGCAG CTACGACGCG TCCTTGCAGA ACACCATTGC CGCCTTCCGC
GCCGGCAAGC AGCCGACCAT CGCTCAGGTT TCCGACGCCG GCACGCTCGA CATCATGCTT
TCCGGCGCCT ACTACCCGGC CAATCAGCTG ATGACCGACA TGGGCTACAC GGTCGACTGG
AAAGATTACT TCTCCGGCAT CGCCAACTAT TATGCGACGT CCAAGGGCGA GATGTATTCC
TTCCCCTTCA ACTCCTCGAC CGCATTGCTC TACTGGAACA AGGATGCCTT CGCCAAGATC
GGCAAGGACC ATGCTCCGGC CACCTGGCAG GAAGCCGGCG AAGATTTCAA GGCTCTGAAG
GATGCAGGTT ATGCTTGCCC GCTCGCCTTC GACATCTCCA ACAACGAAGT CTGGCAATAT
GTCGAGCAGT TCGAAGCCGT TAACGGCGAA GCGATCGCGA CGAAGAAGAA CGGCTTTGAA
GGCCTCGACG CCGAGCTGAC CTACAACAAG AACCCGCTGC TCGTCAGCTA CATCAAGGAC
CTCAAGTCCT GGTACGACAA CAAGCTGGCT TTCATCAAGA ACAAGGCCGT CGGCCAGACC
TTCGTCGAAG CCTTCGCCGC CGGCGATTGC CAGGTTATCC TCACCTCGGT CGGCGACCAC
GGCAATATCG GCCGCACCGC CAAGCAGGGC ATGAACTGGG GCGTTGCCAT GCTCCCGACC
TACGGCACTG CAACCCGCCA CAGCTCCTAT GTCGGCGGCG CTTCGCTCTG GGTTCTGAAG
GGTCACACCG ACGCCGAATA CAAGGCTGCC GCTGCCTTTT TCAACTTCAT CGCAAAGCCG
GAAGAAGCCC TGACCTGGTC GACGGTCACC GGCTACATCC CGGTTCGTAA CTCCGGCTTC
GAATATCTCA AGAAGCAGGG CTTCTACGAC AAGGCTCCTT ATGCCGGCCG TGAACTCGCC
ATTCAGAGCC TGACGGCATC GCCTGCCGAC GACACGGCGC CGCACGGCAT CCGTCTCGGC
GGCCTGCTTC AGGTCCGCAC CGAAATCGCC AACGGCCTGC AGGCGATCTT CGTCAACAAT
GCCGACGTCC AGGCTTCGCT CGACGGCGCT GCCGAACGCG GCAACCAGCT GCTGCGCCGC
TTCCAGCAGA CCTACAAGAA CGTTCAGCTT CCTTGA
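
The GC content field can be recomputed from a sequence given in the 10-base block layout used above. This is a minimal sketch, not the database's own method; the function name `gc_percent` is hypothetical, and it assumes that only the letters A, C, G and T count as bases (spaces and line breaks are ignored).

```python
def gc_percent(seq_text):
    """Percentage of G and C among the A/C/G/T letters in seq_text."""
    # Drop spaces, newlines and any other non-base characters.
    bases = [c for c in seq_text.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no bases found in input")
    gc = sum(1 for c in bases if c in "GC")
    return 100.0 * gc / len(bases)

# First two blocks of the gene sequence above, as a small example.
print(gc_percent("ATGCAAGCCA AGCTTCTCGG"))
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.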
 
Protein sequence
MQAKLLGAVG ALLATAFLAG PAAAADKTKI DFWFGNSGDI AKRVQEQCDR FNQSQADYEV 
VCTSQGSYDA SLQNTIAAFR AGKQPTIAQV SDAGTLDIML SGAYYPANQL MTDMGYTVDW
KDYFSGIANY YATSKGEMYS FPFNSSTALL YWNKDAFAKI GKDHAPATWQ EAGEDFKALK
DAGYACPLAF DISNNEVWQY VEQFEAVNGE AIATKKNGFE GLDAELTYNK NPLLVSYIKD
LKSWYDNKLA FIKNKAVGQT FVEAFAAGDC QVILTSVGDH GNIGRTAKQG MNWGVAMLPT
YGTATRHSSY VGGASLWVLK GHTDAEYKAA AAFFNFIAKP EEALTWSTVT GYIPVRNSGF
EYLKKQGFYD KAPYAGRELA IQSLTASPAD DTAPHGIRLG GLLQVRTEIA NGLQAIFVNN
ADVQASLDGA AERGNQLLRR FQQTYKNVQL P
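
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(seq_text):
    """Translate a DNA string (spaces and newlines allowed) up to the first stop codon."""
    dna = "".join(c for c in seq_text.upper() if c in "ACGT")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the gene sequence above.
print(translate("ATGCAAGCCA AGCTTCTC"))  # MQAKLL
```

Translating the full gene sequence this way and removing the spaces from the protein listing allows a direct comparison of the two.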