Gene Rleg_3391 details

Gene Information

Locus tag: Rleg_3391
Symbol:
ID: 8014268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3406846
End bp: 3408201
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 60%
IMG OID: 644825949
Product: extracellular solute-binding protein family 1
Protein accession: YP_002977176
Protein GI: 241206080
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
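
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive, 1-based coordinates, and the variable names are hypothetical rather than part of any database API.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 3406846
end_bp = 3408201

# Assumption: coordinates are inclusive, so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is the stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1356
print(protein_length_aa)  # 451
```

Both results match the Gene Length and Protein Length fields of the record.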


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.297931
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCCA AGCTTCTCGG CGCCGTTGGC GCCCTTCTCG CTACAGCGTT TCTTGCTGGT 
CCCGCTGCCG CCGCCGATAA GACCAAGATC GACTTCTGGT TCGGCAATTC CGGGGACATC
GCAAAGCGTG TCCAGGAGCA GTGCGATCGC TTCAACCAGT CGCAGGCCGA CTACGAAGTC
GTCTGCACCA GCCAGGGCAG CTATGACGCC TCCCTGCAGA ACACCATCGC CGCCTTCCGC
GCCGGCAAGC AGCCGACCAT CGCCCAGGTC TCCGACGCCG GCACCCTCGA CATCATGCTC
TCCGGCGCCT ACTACCCGGC AAACAAGCTG ATGACCGACA TGGGCTATAC CGTCGACTGG
AAGGACTATT TCTCCGGTAT CTCCGGCTAT TACGCGACAT CGAAGGGCGA GATGTACTCC
TTCCCCTTCA ACTCCTCGAC CGCTCTTCTC TACTGGAACA AGGACGCCTT CGCCAAGATC
GGCAAGGATC ATGCTCCGGC AACCTGGAAG GAAGCAGGCG AGGACCTCAA GGCTCTGAAG
GATGCAGGCT ATGCTTGCCC ACTCGGCTTT GACATCTCCA ACAATGAAGT CTGGCAGTAC
ATCGAGCAGT TCGAAGCCGT CAACGGCGAA GCGATCGCCA CGAAGAAGAA CGGCTTTGAA
GGTCTGGACG CCGAGCTGGT GTTCAACAAG AACCCGCTTC TCGTCAGCTA CGTCAAGGAT
CTCAAGTCCT GGTACGACGA CAAGCTTGTC GTCATCAAGA ACAAGGCTGT CGGCCAGACC
TTCGTCGAAG CCTTTGCCGC CGGCGATTGC CAGGTCATCC TGACCTCGGT CGGCGACCAC
GGCAATGTCG GCCGCACCGC CAAGCAAGGC ATGAACTGGG ACGTTGCCAT GCTCCCGACC
TACGGCGACG CAGCCCGTCA CAGCTCTTAC GTCGGCGGCG CTTCGCTCTG GGTTCTGCAG
GGTCACTCCG ACGCCGAATA CAAGGCTGCC GCTGCCTTCT TCAACTTCAT CGCAAAGCCG
GAAGAAGCTC TTACCTGGTC GACCGTTACC GGCTACATCC CGGTTCGCAA CTCCGGTTTC
GAATATCTGA ATAAGCAGGG CTTCTACGGC AAGGCGCCTT ATGCCGGCCG CGAACTCGCC
ATTCAGAGCC TGACCGCTTC TCCGGCTGGC GATGCGGCTC CGCAGGGCAT CCGCCTCGGT
GGCCTGCTGC AGGTCCGCAC CGAAATCGCC AATGGTCTGC AGGCAATCTT CGTTAACAAT
GCCGATGTCC AGGCTTCGCT CGACAGTGCT GCCGAACGCG GCAATACGCT GCTCCGTCGC
TTCCAGCAGA CCTACAAGAA CGTTCAGCTC CCCTGA
 
Protein sequence
MQAKLLGAVG ALLATAFLAG PAAAADKTKI DFWFGNSGDI AKRVQEQCDR FNQSQADYEV 
VCTSQGSYDA SLQNTIAAFR AGKQPTIAQV SDAGTLDIML SGAYYPANKL MTDMGYTVDW
KDYFSGISGY YATSKGEMYS FPFNSSTALL YWNKDAFAKI GKDHAPATWK EAGEDLKALK
DAGYACPLGF DISNNEVWQY IEQFEAVNGE AIATKKNGFE GLDAELVFNK NPLLVSYVKD
LKSWYDDKLV VIKNKAVGQT FVEAFAAGDC QVILTSVGDH GNVGRTAKQG MNWDVAMLPT
YGDAARHSSY VGGASLWVLQ GHSDAEYKAA AAFFNFIAKP EEALTWSTVT GYIPVRNSGF
EYLNKQGFYG KAPYAGRELA IQSLTASPAG DAAPQGIRLG GLLQVRTEIA NGLQAIFVNN
ADVQASLDSA AERGNTLLRR FQQTYKNVQL P
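
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so its GC value is not expected to equal the figure for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used only as sample input.
first_line = "ATGCAAGCCA AGCTTCTCGG CGCCGTTGGC GCCCTTCTCG CTACAGCGTT TCTTGCTGGT"

seq = clean_sequence(first_line)
print(len(seq))                       # number of bases after joining the blocks
print(round(gc_fraction(seq) * 100))  # GC percentage of this sample line only
```

Passing the complete gene sequence through the same two helpers would give a value to compare against the reported GC content.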