Gene Rleg_5068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5068 
Symbol 
ID8007661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp449589 
End bp450518 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content61% 
IMG OID644821983 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002973243 
Protein GI241113408 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.449495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGG CCACGCTTGC ATCCATCGCC TTGGCGATAT CGATCTCTGC TGCCCATGCC 
CAGACGATCG GGGTTTCGAT GTCCGGTCTC GACAAATTCA GGACGGCGCT TCTAAACGGC
GTTGTCTCGC ATGGGCAGAC GATATCCGGC CTCAAACTCG TCACCGAGAA TGCCAATGGC
GACAAGGAGC TCCAGAAGCA GCAGGTGCAG AAGCTCATTG CCGACAAGGT CGACGCGATC
ATCCTTGCCG TCTCCGATGG CGACCTCGGG CCGCAAATGA CCAAGATGGC GGCAGATGCC
GGCATTCCGC TGGTGTACAT CAACAACGTT CCTTCCAACC TGCTGGACCT GCCTGACAAT
CAGGTGGTGG TCGCCTCCAA CGAGAAGGAA TCCGGAACGC TGGAGACCAA GCAGGTCTGC
GCGCTCCTTA AAGGCAAAGG CCGCGTCGTC GTGCTGATGG GCGAACCATT CCACGCCGCC
GCGCGTGCCC GCACCCAGGA TATATCAGAC GTCATTGCCA CCCCGGATTG CAGGGGCCTT
CAGATCGTCG AGCGGCAGGC GGCCTATTGG TCGAGCGATT ATGCCGACCA GCAGATGCAG
GAATGGCTGT CGGCCGGCGT CAAGTTCGAC GCGGTCATCG CCAACAATGA CGAGATGGCG
CTCGGCGCGA TCCGGGCCAT GAAGAAGGCC GGCATACCGA TGAAAAATGT CGTCGTCGCC
GGCGTCGACG CGACCGACGA CGCGCTCGCA GCGATGGTCG CCGGCGATCT CGACGTGACC
ATTCTCCAGA GCGCCGTCGG GCAGGGCGCT GCCGCTGTCG ACGCTGCCGT CAAGCTGATC
CGCAAAGAGA AGGTGCCGCG CGAAAACAAC GTTCCCTTCG AACTCGTCAC ACCTGAGAAC
ATTGCCACCT ATCTGCCGAA GAGCCAGTGA
 
Protein sequence
MKTATLASIA LAISISAAHA QTIGVSMSGL DKFRTALLNG VVSHGQTISG LKLVTENANG 
DKELQKQQVQ KLIADKVDAI ILAVSDGDLG PQMTKMAADA GIPLVYINNV PSNLLDLPDN
QVVVASNEKE SGTLETKQVC ALLKGKGRVV VLMGEPFHAA ARARTQDISD VIATPDCRGL
QIVERQAAYW SSDYADQQMQ EWLSAGVKFD AVIANNDEMA LGAIRAMKKA GIPMKNVVVA
GVDATDDALA AMVAGDLDVT ILQSAVGQGA AAVDAAVKLI RKEKVPRENN VPFELVTPEN
IATYLPKSQ