Gene Rleg_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1904 
Symbol 
ID8012952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1890037 
End bp1890999 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content59% 
IMG OID644824493 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002975725 
Protein GI241204629 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.591469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.698974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGA AGACTGCACT TTTGAGTGCC ACGATTCTGG CTGCCTGCAT GTTCGGTTCG 
GCTTCGGCCG CCGGCCTCAC CGTCGGCTTC TCGCAGATCG GCTCGGAATC GGGCTGGCGT
GCGGCTGAAA CGACAGTGAC CAAGGAGCAG GCCAAGAAGC GCGGCATCGA TCTGAAGTTT
GCCGATGCGC AGCAGAAGCA GGAAAACCAG ATCAAGGCTT TGCGCTCCTT CATTGCTCAG
GGCGTCGATG CCATTCTCAT TGCTCCGGTC GTTGAAACCG GCTGGGACGA CGTTCTCAAG
GAAGCCAAGG AAGCCAAGAT TCCGGTCATC CTTCTCGACC GTACCATCAA GGCTCCGGAC
GATCTCTATC TGACGGCTGT TACGTCAGAC CTGGTCCATG AAGGCAAGGT CGCAGGCGAC
TTCCTGGTCA AGACCGTCGG CGACAAGAAG TGCAATGTGG TTGAACTTCA GGGCACCACC
GGTTCGTCGC CGGCCATCGC CCGCAAGAAG GGCTTCGAAG AGGCCCTTGC CGGCCACGAC
AACCTCAAGA TCGTTCGCAG CCAGACCGGT GACTTCACCC GCACCAAGGG CAAGGAAGTC
ATGGAAAGCT TCCTGAAGGC CGAGAATGGC GGCAAGGATA TCTGCGCTCT CTACGCCCAT
AACGACGACA TGGCCGTCGG CGCCATCCAG GCGATCAAGG AAGCCGGTCT GAAGCCTGGC
AAGGATATCC TCGTCGTCTC CATCGACGCC GTACCGGATA TCTTCAAGGC GATGTCCGAA
GGCGAGGCCA ACGCCACGGT CGAGCTGACG CCGAATATGG CAGGTCCGGC CTTCGATGCG
CTGGAAGCCT ATCTGAAGGA CAAGAAGGCT CCGGCCAAGT GGATCCAGAC CGAGTCCAAG
CTCTACACGC CTGCCGACGA GCCGATGAAG GTCTATGAAG AGAAAAAGGG TCAGGGCTAC
TGA
 
Protein sequence
MKLKTALLSA TILAACMFGS ASAAGLTVGF SQIGSESGWR AAETTVTKEQ AKKRGIDLKF 
ADAQQKQENQ IKALRSFIAQ GVDAILIAPV VETGWDDVLK EAKEAKIPVI LLDRTIKAPD
DLYLTAVTSD LVHEGKVAGD FLVKTVGDKK CNVVELQGTT GSSPAIARKK GFEEALAGHD
NLKIVRSQTG DFTRTKGKEV MESFLKAENG GKDICALYAH NDDMAVGAIQ AIKEAGLKPG
KDILVVSIDA VPDIFKAMSE GEANATVELT PNMAGPAFDA LEAYLKDKKA PAKWIQTESK
LYTPADEPMK VYEEKKGQGY