Gene Rleg2_1720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1720 
Symbol 
ID6980457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1755875 
End bp1756837 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content59% 
IMG OID643396443 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002281233 
Protein GI209549316 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000157019 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGA AGACTGCACT TTTGAGTGCC ACGATTCTTG CTGCCTGCAT GTTCGGTTCG 
GCTTCGGCCG CCGGATTGAC CGTCGGCTTC TCGCAGATCG GCTCGGAATC GGGCTGGCGC
GCGGCTGAAA CGACTGTAAC CAAGGAACAG GCCAAGAAGC GCGGCATCGA TCTGAAGTTT
GCCGATGCAC AGCAGAAGCA GGAAAATCAG ATCAAGGCTC TGCGCTCCTT TATTGCTCAG
GGCGTCGATG CCATTCTCAT TGCTCCGGTC GTCGAAACCG GCTGGGACGA CGTTCTGAAA
GAAGCCAAGG AAGCCAAGAT TCCGGTCATC CTTCTCGACC GCACCATCAA GGCTCCTGAT
GATCTCTACC TGACGGCTGT CACCTCGGAC CTCGTCCACG AAGGCAAGGT CGCCGGTGAC
TTCCTGGTCA AGACCGTCGG CGACAAGAAG TGCAACGTCG TCGAGCTGCA GGGCACGACC
GGTTCGTCGC CGGCCATCGC CCGCAAGAAG GGCTTCGAAG AAGCTCTCGC CGGCCACGAC
AACCTCAAGA TCGTTCGCAG CCAGACCGGT GACTTCACCC GCACCAAGGG CAAGGAAGTC
ATGGAAAGCT TCCTGAAGGC CGAGAATGGC GGCAAGGATA TCTGCGCTCT CTACGCCCAT
AATGACGATA TGGCCGTCGG CGCCATCCAG GCGATCAAGG AAGCCGGCCT GAAGCCCGGC
AAGGATATCC TCGTCGTCTC GATCGACGCC GTTCCGGATA TCTTCAAGGC TATGTCCGAA
GGCGAGTCCA ATGCCACGGT CGAGCTGACG CCTAACATGG CAGGTCCGGC CTTCGATGCG
CTCGACGCCT ACCTGAAGGA CAAGAAGGCT CCGCCGAAGT GGATCCAGAC CGAATCCAAG
CTCTATACGC CTGCCGACGA GCCGATGAAG GTCTACGAAG AGAAGAAGGG TCAGGGCTAC
TGA
 
Protein sequence
MKLKTALLSA TILAACMFGS ASAAGLTVGF SQIGSESGWR AAETTVTKEQ AKKRGIDLKF 
ADAQQKQENQ IKALRSFIAQ GVDAILIAPV VETGWDDVLK EAKEAKIPVI LLDRTIKAPD
DLYLTAVTSD LVHEGKVAGD FLVKTVGDKK CNVVELQGTT GSSPAIARKK GFEEALAGHD
NLKIVRSQTG DFTRTKGKEV MESFLKAENG GKDICALYAH NDDMAVGAIQ AIKEAGLKPG
KDILVVSIDA VPDIFKAMSE GESNATVELT PNMAGPAFDA LDAYLKDKKA PPKWIQTESK
LYTPADEPMK VYEEKKGQGY