Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5789 |
Symbol | |
ID | 6977178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 198592 |
End bp | 199497 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393244 |
Product | transcriptional regulator, LysR family |
Protein accession | YP_002278062 |
Protein GI | 209546172 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | [TIGR03418] putative choline sulfate-utilization transcription factor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.04167 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGACC ACTCGCTTGA GCTTGGATGG ATGCGCATTT TCGTCGAGGT CGCGAGACTC GGGAGCTTTT CGTCGGCCGC AGCGCTGCTC GGGCTTACCC AGCCGGCCGT CAGCTACCAG ATCCGCCGGC TGGAGGAGCA GTTCGGCGTC AGCCTGCTGC GCCGCCAGCA TCGCGGCGTG ACGCTGACTG CCGAAGGCGA GCGGCTTCTC GACGTCACCG CCAAAGCGGT CGGCGACATC GATGCGCTCG CCCGCAGCTT CCGTGCCGAA GCCCAGAGAC CGGTCGTTCG GCTCAGAACA GACTATGCCT TTTCATCGCT TTGGCTGATC CCGCGCATGG ACGGCTTTCG GCTTCTTCAT CCGGAAACGG ACATCCAGAT CGTCGCGACG CAGCGGTTTG CAGCCGGTTT TCGCGACGAA GCGGATGTCG CGGTCGTTTT CGGCACGCGG GCGGAATTCG GCGCCGCCGG TACGCTTCTG CTGCAGGAAA AAGTCGTGCC GGTCTGCACG CAGGGATTTC TCGATCGCAA CGGTCCGTTC GAGGATGCGA GGCAGCTTGC CAAGGCGGTG CTGATCCATC TCGACACGCC GATGCCATCG CCCTGGTTCG ACTGGCGCAG TTATCTGACG GAATTTTCGG TTGTCCGCGA CATCAATGCC GGCCGCGGCG ATATCAGCTT CAATACCTAC TCGCTGGTCA TCCAGGCTGC GTTGAGCGGG CAGGGCGTGG CGATCGGCTG GATGGGGCTC GTCGATACGC TGCTCCAAGC GGGCATGCTG GTCGAAGCCG GCCCGCCGCT TGAGGCGCAG GATCGCGGTT ACTGGCTGGT GCCACCGCGA TCTCCCAGCC CGCACAGCGA AAGCCTCGGT GCCTGGCTAG TGGGCGAAGT CGGAGGGAGC GGCTGA
|
Protein sequence | MPDHSLELGW MRIFVEVARL GSFSSAAALL GLTQPAVSYQ IRRLEEQFGV SLLRRQHRGV TLTAEGERLL DVTAKAVGDI DALARSFRAE AQRPVVRLRT DYAFSSLWLI PRMDGFRLLH PETDIQIVAT QRFAAGFRDE ADVAVVFGTR AEFGAAGTLL LQEKVVPVCT QGFLDRNGPF EDARQLAKAV LIHLDTPMPS PWFDWRSYLT EFSVVRDINA GRGDISFNTY SLVIQAALSG QGVAIGWMGL VDTLLQAGML VEAGPPLEAQ DRGYWLVPPR SPSPHSESLG AWLVGEVGGS G
|
| |