Gene Information |
Locus tag | Rleg_6692 |
Symbol | |
ID | 8022602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 122235 |
End bp | 123137 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644833559 |
Product | transcriptional regulator, LysR family |
Protein accession | YP_002984693 |
Protein GI | 241666609 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | [TIGR03418] putative choline sulfate-utilization transcription factor |
|
|
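The Start bp, End bp, Gene Length, and Protein Length fields above are arithmetically linked, and a short check makes the relationship explicit. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but neither is stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(122235, 123137)
print(length, protein_length_aa(length))  # prints "903 300"
```

The result agrees with the reported Gene Length (903 bp) and Protein Length (300 aa).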
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.363849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.247449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGACC GCCGGCCTGA GCTGGGATGG ATGCGCATCT TTACGGAGGT CGCAAGGCTC GGCAGCTTTT CGGCCGCGGC GGCAGGTCTC GGTCTTACGC AGCCTGCTGT CAGTTATCAG ATTCGGCGGC TGGAAGAGCA GTTCGGCGTC GCCCTGCTGC GCCGCCAGCA CCGCGGCGTC GAGTTGACCG CGGAGGGAGA GCGGCTTTTC CAGGTCGCCG CCAAGACGGT CGGTGATATC GATGCCCTGG CGCGCAGTTT CCGCACTGAG GCTCAAAGGC CGGTGGTCAG GCTGAGAACC GACTATGCCT TTTCAGCGCT TTGGCTGATC CCGCGCATGC ACGGCTTTCG CCTGCTTCAC CCCGAAACGG ATATACAGAT CGTCGCGACC CAAAGGCTTG AACCCGGCTT TCGTGACGAC GCGGATGTGG TGGTGGTTTT CGGCACCAAA GCGGAATTCG GCGCCATCGG ATCGCTTCTG CTGCAGGAAA AGGTCGTGCC CGTCTGCACG CGAGGCTTTC TCGATCGCAA CGGTCCGTTC GACGAACCGC AGCAGCTTGC CAAGGCGATC CTGATTCATC TCGACTCGCC GATGCCATCG CCCTGGTTCG ATTGGCGAAG TTATCTTGCC GAATTCTCCG TTACTCGCGA TCTCCATGCC GGCCGCGGTG ATGTCAGTTT CAACACCTAC TCGCTGGTCG TCCAGGCCGC TCTCAGCGAA CAGGGCGTGG CGATCGGCTG GATGGGGCTG GTCGATACGC TTCTCTCCAC ACATATGCTG GTGGAAGCCG GGCCGCCGCT CGAGGCCTGG GACCGCGGTT ACTGGCTGAT ACCGCCGCGA TCGGCAAATG TTGATAGCGA GAGGCTCAGC ACCTGGCTGG TGGATGAAGT CGGCAGGACA TGA
|
Protein sequence | MPDRRPELGW MRIFTEVARL GSFSAAAAGL GLTQPAVSYQ IRRLEEQFGV ALLRRQHRGV ELTAEGERLF QVAAKTVGDI DALARSFRTE AQRPVVRLRT DYAFSALWLI PRMHGFRLLH PETDIQIVAT QRLEPGFRDD ADVVVVFGTK AEFGAIGSLL LQEKVVPVCT RGFLDRNGPF DEPQQLAKAI LIHLDSPMPS PWFDWRSYLA EFSVTRDLHA GRGDVSFNTY SLVVQAALSE QGVAIGWMGL VDTLLSTHML VEAGPPLEAW DRGYWLIPPR SANVDSERLS TWLVDEVGRT
|
| |
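The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed 5' to 3' on the coding strand (the record gives Strand as "-", yet the listed sequence begins with ATG), and it uses the standard codon assignments, which translation table 11 shares for non-initiator codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same assignments for codons other than alternative start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks of the record can be
    pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and final two blocks of the gene sequence in this record.
print(translate("ATGCCAGACC GCCGGCCTGA GCTGGGATGG"))  # prints "MPDRRPELGW"
print(translate("GTCGGCAGGACATGA"))                   # prints "VGRT"
```

The first output matches the opening block of the protein sequence (MPDRRPELGW), and the second matches its last four residues, with the terminal TGA read as a stop codon.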