Gene Information |
Locus tag | Rleg_4336 |
Symbol | |
ID | 8015115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4457721 |
End bp | 4458740 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826912 |
Product | transcriptional regulator, LacI family |
Protein accession | YP_002978115 |
Protein GI | 241207019 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
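The two length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(4457721, 4458740)
print(length, protein_length(length))
```

Under these assumptions the sketch reproduces the 1020 bp and 339 aa reported in the record.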
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.563264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000141472 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGACC AAAAGATCAG ACGGCCGCGT CAGGCCGATA TAGCTACATT GGCCGGCGTT TCCGTCTCCA CGGTGTCACG CGTGCTCGCC AACGAACCTG GTATCAGCGA AACGGTGCGC CGCCAGATAT TGAAGGTGGC GGCCGAGAAC GGCTATCCCG TCAAGCCTGC TTCCGAGGCC GTTGCGGGGG GGCTGGCACT GATTGCCAGT GACGGCGTCA CCGGCACTCT CAGCGTCTTT TATGAAGCGA TCGTCGACGG CCTGCGTGCC GGCGCTGCCG AAGCGGGCAT GCCTTTCGAA GTCCGGTTGG TCCGCGAGGA CCGAACCACC CCGGATGCCG TGCGTGACTA TATGCAGACG GCAGGCGCCG AAGGCCTCTT TCTCGTCGGC ATCGATCCGA ACGAGACGTT GCGCGACTGG CTGCAAACCA GCATGACACC CACGGTTCTT GTCAACGGCA CCGATCCGAG GATGCAGTTC GATGGCGTTT CGCCGGCTAA TTTCTTTGGT GCCTATGAGG CGACCAGCCG GCTGACAAAA GCCGGCCATC GCCGCATCCT GCATCTGAGC GGTTCTCACC GCCATACGAT CCGGGAGCGC GTGCGCGGTT TCGAGGCGGC GATCGCCGCC GTCCCCGGCG CTGAGGGCCG TCTCCTGTCC CTGGCCCTTC AAGGCAGCGC CAGCCGAGAG GCGCATGAAC GCACGGTAGC AGCACTTGCC GAGGATGCCG GTTTTACCGC CGCCTTCTGC ATGAATGATT TCATCGCCGT CGGCGTGCTC GAAGCCGTCA CCGAGGCCGG CCTGCGTGTG CCGGAGGATT TCGCGATTGT CGGCTTCGAC GATCTGCCCT GCGCGCAAAT GACCAATCCG CAACTTTCCA CCATGCGTGT CGACCGCGCT GCCCTCGGGC GCGAGGCCGT TTCGCTGATG CTGTCCCGTT TCCGCAACAG GACGGCCTCT GCGCGCCACA TCTGCCAGGC GGTCGTTCCC ATTCCGGGAG GGACCGTTCC GAACGCCTAG
|
Protein sequence | MNDQKIRRPR QADIATLAGV SVSTVSRVLA NEPGISETVR RQILKVAAEN GYPVKPASEA VAGGLALIAS DGVTGTLSVF YEAIVDGLRA GAAEAGMPFE VRLVREDRTT PDAVRDYMQT AGAEGLFLVG IDPNETLRDW LQTSMTPTVL VNGTDPRMQF DGVSPANFFG AYEATSRLTK AGHRRILHLS GSHRHTIRER VRGFEAAIAA VPGAEGRLLS LALQGSASRE AHERTVAALA EDAGFTAAFC MNDFIAVGVL EAVTEAGLRV PEDFAIVGFD DLPCAQMTNP QLSTMRVDRA ALGREAVSLM LSRFRNRTAS ARHICQAVVP IPGGTVPNA
|
| |
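The gene and protein sequences above are printed in space-separated blocks of ten. As a rough sketch of how they might be handled, the Python below strips the spacing, computes a GC percentage, and translates the CDS. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, which this sketch does not model (the gene here starts with ATG, so that does not matter for this record).

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAACGACC AAAAGATCAG ACGGCCGCGT"
print(translate(prefix))  # MNDQKIRRPR
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate` would be one way to compare against the reported GC content and protein sequence.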