Gene Rleg_4336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4336 
Symbol 
ID8015115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4457721 
End bp4458740 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content64% 
IMG OID644826912 
Producttranscriptional regulator, LacI family 
Protein accessionYP_002978115 
Protein GI241207019 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.563264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000141472 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGACC AAAAGATCAG ACGGCCGCGT CAGGCCGATA TAGCTACATT GGCCGGCGTT 
TCCGTCTCCA CGGTGTCACG CGTGCTCGCC AACGAACCTG GTATCAGCGA AACGGTGCGC
CGCCAGATAT TGAAGGTGGC GGCCGAGAAC GGCTATCCCG TCAAGCCTGC TTCCGAGGCC
GTTGCGGGGG GGCTGGCACT GATTGCCAGT GACGGCGTCA CCGGCACTCT CAGCGTCTTT
TATGAAGCGA TCGTCGACGG CCTGCGTGCC GGCGCTGCCG AAGCGGGCAT GCCTTTCGAA
GTCCGGTTGG TCCGCGAGGA CCGAACCACC CCGGATGCCG TGCGTGACTA TATGCAGACG
GCAGGCGCCG AAGGCCTCTT TCTCGTCGGC ATCGATCCGA ACGAGACGTT GCGCGACTGG
CTGCAAACCA GCATGACACC CACGGTTCTT GTCAACGGCA CCGATCCGAG GATGCAGTTC
GATGGCGTTT CGCCGGCTAA TTTCTTTGGT GCCTATGAGG CGACCAGCCG GCTGACAAAA
GCCGGCCATC GCCGCATCCT GCATCTGAGC GGTTCTCACC GCCATACGAT CCGGGAGCGC
GTGCGCGGTT TCGAGGCGGC GATCGCCGCC GTCCCCGGCG CTGAGGGCCG TCTCCTGTCC
CTGGCCCTTC AAGGCAGCGC CAGCCGAGAG GCGCATGAAC GCACGGTAGC AGCACTTGCC
GAGGATGCCG GTTTTACCGC CGCCTTCTGC ATGAATGATT TCATCGCCGT CGGCGTGCTC
GAAGCCGTCA CCGAGGCCGG CCTGCGTGTG CCGGAGGATT TCGCGATTGT CGGCTTCGAC
GATCTGCCCT GCGCGCAAAT GACCAATCCG CAACTTTCCA CCATGCGTGT CGACCGCGCT
GCCCTCGGGC GCGAGGCCGT TTCGCTGATG CTGTCCCGTT TCCGCAACAG GACGGCCTCT
GCGCGCCACA TCTGCCAGGC GGTCGTTCCC ATTCCGGGAG GGACCGTTCC GAACGCCTAG
 
Protein sequence
MNDQKIRRPR QADIATLAGV SVSTVSRVLA NEPGISETVR RQILKVAAEN GYPVKPASEA 
VAGGLALIAS DGVTGTLSVF YEAIVDGLRA GAAEAGMPFE VRLVREDRTT PDAVRDYMQT
AGAEGLFLVG IDPNETLRDW LQTSMTPTVL VNGTDPRMQF DGVSPANFFG AYEATSRLTK
AGHRRILHLS GSHRHTIRER VRGFEAAIAA VPGAEGRLLS LALQGSASRE AHERTVAALA
EDAGFTAAFC MNDFIAVGVL EAVTEAGLRV PEDFAIVGFD DLPCAQMTNP QLSTMRVDRA
ALGREAVSLM LSRFRNRTAS ARHICQAVVP IPGGTVPNA