Gene Rleg_4597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4597 
Symbol 
ID8015344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4721702 
End bp4722688 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content57% 
IMG OID644827174 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002978374 
Protein GI241207278 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0491755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAATT TTTTGAAGCG TGAATCCGTC TTGATCAATG CGCCGCGCCG TGCGGCATTG 
AAGCTTGGAC TTGCCGGCAC CTTGGTGCTT GCGCTGTCCT GTGGCGTGTC GCCCGCCTTT
GCGGCCGGCA AGCCGAAGGT CGGCCTGATC ATGAAATCCT TGTCCAATGA GTTCTTCAAG
CAGATGAAGG CCGGCGCCGA CAAATATGCG GCCGAAAACA AGGACAAGTT CGACTTCAAG
GCCGTCGGCA TGAAGGATGA GCGTGATTTC GCGTCCCAGG TCGATGCCGT CGAAAACTTC
GTCACGCAGA AATACGATAT CATCGTTGTT GCGCCGGCCG ATTCAAAGGC CATGGCAACC
CCGCTGGCAA AGGCAGTCAA GGCTGGCGTC AAGGTCATCA ACATCGATGT GCCGCTCGAT
GCCGATGCGA AGAAGAAAGC CGGTATCGAT CTCGCTTTCT TCGGGCCTGA CAATAAGGGC
GGGGCGACGC TTGCGGGCGA CGCGCTTGCC AAGGATCTGG GGCCTGGCGC CAAGGTCGTC
ATCCTCGAGG GTAACCCCGA GGCCGACAAT GCCAAGGAGC GCAAGGAAGG CTTCATGGAC
TCCGTCAAGT CTGGCAAGCT TGAGCTTCTC GACAGCAAGA CGGCCCATTG GGAGACGGAG
GAAGCCAATA CCGTCATGAC GAATTTCCTG ACGAAGTATA AGGATATTCA GGGCGTTATG
GCAGCCAACG ACTCGATGGC TCTCGGCGTT GTCAAGGCGC TCGATGCGGC CGGCCAGAGT
GGCAAGATCA AGGTTGTCGG CTTCGATAAC ATTCCGCCGG TTCAGCCGCT GATCAAGGAT
GGCAAGATGC TTGCCACTGT CGAACAGTAT GGCGCGCAGA TGGCGGTCCT CGGCATCCAG
TATGGCATGC GTGAACTTGC CGGCGAGAAA TTCACCGGCT GGGTCAAGAC CGACATCAAG
CTGGTCACCG CAGCGGATCT CAAATAG
 
Protein sequence
MSNFLKRESV LINAPRRAAL KLGLAGTLVL ALSCGVSPAF AAGKPKVGLI MKSLSNEFFK 
QMKAGADKYA AENKDKFDFK AVGMKDERDF ASQVDAVENF VTQKYDIIVV APADSKAMAT
PLAKAVKAGV KVINIDVPLD ADAKKKAGID LAFFGPDNKG GATLAGDALA KDLGPGAKVV
ILEGNPEADN AKERKEGFMD SVKSGKLELL DSKTAHWETE EANTVMTNFL TKYKDIQGVM
AANDSMALGV VKALDAAGQS GKIKVVGFDN IPPVQPLIKD GKMLATVEQY GAQMAVLGIQ
YGMRELAGEK FTGWVKTDIK LVTAADLK