Gene Rleg2_6459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6459 
Symbol 
ID6983530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp123089 
End bp124102 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content62% 
IMG OID643399456 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002284212 
Protein GI209552297 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0427942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0971645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCG AAAAATCCGG AAGCGGTCTT GGAAGACGAG ATCTGTTGAA ACTGTCCGCA 
GCCGCCGGGG TCGCCGTTGC CGGCGCTTCG CTGGTCGGGC AGAAGGCGGT TTTGGCAGCC
GACGAAGAAC TGTCGCTGAA AGGCAAGCGC ATCGCCATCA GCGCAACCGG CACCGACCAT
TTCTTCGACC TGCAGGCCTA TAATGCCCAG ATCGAAGAGG TAAAACGCCT CGGCGGCGAG
CCGATCGCCG TCGATGCCGG GCGCAATGAC GGCAAGCTGG TGTCACAGCT GCAGACGCTG
ATCGCCCAGA AGCCGGATGC AATCGTTCAA ATCCTCGGCA CGCTGAGCGT CATCGACCCC
TGGCTGAAGA AGGCGCGTGA CGCCGGCATT CCGGTTCTGA CCGTCGACGT CGGCTCGACC
AACTCGATCA ACAACACCAC CTCCGACAAC TGGGGCATCG GCAAGGACCT GGCGCTGCAG
CTCGTCTCCG ATATCGGCGG CGAAGGCAAT ATCGTCGTCT TCAACGGTTT CTATGGCGTC
ACCCCCTGCG CGATCCGCTA TGATCAGCTG GTCAATGTCG TCAAATATTT CCCGAAGGTG
AAAATCCTTC AGCCGGAACT GCGCGACGTC ATCCCGAACA CCGTGCAGGA TGCCTTCACG
CAGATCACCG CAATCCTCAA CAAATATCCC GAAAAAGGTT CGATCAAGGC GATCTGGTCG
GCCTGGGATA TTCCGCAGCT TGGCGCCACC CAGGCTTTGG CGGCGGCCGG CCGGACCGAG
ATCCGTACCT ACGGCGTCGA TGGCAGCCCC GAGGTTCTGC AGCTTGTCGC CGATCCGAAG
TCGCCGGCCG GTGCCGACGT CGCGCAGCAG CCGGCGGAAA TCGGCCGCAC CGCCATCCGC
AACGTCGCCA AGCTGCTCGC CGGCCAGACG CTGCCGCGCG AGACCTATGT TCCCGCCCTT
CTCGCCAACA AGGCCAATGT CGGCGAAGTC ACCAAGAAGC TCGGTATCGG CTGA
 
Protein sequence
MSIEKSGSGL GRRDLLKLSA AAGVAVAGAS LVGQKAVLAA DEELSLKGKR IAISATGTDH 
FFDLQAYNAQ IEEVKRLGGE PIAVDAGRND GKLVSQLQTL IAQKPDAIVQ ILGTLSVIDP
WLKKARDAGI PVLTVDVGST NSINNTTSDN WGIGKDLALQ LVSDIGGEGN IVVFNGFYGV
TPCAIRYDQL VNVVKYFPKV KILQPELRDV IPNTVQDAFT QITAILNKYP EKGSIKAIWS
AWDIPQLGAT QALAAAGRTE IRTYGVDGSP EVLQLVADPK SPAGADVAQQ PAEIGRTAIR
NVAKLLAGQT LPRETYVPAL LANKANVGEV TKKLGIG