Gene Rleg2_6543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6543 
Symbol 
ID6983613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp216513 
End bp217814 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content61% 
IMG OID643399539 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002284295 
Protein GI209552380 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.903497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.436322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCGT CCAGAACGCT CGGACTGGTG ATGATCGCGC CTGCGGCGGT CATGATCGTC 
CTGTTCTTCC TGATGCCGGT GGTTCTGACG GCGGTCTTCT CGATGACCAA CATGACGACG
GCGACCGGCA TTTCCGGCGG CGTCTACCAG ATCGCGCCGA ACTCCCTGAT CGCGCTGAAA
TCGGCGCTGC CCGAAATCGC CACCGAGATG GCCGAACCGC GTTACACCAT CGACGAGGCG
GGCCTTAAGG CCGTCGAAGG CCTCGGGCTT GCGCCGGGCA TTGCCGCGGA ATTGCGCGCC
AAACATGCCG GTGAGGCCTT CCCGGCACGC CGCGACGTCG AGCGCATGAT CAAGGATCTC
GCCGAGCGAC CTTCGACACG TGAGGTCAAG CAGATCTCCG AACAGTTCAA CCGCTCCGTC
CTCAACACCC GCTTCGACAG CAAGGAGCAG CTGTTTTCGG CACTGGACGA TCTCGGTTTC
AAGCTGACGC CGGAGCAGAA GGAAACCGTC GCCAAGATAA CCTATACCGG CTGGACCTGG
ACGACCGACA ATTTCTCGCG CATGGCCAGT TCGCCCGACA TGGCGCGGGT GCTTTTCAAT
ACGGTGCTCT ATGTCGCGCT GGTGCTGACG CTCTTCAACG TCGGTTATGC CCTGCTGCTT
GCCATCTGGA CGCATTACAT GCCGCCGACG CCCGCCTCGA TCTTTCGCGG CATCTGGCTG
CTGCCGCGCA TCACGCCCGT CGTCATCTAC GTCCTGCTGT GGAAGTGGCT CGCCTGGGAC
AGCGGCTTCA TCTCTATCCT GATGGGCAAG TTCGGCTACC CCCCGAAGAA CTACCTGCTC
GATACCGACT ATAATGCCTG GTTCTTCGTC GTGCTGATCA ACGGCTTCAT CGGCGCTTCA
ATGGGCATGC TGGTGTTCTC CTCGGCGATG AAGGCCATTC CGAAGAGCCA GTTCTATGCA
AGCGAGGTCG ACGGCGCTTC GCGCTGGCAG CAGATCCGCT ACATCATCCT GCCGCAGATG
CGTTGGCCGA TCCTCTTCGT CACCTGCTAC CAGACGCTGT CGCTGCTCGC CTCCTTCAAT
GAGATACTGC TCGCCACCAA TGGCGGCCCG GGCAACGCCA CCGAGGTCTG GGCGCTCGCC
GCCTACCACA CCGCGCTCAG GAACTATGCC GGCAATCTCG AATACGGGCT GGGCGCCGCC
ATGGCGCTGG TGCTCGTTGT CATCGGCGTG GCGCTGTCAC TCGTCTATCT GCGCGTCTTC
AACTACGGCA CGCTTGTCGC CAAGCCCCTG ATCGAGGATT GA
 
Protein sequence
MKSSRTLGLV MIAPAAVMIV LFFLMPVVLT AVFSMTNMTT ATGISGGVYQ IAPNSLIALK 
SALPEIATEM AEPRYTIDEA GLKAVEGLGL APGIAAELRA KHAGEAFPAR RDVERMIKDL
AERPSTREVK QISEQFNRSV LNTRFDSKEQ LFSALDDLGF KLTPEQKETV AKITYTGWTW
TTDNFSRMAS SPDMARVLFN TVLYVALVLT LFNVGYALLL AIWTHYMPPT PASIFRGIWL
LPRITPVVIY VLLWKWLAWD SGFISILMGK FGYPPKNYLL DTDYNAWFFV VLINGFIGAS
MGMLVFSSAM KAIPKSQFYA SEVDGASRWQ QIRYIILPQM RWPILFVTCY QTLSLLASFN
EILLATNGGP GNATEVWALA AYHTALRNYA GNLEYGLGAA MALVLVVIGV ALSLVYLRVF
NYGTLVAKPL IED