Gene Rleg_6131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6131 
Symbol 
ID8016088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp174898 
End bp176199 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content59% 
IMG OID644827437 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002978637 
Protein GI241258753 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.336729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.227822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCGT CCAGAACGCT CGGACTGGTG ATGATCGCGC CTGCGGCGAT CATGATCGTT 
CTTTTCTTCC TGATGCCGGT CGTTCTGACG GCGGTCTTTT CGATGACCAG CATGACGACG
GCGACCGGTA TTTCCGGCGG CGTCTATCAG ATCGCACCCA ACTCCATGAT TGCGCTAAAA
TCGGCAATAC CGGACATTGC CGCCGAGATG GCCGAACCGC GCTACACGAT CGACGAGGCG
GGCCTCAAGG CCGTCGAAGG ACTCGGGCTT GCGCCGGGGA TTGCTGGGGA ATTGCGCGCC
AAACATGCAG GTGAGGTGTT CACGGCACGC CGCGACGTCG AGCGCATGCT CAAGGATCTC
GCCGACCGGC CTTCGACGCG CGACGTCAAG CAGATTTCCG AACAGTTCAA CCGCTCCGTC
CTCAACACCC GCTTCGACAG CAAGGAGCAG CTCTTTTCGG CGCTGGATAG TCTGGGTTTC
AAACTGACAC CGGAGCAGAA GGAAACGGTC GCCAAGGCCA CCTATACCGG CTGGGTTTGG
ACGACCGACA ATTTCTCGCG CATGACCACC TCACCCGATA TGGCGCGTGT ACTCTTGAAT
ACCGTGCTCT ACGTCGCGCT GGTGCTGATG CTGTTCAATG TCGGCTATGC GCTGCTACTT
GCCATTTGGA CGCATTACAT GCCGCCGACG CCGGCCTCGA TCTTCCGCGG CATCTGGCTC
CTGCCGCGCA TTACCCCTGT CGTCATCTAT GTCATGCTAT GGAAGTGGCT TGCCTGGGAT
ACCGGCTTCA TTTCGATCCT GATGGGCAAG TTCGGCTATC CGCCAAAGAA CTACCTTCTC
GACAACGCTT ACAACGCCTG GTTCTTCGTC GTGTTGATCA ACGGCTTCAT CGGCGCCTCG
ATGGGCATGC TGGTGTTCTC CTCGGCTATG AAGGCCATTC CGAAGAGCCA GTTCTATGCG
AGCGAGGTCG ACGGCGCCTC GCGCTGGCAG CAGATTCGCT ACATCATTCT GCCGCAGATG
CGCTGGCCAA TCCTCTTTGT TACCTGCTAC CAGACCTTGT CGCTGCTTGC CTCCTTCAAT
GAAATCCTGC TCGCCACCAA TGGCGGACCG GGCAATGCGA CCGAGGTCTG GGCGCTCTCG
GCCTATCACA CTGCGCTGAG GAACTATGCC GGCAACCTCG AATACGGGTT GGGTGCCGCC
ATGGCCTTGG TGCTCGTCGT CATCGGCGTG ACGCTGTCGC TCCTCTATCT GCGCGTCTTC
AACTACGGCA CGCTTGTCGC CAAGCCCTTG ATCGAGGATT GA
 
Protein sequence
MKSSRTLGLV MIAPAAIMIV LFFLMPVVLT AVFSMTSMTT ATGISGGVYQ IAPNSMIALK 
SAIPDIAAEM AEPRYTIDEA GLKAVEGLGL APGIAGELRA KHAGEVFTAR RDVERMLKDL
ADRPSTRDVK QISEQFNRSV LNTRFDSKEQ LFSALDSLGF KLTPEQKETV AKATYTGWVW
TTDNFSRMTT SPDMARVLLN TVLYVALVLM LFNVGYALLL AIWTHYMPPT PASIFRGIWL
LPRITPVVIY VMLWKWLAWD TGFISILMGK FGYPPKNYLL DNAYNAWFFV VLINGFIGAS
MGMLVFSSAM KAIPKSQFYA SEVDGASRWQ QIRYIILPQM RWPILFVTCY QTLSLLASFN
EILLATNGGP GNATEVWALS AYHTALRNYA GNLEYGLGAA MALVLVVIGV TLSLLYLRVF
NYGTLVAKPL IED