Gene Rleg2_4568 details


Gene Information

Locus tag: Rleg2_4568
Symbol:
ID: 6977662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 203806
End bp: 205038
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 60%
IMG OID: 643393745
Product: extracellular solute-binding protein family 1
Protein accession: YP_002278563
Protein GI: 209546645
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
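
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(203806, 205038)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```

Under these assumptions the computed values agree with the reported 1233 bp and 410 aa.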


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.267408
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACAA TGATGAAGCT TTTTCTTGCC GGTGTGGCAT TCGCCGGTTT CTTCGCTTCG 
GCGCATGCTG AGGACAAGCC GACAATCGAG ATCATGTCGT CCTGGACGTC GGGCGGCGAA
GCGGCAGCCC TCAATGTCAT CAGGACCGAG TTCGAAAAGC GCGGCGGCGT CTGGAAGGAT
TCCTCGATCG CCGGCTTCGG CGCCGCTGAT GCCGCCTTCC AGAACCGTAT CGTCGCCGGT
GACGCGCCGG GCGCCAAACA GGGCGTCATC GGTCTTGCGG CTGCGGATTT CGTCGGCCAG
GGACTGTTCA ATCCGATCGA CGATGTCGCT GCTGCCGGCA AATGGGCCGA CGTGCTGCCG
AAATCGATCC ATGATCTCAT CTCTTATGAC GGCAAGGTCT ATCTCGCGCC GACCGGCGCC
CATGGCGAGA GCTGGGTCTT CTATTCAAAG GAAGCCTTCG CCAAGGCCGG CATCGCCGAG
GAGCCCAAAA CCTGGGACGA GTTCTTCGCC GACTTCGACA AGCTGAAGGC TGCCGGTATC
GTTCCCGTTG CATGGGGTGG CCAGCCCTGG CAGCAGACCA AGGTCTTCAA CATGATCCTG
CTCTCGCAGG TCGGGATCGA CGGCTTTCTG AAGATCTATG TCGACAAGGA CAAGAGCCAG
GCCTCTGTCG AGGGCGTGAA GAAGACCCTC GAAATTCTCG GCAAGCTGCG CGGCTATATC
GATGCGGGGG CTGCCGGCCG CAACTGGAAC GACGCAACGG CGATGCTGAT CACCGCCAAG
GCCGGCGTGC AGTTCATGGG CGACTGGGCA AAGGGCGAAT TCACCGTTGC CGGCAAGGAA
CCGGGCAAGG ATTATGGCTG CATGATCGTG CCGGAGTCCA AGGGCATGGT CTACATCGCC
GATTCCCTCT GGTTCCCGAA GACCGGCAAT GCCGCGACCG ACAAGGCGCA GAAACTTCTC
GCCGAAGTCG TCATGGATCC CGCGGTCCAG GTCGAATTCG CTTTGAAGAA GGGCTCGGTT
CCGATGCGCA CCGATGTCGA CAAGTCGAAG CTTGATGTCT GCGCCCAGAA GGGTGTCGAG
TTGATGGCTT CCGGTGCGAT CGTCCCGGAT CAGGCAATCG TGCTGACACC CCAGCAGGTC
GGCGCGCTCG ACGATTTCGT CGACGAATAC TGGAGCGGTG GCTCGAACGA TACGGCATCT
GCGGCTGAGA ATTTCTTCGC CGTCTTCGAG TAA
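
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as printed here (blocks of ten bases separated by spaces and line breaks). The helper names are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGAAGACAA TGATGAAGCT"
print(gc_percent(first_blocks))  # 35.0
```

Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the rounded figure in the Gene Information table.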
 
Protein sequence
MKTMMKLFLA GVAFAGFFAS AHAEDKPTIE IMSSWTSGGE AAALNVIRTE FEKRGGVWKD 
SSIAGFGAAD AAFQNRIVAG DAPGAKQGVI GLAAADFVGQ GLFNPIDDVA AAGKWADVLP
KSIHDLISYD GKVYLAPTGA HGESWVFYSK EAFAKAGIAE EPKTWDEFFA DFDKLKAAGI
VPVAWGGQPW QQTKVFNMIL LSQVGIDGFL KIYVDKDKSQ ASVEGVKKTL EILGKLRGYI
DAGAAGRNWN DATAMLITAK AGVQFMGDWA KGEFTVAGKE PGKDYGCMIV PESKGMVYIA
DSLWFPKTGN AATDKAQKLL AEVVMDPAVQ VEFALKKGSV PMRTDVDKSK LDVCAQKGVE
LMASGAIVPD QAIVLTPQQV GALDDFVDEY WSGGSNDTAS AAENFFAVFE
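
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below is a self-contained illustration, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAAGACAA TGATGAAGCT TTTTCTTGCC"))  # MKTMMKLFLA
print(translate("GTCTTCGAGTAA"))                      # VFE
```

Both fragments match the start (MKTMMKLFLA) and end (...VFE) of the protein sequence listed above.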