Gene Rleg2_4984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4984 
Symbol 
ID6978078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp628377 
End bp629663 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content57% 
IMG OID643394130 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002278948 
Protein GI209547030 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA AGCTTTTCGT CGGCGCAAGC GTGCTCGCTC TCTTGATGTC AGCGGCATCA 
ACCTGGGCGG CGGACAAGGA AATCACGGTG TGGTCCTGGT TCGTCCAGAG CACTATGCAA
AAGTCCATTG ATGCTTTCCA AAAAGCGCAT CCGGACGTCA AGGTGACGTA CACCTATTAC
AACTTCTCGC CGGAATACAT CACCGCGCTG AAAGCTGCGG CCGCATCTGG CAGCCTTCCC
GACGTGATTG GCTTGCAGCC CGGCTCACTT GCCCAGCAGT ACCGAGAACA GCTCGAACCG
ATCAATGACC GTGCCACCAA ACAGTGGGGC GCGGATTGGG AAAAGAACAT CTTCCCAGTC
AATCGCAAGC AGATGCAGAT GGGTAACCCA AAGGGCGACA CCAACTATTA CCTGATGCCG
CAGGAGTCAC AGGTTCTCTG CATTTGGTAT AATCGAAAGC TCTTCGAGGA GCTCGGCATT
GCGGTCCCGA AAACCTACGA CGATCTGAAG GCTGCATCTA AGAAGCTCAC TGAAGGCGGC
TTCATCCCAA TGTTCCAGGG TGCTGCCGAC GGCTGGCAGA ATGAGAATGT CTTCCTGATG
CTTGCCAACC AGTTCTCTCC GGGTATCGTC GATAAGGCGC AAGCAGGCGA AACGCCTTGG
ACAGCGCCGG AACTCGTAGA AGCGATGCAG GCTTGGAAGG GTCTGTTCGA TGACGGAGTG
TTCCAGCAGG GTGCTCTAGG TGCCCATGCG TATCCGACAG GCGCACAGCT GTTCCAGCAG
GGTAGGGTCG GCATGATGGC GCTCGGATCG TGGTGGATGC AGGAAAGCAA ATTCCCGCCA
CCGCTTTCGG AGTTCGTCCA TAACATGGAA GGTTTCGACT TCTTCTATAT GCCGCCAGTG
AAGGATGGCA ACAAAGCCAG CCCGCCAGTC GGTGGCATCG ATATTGGCTA CGGTCTCACC
AAAAACGACG CAAAGAACGA GGAGGCCTGG ACATTCCTCG CCGAACTCAC CAATGGCGTC
GGTCTTCAAG AAGCCCTTAA CGATCTCAAT GACCTTCCGG CATTTTCGGG ACACGAGCCC
AAGGGCGACA TTACCGACCA CGTCAAGGAA ATGTCCGCTC GCTTTATGGC CGACCTCCCC
AAAGCCGAAA ACCAGCGCTT CGCTTCGCCT GCCGTCGCCG AGGCGCTCGA CAATGCTCTG
GCCGGGGTCG CGGCGGGAAG CCTGGAACCC AAGGCGGCCC TGAAGTCCGT CGACGAAGCG
ACGCAGAAAG CGCTGGCCTC GAAGTAA
 
Protein sequence
MKKKLFVGAS VLALLMSAAS TWAADKEITV WSWFVQSTMQ KSIDAFQKAH PDVKVTYTYY 
NFSPEYITAL KAAAASGSLP DVIGLQPGSL AQQYREQLEP INDRATKQWG ADWEKNIFPV
NRKQMQMGNP KGDTNYYLMP QESQVLCIWY NRKLFEELGI AVPKTYDDLK AASKKLTEGG
FIPMFQGAAD GWQNENVFLM LANQFSPGIV DKAQAGETPW TAPELVEAMQ AWKGLFDDGV
FQQGALGAHA YPTGAQLFQQ GRVGMMALGS WWMQESKFPP PLSEFVHNME GFDFFYMPPV
KDGNKASPPV GGIDIGYGLT KNDAKNEEAW TFLAELTNGV GLQEALNDLN DLPAFSGHEP
KGDITDHVKE MSARFMADLP KAENQRFASP AVAEALDNAL AGVAAGSLEP KAALKSVDEA
TQKALASK