Gene Rleg_4707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4707 
Symbol 
ID8007182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp75310 
End bp76569 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID644821640 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002972900 
Protein GI241113065 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATT TGCTTGGTGC CAGCGCACTT GCGCTTGTTT TCATAACCGC CACTGCCAAT 
GCCGAAACCA TCAACATTCT GGTGGAGGGC GGCGGCGAAA TGTTGCAGAA GGCGGTCGCC
GAGAAGTTTA CTGCCGAAAC AGGCATCAAA GTGAACTTCA CGACCGTTCC CTATCAGGGT
GTCTTTGACA AGTTCTCAGC CGAAATCGCC TCAGGCTCTT CCGCGTTCGA TGTCGTGACG
ATCGACGTCG TGTGGAATGC GAAGTTTGCA AGCCACGTCG AGGATCTCTC GGCTCTTTTC
ACCGATGCGG TTCGCGCGGA TCTGCCGCCT GTCTTGCTTG CCGACGCGAA GGTTGGTGAC
AAACTGATCG GCATGCCTGC CTGGGCCAAT GCCGAGATCG TGTTCTATCG CAAGGACCTG
TTTGATAAGG CTGAGGAAAA GGAGGCTTTT CAGGCCAAGT ATGGCTATCC TCTCGCGCCG
CCCAAAACCT GGCAGCAGTG GCGCGACATT GCGAAATTCT TCACGCGTGA CACTGACGGC
GATGGAAAGA CCGACTTCTG GGGCACCGAC ACCATCGGCA CGTTTTCAGA GGAATGGATG
GCGCATGTGC TGCAAGCGGG TTCGCCAGGG GTGATCCTCG ATAAGGACGG GCAGGTCATC
ATCGACAACG AGGCGCACAA AAAGGCACTG GAATTCTACA TCGCGCCACA CTGCATTGAT
CATTCCGTTC CTGAAAACGT GAACGAAATC GGCTGGGGCG AGGCGCAGAA CCTGTTCTAT
CAGGGCAAAA CAGCGATGAT GAAGTTCTGG GCGCACGCCT ACAAGATGAC GCCTCCGGAT
TCAAAGGTCA GCGGCAAGGT CGGCGTGGTG CCGATGCTGG CCGGCGACGC CGGGATCGCA
GCTGTTCCTG GCCCTTGGTA CAACGTCGTT CCCTCGACAT CCGAGCACAA GGATGCAGCG
AAAAAATTCA TCTCGTTTGC CATCGCCAAT AATGCCCTGG GTATCGAAGC TCCGCTCGGC
CTTGCCGCGA CGAATTCCGC CTATCGCAGC TATTCAGGCA AGGCCGGCTA TGAGCACTTC
CCCCCACTTC TTGAGACGCT GAGCGCGCCT GCCACCCAGG GCCGGCCGAT CAATGAAAAA
TATCAGGAAA TCGTCGATGA AGCTGTGCTG CCGGCTATCC AGCAGGCACT CACCTGCAAG
GCGGATATCG GGGAGGTCCT GACGGAAGCC AAGGAAACGA TCGAGGACAT TCTCAACTAG
 
Protein sequence
MKNLLGASAL ALVFITATAN AETINILVEG GGEMLQKAVA EKFTAETGIK VNFTTVPYQG 
VFDKFSAEIA SGSSAFDVVT IDVVWNAKFA SHVEDLSALF TDAVRADLPP VLLADAKVGD
KLIGMPAWAN AEIVFYRKDL FDKAEEKEAF QAKYGYPLAP PKTWQQWRDI AKFFTRDTDG
DGKTDFWGTD TIGTFSEEWM AHVLQAGSPG VILDKDGQVI IDNEAHKKAL EFYIAPHCID
HSVPENVNEI GWGEAQNLFY QGKTAMMKFW AHAYKMTPPD SKVSGKVGVV PMLAGDAGIA
AVPGPWYNVV PSTSEHKDAA KKFISFAIAN NALGIEAPLG LAATNSAYRS YSGKAGYEHF
PPLLETLSAP ATQGRPINEK YQEIVDEAVL PAIQQALTCK ADIGEVLTEA KETIEDILN