Gene Rleg_4989 details


Gene Information

Locus tag: Rleg_4989
Symbol:
ID: 8007580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 372390
End bp: 373700
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 58%
IMG OID: 644821904
Product: extracellular solute-binding protein family 1
Protein accession: YP_002973164
Protein GI: 241113329
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
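
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 372390
end_bp = 373700

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the script gives 1311 bp, 437 codons and 436 aa, in agreement with the Gene Length and Protein Length fields.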


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.192631
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAAAA CTATGACCGG TCTGTTGGCC GGTGTCGGAT TGATGTGGGC GTGCGGAACA 
TCCGCACAGG CCCAGGAACT GACCATCTTC TGGGCCGAGT GGGATCCGGC AAACTACCTT
CAGGAACTCG TCAACGAATA CGAGGCTCAA ACCGGCGTCA AGGTCACGGT CGAGACCACA
CCGTGGGCCG ACTTCCAGAC CAAGGCCTTC ACCGAGTTCA ACGCCAAGGG TTCAGCCTAT
GACATGGTCG TCGGCGACAG TCAGTGGATC GGGGCAGCGT CAGAAGCCGG CCATTACGTC
GATCTGACCG ACTTCTTCAC CAAGCACAAT CTGACCCAGG TGATGGCCCC GGCAACGGTG
AAATACTACG CCGAATATCC GTCGAACTCG AAAAAGTACT GGTCGGTTCC GGCCGAAGGC
GACGCCGTCG GCTGGTCCTA CCGCAAGGAC TGGTTCGAAG ACCCCAAGGA GATGGAGGCG
TTCAAGGCCA AATACGGCTA CGATCTCGCA CCGCCGAAGA CATGGGCCGA GATGCGTGAC
ATCGCCGAGT TCTTCCACCG TCCAGACCAG AAGCGATACG GAATCGCCAT CTACACCGAC
AACTCTTATG ACGGTCTCGT CATGGGTGTC GAGAACGCGA TCTTCTCGTT TGGAGGCGAA
CTCGGCGACT ACCAGAGCTA CAAGGTCGAC GGCATCATCA ATTCCGAGAA GAACGTCAAG
GCGCTCGAGC TTTATCGCGA GCTCTACGGC TTTACGCCAC CGGGCTGGGC CAAGTCCTTC
TTCGTCGAGA ACAACCAGGC GATCACTGAG AACCTGGCGG CGATGAGCAT GAACTACTTC
GCCTTCTTCC CGGCCCTGGT GAACGAGGCG TCCAACCCGA ACGCCAAGGT TACCGGCTTC
TTTGCCAATC CGGCGGGCCC GAACGGCGAG CAATTCGCAG CGCTCGGCGG CCAAGGCATA
TCGGTCATCT CCTACTCAAA AAACCAGGAA GAGGCGATGA AATTCCTCGA ATGGTTCATC
AAGGACGAGA CCCAGAAGCG CTGGGCCGAA CTCGGCGGCT ATACGGCAAG CGCCAAGGTG
CTTGAATCGC CGGAGTTTCA GAACGCGACA CCCTATAACA AGGCCTTCTA CGAGACGATG
TTCAAGGTGA AGGACTTCTG GGCAACGCCT GAATATGCCG AACTGCTGAT CCAGATGAAC
CAGCGCATTT ATCCCTTCGT CACTGCCGGC CAAGGCACGG CGAAGGAAGC GCTCGAATCC
CTGGCAGCGG ACTGGAACGC GACGTTCGCG AAATACGGAC GCCACAAGTA G
 
Protein sequence
MRKTMTGLLA GVGLMWACGT SAQAQELTIF WAEWDPANYL QELVNEYEAQ TGVKVTVETT 
PWADFQTKAF TEFNAKGSAY DMVVGDSQWI GAASEAGHYV DLTDFFTKHN LTQVMAPATV
KYYAEYPSNS KKYWSVPAEG DAVGWSYRKD WFEDPKEMEA FKAKYGYDLA PPKTWAEMRD
IAEFFHRPDQ KRYGIAIYTD NSYDGLVMGV ENAIFSFGGE LGDYQSYKVD GIINSEKNVK
ALELYRELYG FTPPGWAKSF FVENNQAITE NLAAMSMNYF AFFPALVNEA SNPNAKVTGF
FANPAGPNGE QFAALGGQGI SVISYSKNQE EAMKFLEWFI KDETQKRWAE LGGYTASAKV
LESPEFQNAT PYNKAFYETM FKVKDFWATP EYAELLIQMN QRIYPFVTAG QGTAKEALES
LAADWNATFA KYGRHK
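
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it uses only the first three ten-base groups of the gene sequence as input. The helper names `clean`, `translate` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in groups of ten.
first_block = "ATGCGCAAAA CTATGACCGG TCTGTTGGCC"
print(translate(first_block))
```

For this fragment the result is MRKTMTGLLA, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.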