Gene Rleg_6794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6794 
Symbol 
ID8022724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp235416 
End bp236708 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content59% 
IMG OID644833661 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002984795 
Protein GI241666711 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAA ATGGCATAGG CCGGCAGTTG ATTTTCGCCG CGACGACGGC GGCGTCGATG 
ATTTTTGCGG TCGCTCCATC GGCGGCGCAG GACAAGGTCC AGCTGACGTT TCGTCAATTC
GATCCCCCGA CCGAAATTCA AGGCCTGATT GCCGCCGTCG AGGCCTGGAA CTCTAGTCAT
CCCGATGTCC AGGTCAAATT GGAGACGATG TCGGGCGGCG ACACGCTTGC CCAGTTGGCG
CGCGAAATTC CCGCCGGTGC GGGACCGGAT GTTCAGCAGC TGGCTTTCGT GTGGACGCGT
GATCTCGCGC GATCAAAGCT GCTCCTCGAT CTTAGCCCTC TCATCCAATC GAATGCGCCG
GGAGCCGGCA CCGACGATTT TCTGGCTCTC GACCTCGCCA CCCTCGATGG CAAAATCTTC
GGCCTGCCAT GGACGGCGGA TACATTCTCG ATGGCCTATC GCCCGGATCT CCTGCAGGCA
GCAGGTGTCT CGAACTTTCC CGATAGCTGG GACGATCTTG CGGCAGCTGC CAAGAAGTTG
ACCACCGAAG GTGGTGGGAC CGAACAATAC GGTTTCTGCT TCCCCGCAGG CAGCGCACCC
GACAGCGGCA TGTGGACGCT GGTAAATTAT TACCTCTGGA GCAACGGCTC GACCTTGGTC
ACGGAGGAAA GCCCGGGGAA ATGGAAGGTG GCGGTGACGC CGGAGCAACT GGCGGCGGCG
ATGAACTATT TCAACCAGTT TTTCGTGGAC GGAACGGCGC CCGAAAACCT CATCACCGTG
AATGCCTGGG GCGATCCCGA GCTGATCGGC GGGCTCGGCC GAGGCGACTG CGCGATCACT
TTCTTTCCGC CGCAAACCTT CAGGGCCGCC GAGAAACAAT CTGAAAAGCC GCTGCTGACC
GCACCGATCC CGAAAGGGAC GGAAAAGCGC ATCTCCCATC TTGGCGGACG TGCGCTGGGC
ATCAATCCCA ACACCAAGCA TCAGAAGGAG GCCTGGGAGT TCGTCAAATA TCTGGTCGGC
CCCGAGACCT TCAAAACCTA CAACCAATAT CCCTCGCAGA AGTCGCTGCT TTCACAGCTC
CAGTTTCCGC CGGCCGAACA GGGCTATGTA ACGATGCTTC CGTTGGCGCA GACCTTCGAG
CGCTACATCT CGTCCCCGAT CAAGGTGTCG AGCATGACAG CGCTCATCAA TCGTGAGTTC
GGCGCCGTAT TCTCCGGGCA GCGCAATCCT GATGAGGCTG CGGATGTCAT CATCAAGGAG
CTCAACGACT TGCTTGCCCG CGGCAAGGGC TGA
 
Protein sequence
MFKNGIGRQL IFAATTAASM IFAVAPSAAQ DKVQLTFRQF DPPTEIQGLI AAVEAWNSSH 
PDVQVKLETM SGGDTLAQLA REIPAGAGPD VQQLAFVWTR DLARSKLLLD LSPLIQSNAP
GAGTDDFLAL DLATLDGKIF GLPWTADTFS MAYRPDLLQA AGVSNFPDSW DDLAAAAKKL
TTEGGGTEQY GFCFPAGSAP DSGMWTLVNY YLWSNGSTLV TEESPGKWKV AVTPEQLAAA
MNYFNQFFVD GTAPENLITV NAWGDPELIG GLGRGDCAIT FFPPQTFRAA EKQSEKPLLT
APIPKGTEKR ISHLGGRALG INPNTKHQKE AWEFVKYLVG PETFKTYNQY PSQKSLLSQL
QFPPAEQGYV TMLPLAQTFE RYISSPIKVS SMTALINREF GAVFSGQRNP DEAADVIIKE
LNDLLARGKG