Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2652 |
Symbol | |
ID | 8013608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2642942 |
End bp | 2644216 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644825226 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002976456 |
Protein GI | 241205360 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.142803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTTT TGAAGCAATT TCCGTCGACC ACCCGCAGAC GCTTCCTGAA AGGCGCGGGG CTGGTTTCCG CCGCAGCGGT CACCGGCAGC TTTCCGATCC CGGCGATCGC GCAGGCCCAG GAGGTCACGA TGATCTCCGC CGAAAACAAT GGTGCCGCAC TCGATGCGCT GAAGGCGATT GCCGCGGGCT TCAGCAAGGA AGCCGGCGTC AACGTCGTCA TCAACAATAT GGACCACGAG GCCCACAAGA CGGCCATTCG CAACTATCTC GTCGCCGGCG CGCCCGACGT CTGCTCCTGG TTTTCGGGCA ACCGCATGCG CGCCTTCGTC AAGCGCGGCC TCTTCGACGA TATCTCCGAC CTTTTCGAGA AAGAGAAATA TAAGGACGTG CTCGGCGCGA CGGCAGGGGC CGTCACCGAA GACGGCAAGC AGTACGGCCT CCCCACCGGC GGCACGCTCT GGGGCATGTT CTATCGCAAG GATGTGTTCG AGCAGCATGG CCTGACCGTG CCGAAGACGG CCGAAGAGTT CATGGCCTAT GGCGACAAGT GCAAGGCCGC GGGCATCACG CCGGTCGCGA TCGGCACCAA GGAATTATGG CCGGCGGCCG GCTGGTTCGA CCAGATGAAT CTTCGCATCA ACGGCCTCGA CAAGCATATG GCGCTGATGA ACGGCGAGAT GAGCTATCTC GATCCCTCGC TGAAAGCGGT TTTCGACCAG TGGGAGGCGA TGATTTCGAA GGGGTTCTTC ACGGAGAACC ATACGTCCTT CGGCTGGCAG GAGGCTGCAG CGCTGCTGGC GCAGAAGAAA GCCGGCATGA TGAACCTCGG CGCTTTCCTG CGCTCGGCTT TCACCGCCGA AGATCTGCCA CAGCTCTCTT ACGCGACCTT CCCGGTGCTA GACGCCAAGG TCGGCCACTA CGAGGAATTC TCGGTCAACT CGATCCACAT TCCCGCCAAG GCCAAGAACA AGCAGGGCGC GCGCGAATTC CTCGCCTATT TCTACAAGCC GGAGAATCTG GCGGCCTATC TGGAGCCGGG CGGCAATGTG CCGCCGCGCC ACGACCTGCC GCCAAGCAAG GACCCGCTGG TCAACGTCGC TGTCGAGACG ATGAAGACGG TGCAGGGCAC GTCGCAATAT TACGACCGCG ACAGCGATCC CGATATGGCG CAGGCCGGCC TCGTCGGCTT CCAGGAGTTC ATGGCCAAAC CCGACCGGCG CAAGGCCATC CTGACGCGCC TCGAGGGCAC GCGCAAGCGC ATCTACAAGA TCTAG
|
Protein sequence | MTFLKQFPST TRRRFLKGAG LVSAAAVTGS FPIPAIAQAQ EVTMISAENN GAALDALKAI AAGFSKEAGV NVVINNMDHE AHKTAIRNYL VAGAPDVCSW FSGNRMRAFV KRGLFDDISD LFEKEKYKDV LGATAGAVTE DGKQYGLPTG GTLWGMFYRK DVFEQHGLTV PKTAEEFMAY GDKCKAAGIT PVAIGTKELW PAAGWFDQMN LRINGLDKHM ALMNGEMSYL DPSLKAVFDQ WEAMISKGFF TENHTSFGWQ EAAALLAQKK AGMMNLGAFL RSAFTAEDLP QLSYATFPVL DAKVGHYEEF SVNSIHIPAK AKNKQGAREF LAYFYKPENL AAYLEPGGNV PPRHDLPPSK DPLVNVAVET MKTVQGTSQY YDRDSDPDMA QAGLVGFQEF MAKPDRRKAI LTRLEGTRKR IYKI
|
| |