Gene Information |
Locus tag | Rleg2_2327 |
Symbol | |
ID | 6981066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2386807 |
End bp | 2388081 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643397040 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002281828 |
Protein GI | 209549911 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
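The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes a single stop codon; the record does not state either convention explicitly. The function names `gene_length` and `protein_length` are hypothetical, chosen for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2386807, 2388081)
print(length_bp)                  # 1275
print(protein_length(length_bp))  # 424
```

Both results agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields.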
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.169538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTTT TGAAGCAATT TCCGTCAACC ACCCGCAGAC GCTTCCTGAA GGGCGCGGGC CTGGTGTCCG CCGCGGCCGT CACCGGCAGC TTCCCGATCC CGGCGATCGC GCAGGCGCAG GAGGTCACGA TGATCTCGGC CGAGAACAAC GGCGAAGCGC TTGACGCGCT GAAGGCGATT GCCGCGGGCT TCAGCAAGGA AGCCGGTGTC AATGTCGTCA TCAACAATAT GGATCACGAG GCCCACAAGA CGGCGATCCG CAACTATCTC GTCGCCGGCG CGCCGGATGT CTGCTCCTGG TTTTCCGGCA ACCGCATGCG CGCCTTCGTC AAGCGCGGCC TGTTCGACGA TATTTCCGAT CTCTTCGAGA AGGAAAAGTA CAAGGATGTG CTGGGGGCGA CGGCCGGCGC CGTCACCGAA GACGGCAAGC AGTACGGCCT TCCCACCGGC GGCACGCTCT GGGGCATGTT CTACCGCAAG GACGTGTTCG AAGAGCATGG CCTCACCGTG CCGAAGACCG CCGAGGACTT CATGGCCTAT GGCGACAAGT GCAAGGCGGC CGGCATCACC CCGGTTGCGA TCGGCACCAA GGAATTGTGG CCGGCGGCCG GCTGGTTCGA TCAGATGAAC CTGCGCATCA ACGGGCTCGA CAAGCACATG GCGCTGATGA ACGGCGAAAT GAGCTATCTC GACCCGTCGC TGACCGCCGT CTTCGACCAG TGGGAAGCGA TGATTTCCAA GGGCTTCTTC ACCCCCAACC ATACCTCCTT CGGTTGGCAG GAGGCCGCAG CGCTTCTGGC GCAGAAGAAG GCGGGGATGA TGAACCTCGG CGCCTTCCTG CGTTCGGCCT TCACCGCCGA GGATCTGCCG CAGCTTGGCT ACGCGACCTT CCCGGTGCTC GACGCCAAAG TCGGTCATTT CGAGGAGTTC TCGGTCAATT CGATCCACAT TCCCGCCAAG GCGAAAAACA AGCAGGGGGC CCGCGACTTC CTCGCCTATT TCTACAGACC GGAGAACCTG GCGGCCTATC TCGAGCCCGG CGGCAACGTG CCGCCGCGCA ACGACCTGCC TCAGAGCAAG GATCCGCTGG TCAATATCGC TGTCGAGACG ATGAAGACGG TGCAGGGCAC CTCGCAATAT TACGACCGCG ACAGCGACCC CGACATGGCC CAGGCCGGCC TCGTCGGCTT CCAGGAGTTC ATGGCCAAAC CCGACCGGCG CAAGGCGATC CTCACGCGCC TCGAGGGGAC GCGCAAGCGG ATCTATAAGA TCTGA
|
Protein sequence | MTFLKQFPST TRRRFLKGAG LVSAAAVTGS FPIPAIAQAQ EVTMISAENN GEALDALKAI AAGFSKEAGV NVVINNMDHE AHKTAIRNYL VAGAPDVCSW FSGNRMRAFV KRGLFDDISD LFEKEKYKDV LGATAGAVTE DGKQYGLPTG GTLWGMFYRK DVFEEHGLTV PKTAEDFMAY GDKCKAAGIT PVAIGTKELW PAAGWFDQMN LRINGLDKHM ALMNGEMSYL DPSLTAVFDQ WEAMISKGFF TPNHTSFGWQ EAAALLAQKK AGMMNLGAFL RSAFTAEDLP QLGYATFPVL DAKVGHFEEF SVNSIHIPAK AKNKQGARDF LAYFYRPENL AAYLEPGGNV PPRNDLPQSK DPLVNIAVET MKTVQGTSQY YDRDSDPDMA QAGLVGFQEF MAKPDRRKAI LTRLEGTRKR IYKI
|
| |
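The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. All names (`clean`, `gc_fraction`, `translate`, `CODON_TABLE`) are hypothetical. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares; table 11's alternative start codons are not modeled here, which does not matter for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order at each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the display spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence shown above.
head = "ATGACATTTT TGAAGCAATT TCCGTCAACC"
tail = "ATCTATAAGA TCTGA"
print(translate(head))  # MTFLKQFPST
print(translate(tail))  # IYKI
print(gc_fraction("ATGACATTTT TGAAGCAATT"))  # 0.25
```

The translated fragments match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.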