Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5365 |
Symbol | |
ID | 8007323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 774533 |
End bp | 775789 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644822269 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973529 |
Protein GI | 241113694 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.559287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAAAT TGATCTTGCC TCTACTTGCA TCGGCGGCCT TGGCAATCGC CGCACCCGCT CTGGGCCAGG ACGCCAAGCC GCTTGCCGGT CAGTCGATCA CGGTGCTCAT GCCATCGCCG CAGGGACCGA ACATCGCCTC TGACTTCGAG GCCGAAACCG GCATCCACGT CGACCTGCAG ACCCTGTCGT GGGACGACAT CCGCCCGAAG CTAGTGACAG CCTTGGTTGC CGGTACAGCG CCAGCGGATG TGACCGAATT CGATTGGTCC TGGACCGGTC AGTTCAGCGC GGCCGGCTGG TATATGCCGC TCAACGACGT GATCGACGCC GACACGTTGA AAGATATCGG CGTTGCCAAG ATCTTCACGG TTGACGGCAA ATTATTGGGA ATTCCCTACA CCAACGACTT CCGGGTGATG CTCGTCAACA AGAAGCATTT CGCGGATGCA GGCATCACCG AGATGCCGAA AACATTGGAC GCCCTGGTCG CTGCAGCTAA GAAGATCAAG GAAAAGGGTA TTGTCGAGTA TCCGGTCGGC CTGCCGGTTT CAGCGACCGA AGGCGCGTCA ACGAGCTGGT ATCTGCTGAC CAAGGCCTTC GGTGGCGAAC TCTTCGACAA GGACTTCAAT CCGCTCTTCA CCTCGCCGGA TTCGGCTGGT TACAAGGCGC TTGCTTTCGA GTTGATGCTC CTCAAGGAAG GGCTGGTCGA TCCGGCGTCT ACCGGCCTGA AGGACAGCCA AATCAATGAG AGCATGTTCG CGCAGGGCAT CACCAGCATC ATGATCTCCG GCGAGCCTGG TCGTCTTGGC CAGATGAACG ATCCCAAGCA GTCGAAAGTC GCCGGCCAGG TAGAGGCGAT CCTCGTGCCG ACGGCAAGCG GCGAAACGCG CAGTTTCGGC CTTCCGGAAG CGCTGGCTAT TCCAAACGTC TCTCCCAACA AGGAGGCGGC GATTGCCTTC GTCAAATGGT TTACCAGCAA GGACTTCCAG AAGAAGAATG CCGTGAACGG CTTCCTGCCG ACACGCACGT CGGCGCTTTC CGAACTCAAC GAGTCCGGAA AGCTGAACAG CGGCGACGCT CTCGTGGCGC AATCGAAGAC GGTTGAGCCG CTGTTCCCGC AAGGCACGCC GCCTTGGTAC CCGCAATTTT CCAGCGGCGT GAACACAGCG ATCAATAGCG CTGCCAAGGG ACAGATGAGC GTCGACCAGG CGATGGAGGC CATCGCTTCC GCCGCCAAGC AGGCAATGGC GCAATGA
|
Protein sequence | MHKLILPLLA SAALAIAAPA LGQDAKPLAG QSITVLMPSP QGPNIASDFE AETGIHVDLQ TLSWDDIRPK LVTALVAGTA PADVTEFDWS WTGQFSAAGW YMPLNDVIDA DTLKDIGVAK IFTVDGKLLG IPYTNDFRVM LVNKKHFADA GITEMPKTLD ALVAAAKKIK EKGIVEYPVG LPVSATEGAS TSWYLLTKAF GGELFDKDFN PLFTSPDSAG YKALAFELML LKEGLVDPAS TGLKDSQINE SMFAQGITSI MISGEPGRLG QMNDPKQSKV AGQVEAILVP TASGETRSFG LPEALAIPNV SPNKEAAIAF VKWFTSKDFQ KKNAVNGFLP TRTSALSELN ESGKLNSGDA LVAQSKTVEP LFPQGTPPWY PQFSSGVNTA INSAAKGQMS VDQAMEAIAS AAKQAMAQ
|
| |