Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5048 |
Symbol | |
ID | 8007641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 431192 |
End bp | 432466 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644821963 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973223 |
Protein GI | 241113388 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.133709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTTGG ATAAATTCGG GGGGACGGTG AAACTTGCCG TGGCAGGATT CACCTTGGCG GCGATGACGG CCGGGGCAGC GTTTGCCCAG GACGCCGTGA CGCTGAAATG GGCTTTGTGG GACTGGGACA AGACCGCCTA TTACAAACCG CTGATCGAGG CCTATCAGGC CAAGCACCCC AACGTGAAGT TCGAGCCGAT GGATCTCGGC TCGCAAGACT ATCAGCAGAT GATCTCAACG CAGTTGACCG GCGGCTCCAA GGACATCGAC ATCGTCACTA TCAAGGATGT GCCGGGCTAC ACCAATCTGG TGCGCGCCGG CAACATCGCC GATCTCAGCG GCTTCGTGAA GGATCAGAAG ATCGACCCGG CTCCCTTTGG CGGCCTGATC GAGGAACTGA CCATCGATGG CAAGATCTAC TCCCTGCCGT TCCGCTCCGA CTTCTGGGTT GTCTATTACA ATAAGGATAT ATTCGACAAA GCAGGCGTCC CCTACCCCAC CAATGACATG ACCTGGGCGC AGTTCGACGA GACCGCCGAG AAGCTTTCAG GCGGCATGGG CACCAACAAG ACCTATGGCG CGCTTCTGCA TACCTGGCGG TCAACCGTTC AATTGCCTGC CATCCTCGAC GGAAAACACA CGCTTGTCGA CGGCGACTAC GGCTTCCTGA AGCCCTGGTA CGAGAGGGCG CTGACCCTGC AGAAGGATGG CGCGATTCCC TCCTATGCCT TCCTGAAGAC GTCGAACACA CATTATTCGG CGCTCTTCTT CAATGGCACG ATCGGCATGC TGCCGATGGG AACCTGGTTC GTCGGCACCC AGATCACCAA GGTGAAATCG GGTGAATCGA AGAGCAGGAA CTGGGGCATC GTCAAGTTCC CGCATCCCGA CGGTGTGGCA ACCGGCACGA CCGCTGCGCA GATTTCCGGC CTGGCGGTCA ACGCCAATTC CGACCACAAG GATGCCGCGC TCGATTTCAT CAAGTTTGTC ACCGGTCCTG AGGGCGCAGC AGTTATCGCG TCGACGGGCA CCTTCCCTGC GCTCAAGACG GATGATGTCA GCGCCAAGAT CGCGGCAACA CCCGGATTTC CTGAGGATGC GGCCAGCAAG GAGGCGCTGA AGCCGTCGAA AGCCTACCTG GAGATGGCGG TCAACCCGAA CGCCGCCAAG ATCGAGGTCG TACTCAACCG GGTGCATGAC GCGATCATGA CTGACAGCAC CTCCGTCGAT GACGGGCTGA AGGAAATGAC CGAAGGCGTG AAGGCCATCA AGTAG
|
Protein sequence | MYLDKFGGTV KLAVAGFTLA AMTAGAAFAQ DAVTLKWALW DWDKTAYYKP LIEAYQAKHP NVKFEPMDLG SQDYQQMIST QLTGGSKDID IVTIKDVPGY TNLVRAGNIA DLSGFVKDQK IDPAPFGGLI EELTIDGKIY SLPFRSDFWV VYYNKDIFDK AGVPYPTNDM TWAQFDETAE KLSGGMGTNK TYGALLHTWR STVQLPAILD GKHTLVDGDY GFLKPWYERA LTLQKDGAIP SYAFLKTSNT HYSALFFNGT IGMLPMGTWF VGTQITKVKS GESKSRNWGI VKFPHPDGVA TGTTAAQISG LAVNANSDHK DAALDFIKFV TGPEGAAVIA STGTFPALKT DDVSAKIAAT PGFPEDAASK EALKPSKAYL EMAVNPNAAK IEVVLNRVHD AIMTDSTSVD DGLKEMTEGV KAIK
|
| |