Gene Information |
Locus tag | Rleg_4989 |
Symbol | |
ID | 8007580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 372390 |
End bp | 373700 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644821904 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973164 |
Protein GI | 241113329 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
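The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Values taken from the record for Rleg_4989.
length_bp = gene_length(372390, 373700)
print(length_bp)                  # 1311
print(protein_length(length_bp))  # 436
```

Both results agree with the Gene Length and Protein Length fields of this record.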
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.192631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAAA CTATGACCGG TCTGTTGGCC GGTGTCGGAT TGATGTGGGC GTGCGGAACA TCCGCACAGG CCCAGGAACT GACCATCTTC TGGGCCGAGT GGGATCCGGC AAACTACCTT CAGGAACTCG TCAACGAATA CGAGGCTCAA ACCGGCGTCA AGGTCACGGT CGAGACCACA CCGTGGGCCG ACTTCCAGAC CAAGGCCTTC ACCGAGTTCA ACGCCAAGGG TTCAGCCTAT GACATGGTCG TCGGCGACAG TCAGTGGATC GGGGCAGCGT CAGAAGCCGG CCATTACGTC GATCTGACCG ACTTCTTCAC CAAGCACAAT CTGACCCAGG TGATGGCCCC GGCAACGGTG AAATACTACG CCGAATATCC GTCGAACTCG AAAAAGTACT GGTCGGTTCC GGCCGAAGGC GACGCCGTCG GCTGGTCCTA CCGCAAGGAC TGGTTCGAAG ACCCCAAGGA GATGGAGGCG TTCAAGGCCA AATACGGCTA CGATCTCGCA CCGCCGAAGA CATGGGCCGA GATGCGTGAC ATCGCCGAGT TCTTCCACCG TCCAGACCAG AAGCGATACG GAATCGCCAT CTACACCGAC AACTCTTATG ACGGTCTCGT CATGGGTGTC GAGAACGCGA TCTTCTCGTT TGGAGGCGAA CTCGGCGACT ACCAGAGCTA CAAGGTCGAC GGCATCATCA ATTCCGAGAA GAACGTCAAG GCGCTCGAGC TTTATCGCGA GCTCTACGGC TTTACGCCAC CGGGCTGGGC CAAGTCCTTC TTCGTCGAGA ACAACCAGGC GATCACTGAG AACCTGGCGG CGATGAGCAT GAACTACTTC GCCTTCTTCC CGGCCCTGGT GAACGAGGCG TCCAACCCGA ACGCCAAGGT TACCGGCTTC TTTGCCAATC CGGCGGGCCC GAACGGCGAG CAATTCGCAG CGCTCGGCGG CCAAGGCATA TCGGTCATCT CCTACTCAAA AAACCAGGAA GAGGCGATGA AATTCCTCGA ATGGTTCATC AAGGACGAGA CCCAGAAGCG CTGGGCCGAA CTCGGCGGCT ATACGGCAAG CGCCAAGGTG CTTGAATCGC CGGAGTTTCA GAACGCGACA CCCTATAACA AGGCCTTCTA CGAGACGATG TTCAAGGTGA AGGACTTCTG GGCAACGCCT GAATATGCCG AACTGCTGAT CCAGATGAAC CAGCGCATTT ATCCCTTCGT CACTGCCGGC CAAGGCACGG CGAAGGAAGC GCTCGAATCC CTGGCAGCGG ACTGGAACGC GACGTTCGCG AAATACGGAC GCCACAAGTA G
|
Protein sequence | MRKTMTGLLA GVGLMWACGT SAQAQELTIF WAEWDPANYL QELVNEYEAQ TGVKVTVETT PWADFQTKAF TEFNAKGSAY DMVVGDSQWI GAASEAGHYV DLTDFFTKHN LTQVMAPATV KYYAEYPSNS KKYWSVPAEG DAVGWSYRKD WFEDPKEMEA FKAKYGYDLA PPKTWAEMRD IAEFFHRPDQ KRYGIAIYTD NSYDGLVMGV ENAIFSFGGE LGDYQSYKVD GIINSEKNVK ALELYRELYG FTPPGWAKSF FVENNQAITE NLAAMSMNYF AFFPALVNEA SNPNAKVTGF FANPAGPNGE QFAALGGQGI SVISYSKNQE EAMKFLEWFI KDETQKRWAE LGGYTASAKV LESPEFQNAT PYNKAFYETM FKVKDFWATP EYAELLIQMN QRIYPFVTAG QGTAKEALES LAADWNATFA KYGRHK
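The sequences above are displayed in blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of basic consistency checks on a gene sequence in this layout. The helper names are hypothetical, and the short input used in the usage lines is a toy sequence, not the Rleg_4989 gene. The stop codons listed are those of translation table 11. That table also permits start codons other than ATG, so the ATG check is a convenience for records like this one rather than a general rule.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def check_cds(seq: str) -> dict:
    """Report simple properties expected of a complete, unspliced CDS."""
    s = clean(seq)
    return {
        "length_bp": len(s),
        "multiple_of_three": len(s) % 3 == 0,
        "starts_with_atg": s.startswith("ATG"),
        "ends_with_stop": s[-3:] in STOP_CODONS,
    }


# Toy input in the same ten-character block layout (not the real gene).
toy = "ATGCGCAAAT AG"
print(check_cds(toy))
print(round(gc_percent(toy), 2))
```

Applied to the full Gene sequence field, `check_cds` should report a length matching the Gene Length field, and `gc_percent` can be compared against the GC content field.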