Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4335 |
Symbol | |
ID | 8015114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4456289 |
End bp | 4457575 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644826911 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002978114 |
Protein GI | 241207018 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.000444207 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGTCA ATCGCCGTTC ATTTCTGATG GGTTCAGCCG GTGCGGCCGC CGGTCTCGCC CTTGGTGCCG GAAGCGCCAT TCCGGCTTTT GCCGAAGACG CACAGCTCCG CGCCATGTGG TGGGGATCCA ACGACCGCGC CAAGCGCACG CTTGAGGTTG CCAAGCTCTA TCAGTCGAAA TCGCCTGGCG TCACTCTCGT CGGTGAATCA CTTTCGGGCG ACGGCTACTG GACGAAGCTC GCAACGCAGA TGGCCGGGCG CTCGATCGCC GACATCTTCC AGCTCGAGCC GGGAACAATT TCAGACTACT CGAAGCGCGG CGCCTGCCTG CCACTCGATG AATTCGTGCC CTCCACGCTG GACGTTCAGT CCTTTGGCGC CGACATGCTG AAACTGACCA CCATCGACGG CAAGCTCTAT GGTGTTGGCC TCGGCCTCAA TTCCTTCTCG ATGTTCTTCG ACACAGTCGA ATTCGAAAAG GCAGGCATCC CATTGCCGAC GCCCGACCTC ACCTGGGATG AGTACGCCAA GCTCGCCGTC GAACTCGCCA AGTCTTCCGG CAAGGGCGGC GGCCCCTATG CGGCGCGCTA CGCCTATGTG TTCGATGCCT GGCTGCGCCA GCGCGGCAAG AGCCTCTTTG CGAGGGAAAG CGTTGGGCTC GGCTTCACGG CCGACGATGC CAAGGAATGG TTCGACTACT GGGAGAAACT GCGCAAAGCG GGTGGCACCG TTGCCGCCGA CGTGCAGACG CTCGATCAAA ATACAATCGA CACCAATTGC CTTGGTCTCG GCAAATCGGT GATTGGGATG GCCTATTCAA ACCAGATGGT CGGATATCAA CTGATCATCA AGAACAAGCT TGGCATCACC ATGCTGCCAC GGGACAAGAA GGGCGGTCCG TCCGGCCATT ATTACCGTCC GGCACTGATC TGGAGTGTGG GCGCGACGAG CAAGCATGGC GAAGCTGCCG CGAAGTTTAT CAGCTTCTTC GTCAACGATC CCGAAGCCGG CAAGATCCTT GGCGTGGAAC GCGGCGTGCC AATGTCGCCT ACCGTGCGCG AAGCCATCCT GCCGCAACTC AACCCGACAG AGCAGGAAAC GGTCAAATAC GTGAATCTGC TCAAGGATCA GGTCGGCGAA TATCCGCCAC CGGTGCCGAT GGGCGCAACC CAATTCGACC AGCGCGTGCT GCGCCCGCTT TGTGACGAAC TCGCCTTCGA ACGGATTTCG CCCGCCGATG CGGCGACCCG GCTCATCGAA GAGGGTAAGG CAACGATCAA GGGATGA
|
Protein sequence | MQVNRRSFLM GSAGAAAGLA LGAGSAIPAF AEDAQLRAMW WGSNDRAKRT LEVAKLYQSK SPGVTLVGES LSGDGYWTKL ATQMAGRSIA DIFQLEPGTI SDYSKRGACL PLDEFVPSTL DVQSFGADML KLTTIDGKLY GVGLGLNSFS MFFDTVEFEK AGIPLPTPDL TWDEYAKLAV ELAKSSGKGG GPYAARYAYV FDAWLRQRGK SLFARESVGL GFTADDAKEW FDYWEKLRKA GGTVAADVQT LDQNTIDTNC LGLGKSVIGM AYSNQMVGYQ LIIKNKLGIT MLPRDKKGGP SGHYYRPALI WSVGATSKHG EAAAKFISFF VNDPEAGKIL GVERGVPMSP TVREAILPQL NPTEQETVKY VNLLKDQVGE YPPPVPMGAT QFDQRVLRPL CDELAFERIS PADAATRLIE EGKATIKG
|
| |