Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3723 |
Symbol | |
ID | 8014561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3776798 |
End bp | 3777958 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826286 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002977505 |
Protein GI | 241206409 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.415779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.44187 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAATC TGTTGAAATC CTGCACGGCA GCGCTCGCTT GCCTCAGCTT CGCGACGCAG GGAATCGCCG CCGAACCGCT GAAGGCTCTG GGCAAAGGCG AAGGTGCGGT CAGCATCGTC GCCTGGGCCG GCTATATCGA ACGCGGCGAA ACCGACAAGA ACTACGACTG GGTCACCGAT TTCGAAAAGG AGACCGGCTG CAAGGTTTCA GTCAAGACCG CCGCCACCTC GGATGAAATG GTGTCGCTGA TGAACGAAGG CGGCTTCGAT CTCGTCACGG CATCGGGCGA CGCCTCGCTC CGCCTTATCG CCGGCAAGCG TGTCCAGCCG ATCAACACCG ACCTGATCCC GAGCTTCAAG ACCGTCGACG AGCGTCTGCA GAACGGACCG TGGTATACGG TCGGCGGCGT GCATTACGGC GTGCCCTATC TCTGGGGGCC GAACGTGCTG ATGTACAATA CCGATGCCTT CAAGGACAAG GCGCCGACCA GCTGGAACGT CGTCTTCGAA GAGCAGACGC TGCCCGACGG CAAGTCCAAC AAGGGCCGCG TCCAGGCCTA TGACGGCGCT ATCTACATCG CCGATGCCGC TATGTATCTG ATGGCCCATA AGCCGGATCT CGGCATCAAG GATCCCTACG AGCTGAACGA GGACCAGTAC AAGGCCGCCC TCGACCTGCT GCGCGGCCAG CGCAAGCTCG TCTCCCGCTA CTGGCACGAC GCGATGATCC AGATCGACGA TTTCAAGAAC GAAGGCGTCG TCGCCTCCGG CTCCTGGCCC TTCCAGGTGA ACCTGCTGCA AGCCGACAAG CAGAAGATCG CCTCCACTTT TCCGGATGAA GGCGTCACCG GCTGGGCCGA CACCACCATG CTGCATGCCG ACAGCGAACA TCCGAACTGC GCCTATATGT GGATGGAACA TTCGCTGAAG GCCAAGGTCC AGGGCGACGC CGCCGCCTGG TTCGGCGCCG TGCCCTCCGT TCCCGCTGCC TGCAAGGGCA ACGAGCTGAT AGGCGACAGC GGTTGCGCCA CCAACGGCTT CGATCACTTC GACAAGATCA AGTTCTGGAA GACCCCGGTC GCCAAATGCA CCACGCAGAG CGAATGCGTG CCGTACCATC GTTGGGTGTC TGATTATATC GGCGTGATCG GCGGGCGGTA A
|
Protein sequence | MTNLLKSCTA ALACLSFATQ GIAAEPLKAL GKGEGAVSIV AWAGYIERGE TDKNYDWVTD FEKETGCKVS VKTAATSDEM VSLMNEGGFD LVTASGDASL RLIAGKRVQP INTDLIPSFK TVDERLQNGP WYTVGGVHYG VPYLWGPNVL MYNTDAFKDK APTSWNVVFE EQTLPDGKSN KGRVQAYDGA IYIADAAMYL MAHKPDLGIK DPYELNEDQY KAALDLLRGQ RKLVSRYWHD AMIQIDDFKN EGVVASGSWP FQVNLLQADK QKIASTFPDE GVTGWADTTM LHADSEHPNC AYMWMEHSLK AKVQGDAAAW FGAVPSVPAA CKGNELIGDS GCATNGFDHF DKIKFWKTPV AKCTTQSECV PYHRWVSDYI GVIGGR
|
| |