Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0375 |
Symbol | |
ID | 6979089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 383318 |
End bp | 384415 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643395087 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002279900 |
Protein GI | 209547983 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00751674 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000421612 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGGTCAA TCATTGCAAG TGTGACGGCC GCCGCCGTTG CCGCGCTGCT TACCGCCGCG CCGGCCTTTG CGCAGGAGCG CGTGGTCAAC GTCTACAACT GGTCGGATTA TATCGACGAC AGCATTCTTG CCGACTTCAC CAAGGAAACC GGCATCAAAG TCGTCTACGA CACCTTCGAT TCCAACGAGA CCGTGGAAAC CAAGCTGCTG GCCGGCGGCA CCGGTTATGA CGTCGTCGTT CCCACAGCCG ACTTCCTGCA GCGCCAGATC CAGGCCGGCG TCTTCCAGAA GCTCGACAAG TCGAAACTGC CGAACCTCTC CAACATGTGG GATGTGATCC AGCAGCGCAC CGCCGAATAC GACCCGGGCA ACGAACATGC GGTCGATTAC ATGTGGGGCA CCGACGGCAT CGGCTACAAC GTCAAGAAGG TCGCCGAAAT CCTCGGTCCC GATGCCAAGC CCGGCCTCGA AGTGATCTTC GATCCGAAGG TCGCCGCAAA GTTCAAGGAT TGCGGCATCT ATATTCTCGA CACACCGAAG GACGTCATTA CCACGGCGTT TCGCTATCTC GGCCTCGACC CGAACTCCAC CAAGGCCGAG GATTTCAAGA AGGCCGAAGA GCTGCTGACG GCCGCCCGCC CCTATGTCCG CAAGTTCCAT TCGTCCGAAT ACATCAATGC GCTTGCCAAC GGCGACATCT GCATCGCCTT CGGCTATTCC GGAGACATGC TGCAGGCGCG CGACCGTGCG GCCGAAGCCA AGAACGGCGT CGAGGTCAAT TATTCGGTTC CCCCGCAGGG CGCCCAGATG TGGTTCGACA TGATGGCCAT CCCCGCCGAT GCGCCCCACG TCGCCGAAGC CCACGAATTC CTCAACTACA TGATGAAGCC CGAGGTCATC GCCAAGGCGA GCGATCACAC CTTCTATGCC AACGGCAACA AGGCCTCGCA GCAGTTCGTC AGCAAGGACA TTCTGGAAGA CCCTGCCGTC TATCCGACCG AGGCGGTGAT GAAGAACCTC TTCACGGTCA AGCCGTGGGA TCCGAAAACG CAGCGCCTGG GGACGCGCCT CTGGACGAAG GTCGTTACCG GCCAGTAA
|
Protein sequence | MRSIIASVTA AAVAALLTAA PAFAQERVVN VYNWSDYIDD SILADFTKET GIKVVYDTFD SNETVETKLL AGGTGYDVVV PTADFLQRQI QAGVFQKLDK SKLPNLSNMW DVIQQRTAEY DPGNEHAVDY MWGTDGIGYN VKKVAEILGP DAKPGLEVIF DPKVAAKFKD CGIYILDTPK DVITTAFRYL GLDPNSTKAE DFKKAEELLT AARPYVRKFH SSEYINALAN GDICIAFGYS GDMLQARDRA AEAKNGVEVN YSVPPQGAQM WFDMMAIPAD APHVAEAHEF LNYMMKPEVI AKASDHTFYA NGNKASQQFV SKDILEDPAV YPTEAVMKNL FTVKPWDPKT QRLGTRLWTK VVTGQ
|
| |