Gene Information |
Locus tag | Rleg_2897 |
Symbol | |
ID | 8013830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2894316 |
End bp | 2895419 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644825467 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002976696 |
Protein GI | 241205600 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
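The coordinates and lengths in the table above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2894316
end_bp = 2895419

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1104
print(protein_length_aa)  # 367
```

Both results agree with the Gene Length and Protein Length rows, and the zero remainder confirms that the CDS is a whole number of codons.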
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCTCTA ACATTTCTCG ACTACTGTCG CTTTCTACTG CGATGATCGT GGCTTCGACC GCGATTGCCG CTGCCGAGCC GAGCGCTGAA CTTATCGCCG CCGCCAAGAA GGAAGGCACC CTGACGACGA TCGCTCTTCC GCACGACTGG TGCGGCTACG GCGAGGTCAT TGCCGGCTTC AAGGCCAAGT ATGGCCTCGA GGTCAACGAG CTCAACCCAG ATGCGGGTTC GGGCGACGAA GTCGAAGCCA TCAAGGCCAA CAAGGGCAAC ACCGGCCCGC AGGCTCCTGA CGTCATCGAC GTCGGGCTCT CTTTCGGCCC GTCCGCAAAG AAAGACGGCC TGATCCAGCC TTACAAGGTC TCCACCTGGG ACTCCATCCC GGATACCGCC AAGGATGCCG AAGGCTACTG GTACGGCGAT TATTACGGCG TTCTCTCGTT CCTCGTGAAC AAGGATCTCG TCAAGGAATC GCCGGCCGAT TGGGCCGATC TGAAGAAGAG CGACTACGCA AACAGCGTCG CCCTCGCGGG CGATCCGCGC GCCGCCAACC AGGCTGTCCA GGGCGTCTAT GCCGCTGGTC TTTCCGCATC CGGCGGTGAC GCGGCCAAGG CAGGCGAAGA AGGCCTGAAG TTCTTCGCCG AATTGAACAA GAGCGGTAAT TTCGTGCCGG TCGTCGGCAA GGCAGCTCCG TTCGCCCAGG GCTCCACGCC GATCATCGTT GCCTGGGATT ACAACGCCCT CTCCTGGGGC GAAAGCCTGA AGGGCAATCC TCCGTTCGAG GTCGTCGTTC CGAAGACAGG CGTCGTTGCC GGCGTCTACG TCCAGGCGAT TTCCGCCTTC GCTCCGCACC CGAACGCTGC GAAGCTCTGG ATGGAATACC TCTATTCCGA CGAAGGTCAG CTCGGCTGGC TGAAGGGCTA TTGCCACCCG ATCCGCTTCA ACGATCTTGC CAAGAACAAC AAGATCCCGA AGGAACTGCT CGATAAGCTG CCGCCGGCTG CATCCTATGA AAAGGCTGTC TTCCCGACGC TCGAAGAACA GTCTGCTGGC AAGGAAGCCA TCACCAAGAA CTGGGATTCC GTCGTCGGCG CCGCCGTCAA GTAA
|
Protein sequence | MISNISRLLS LSTAMIVAST AIAAAEPSAE LIAAAKKEGT LTTIALPHDW CGYGEVIAGF KAKYGLEVNE LNPDAGSGDE VEAIKANKGN TGPQAPDVID VGLSFGPSAK KDGLIQPYKV STWDSIPDTA KDAEGYWYGD YYGVLSFLVN KDLVKESPAD WADLKKSDYA NSVALAGDPR AANQAVQGVY AAGLSASGGD AAKAGEEGLK FFAELNKSGN FVPVVGKAAP FAQGSTPIIV AWDYNALSWG ESLKGNPPFE VVVPKTGVVA GVYVQAISAF APHPNAAKLW MEYLYSDEGQ LGWLKGYCHP IRFNDLAKNN KIPKELLDKL PPAASYEKAV FPTLEEQSAG KEAITKNWDS VVGAAVK
|
| |
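The relationship between the gene sequence, the protein sequence, and the GC content row can be sketched as follows. This is a minimal illustration, not the pipeline that produced the record: the helper names (`clean`, `gc_percent`, `translate_cds`) are hypothetical, the codon table is the standard amino-acid assignment that translation table 11 shares, and the first codon is assumed to be a valid initiator (here GTG) that is read as M.

```python
# Codon table in NCBI order: bases T, C, A, G for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        # Assumption: the first codon is an initiator and is translated as M.
        protein.append("M" if i == 0 else amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGATCTCTA ACATTTCTCG ACTACTGTCG"))  # MISNISRLLS
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content row.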