Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2481 |
Symbol | |
ID | 6981223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2514448 |
End bp | 2515440 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643397196 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002281981 |
Protein GI | 209550064 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.37214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.635435 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATCC CTCCCATGAA AACGATCGTT TCCGCCGCTG CACTTCTTGC CGCGAGCTTC CTTGCCGTTC CGGCCCTTGC CGCAAACCTC GTTCTCTACA CCAGCCAGCC GAACGAAGAC GCGCAGGCGA CGGTCGATGG CTTCATGGCC GCCAATCCCG ATATCAAGGT CGACTGGGTG CGCGACGGCA CGCCGAAGAT CATGGCCAAG CTCCAGGCCG AAATCCAGGC CGGCAACCCG GTTGCCGATC TTCTCCTGAT CGCCGATATG GTGACGCTGG AGCGCCTCAA GGAAGACGGC AAGCTGCTCG CCTATAAGTC GCCGGAAGCC GCGCAATACG ATGTCGCCCT CTATGATGCC GACGGCTATT ACTATTCGAC CAAGCTGATC ACCACCGGCA TCATGTACAA CACCTCGGCG GCGATGAAGC CTGTCAGTTG GAAGGATATG ACGAAGCTGG AAGCCAAGGG CCTCGTCACC ATGCCGAGCC CGCTCGCTTC GGGTGCTGCC CTCATCCATG CCCAGACGCT TGCCGCCGTT CCGGGCCTCG GCTGGGACTT CTACAAATCG CTCGCGGAAA ACGGCGCGAT CGCTGCGGGC GGCAACGGCG CCGTGCTGAA GTCGGTCGCC TCGGGCGAGA AGGCTTACGG CATGGTCGTC GACTATCTGC CGATCCGTGA GAAGGCCAAG GGCGCTCCGG TCGAGTTCGT CTTCCCGAGC GAAGGCGTTT CGGCCGTCAC CGAGCCGGTC GGCATCCTCG CCAGCACCAA AAATGCCGAT GCCGCCAAGA AGTTCGTCGA TTACGTGCTC TCCGAAAAAG GCCAGGAAGG TTTTCTCAAG CTCGGCTACA TCCCGGCCCG CAACGGCATG AAGCTGCCGG AAGGTTTTCC GGCGCGCGAC ACCATCAAGG TCCTGCCGAT CAAGGCCGCA GATGCGCTCA AGAATACCGA CCAGGATCTG AAGACCTTCT CGGGCATCTA CGGTTCGAAC TGA
|
Protein sequence | MRIPPMKTIV SAAALLAASF LAVPALAANL VLYTSQPNED AQATVDGFMA ANPDIKVDWV RDGTPKIMAK LQAEIQAGNP VADLLLIADM VTLERLKEDG KLLAYKSPEA AQYDVALYDA DGYYYSTKLI TTGIMYNTSA AMKPVSWKDM TKLEAKGLVT MPSPLASGAA LIHAQTLAAV PGLGWDFYKS LAENGAIAAG GNGAVLKSVA SGEKAYGMVV DYLPIREKAK GAPVEFVFPS EGVSAVTEPV GILASTKNAD AAKKFVDYVL SEKGQEGFLK LGYIPARNGM KLPEGFPARD TIKVLPIKAA DALKNTDQDL KTFSGIYGSN
|
| |