Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3783 |
Symbol | |
ID | 6982546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3909295 |
End bp | 3910317 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643398505 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002283271 |
Protein GI | 209551354 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTTG CTCTCAACCG TTTTTCGCAG GCGCTTGCGC TTGCCGCCTC GTTTCTCTCC GCAACTGCGC TCGCCGGCAG CGCCAATGCA CAGGACGAAG GTCTCGTCGT CTACAATGCC CAGCACGAAA GCCTCGGCCG CGAATGGATC GACGCCTTCA CCAAGGAGAC CGGCATCAAG GTCACCATGC GCCAGGGCGG CGACATGCAA TTCGCCAACC AGATCATCCA GGAAGGCGAC GCCTCACCGG CTGACGTATT CCTGACCGAG AATTCGCCGG CGATGACGCT GGTCGATGGC GCGGGCCTCT TTGCCCCCAT CGAAAAGGAC ACGCTGGATC AGGTTCCTGA TCAGTACCGG CCGTCCGACG GCATGTGGAC CGGCATTGCT GCCCGCACCA CGGTTTTTGC TTATGACAAG ACGAAGCTCA CCGAAGACAA GCTGCCGAAG TCGATGCTCG ACCTCGCAGA CCCGGCCTGG AAGGGCCGCT GGGGCGCGGC GCCTGCCGGC GCCGACTTCC AGGCCATTGT CGCTTCCCTG CTGCAGCTGA AGGGTGAAGA CGCCACCAAG GCATGGCTGA AGGGTCTCAA GGACAACGCC ACGCCCTATA AGGGCAACAG TGTCGCCATG AAGGCGGTCA ACGCAGGCGA AGTCGAAGGG GCTGTTATCT ATCACTATTA CTGGTTCGGC GATCAGGCGA AAACCGGCGA GAACAGCAAG AATGTCGGCC TGCACTATTT CAAGAACCAG GATCCAGGCG CTTTCGTCAG CGTTTCGGGC GGCGGTATCC TGAAGTCCAC GCAGCACATG AAGCAAGCCC AGGCTTTCCT CAAGTTCGTC ACCAGCAAGG CAGGCCAAGC TGTGCTCAAG GACGGCACGT CCTATGAATA TGCCATCGGC AAGGATACAC CCTCCAACGA CAAGCTCACG CCGCTTGCCG ATCTCAACGC GCCAAAAGTC GAAGCTTCGA CGCTGGACAG CAAGAAGGTC GTGGAACTGA TGACGGCCGC CGGCCTGATC TGA
|
Protein sequence | MNFALNRFSQ ALALAASFLS ATALAGSANA QDEGLVVYNA QHESLGREWI DAFTKETGIK VTMRQGGDMQ FANQIIQEGD ASPADVFLTE NSPAMTLVDG AGLFAPIEKD TLDQVPDQYR PSDGMWTGIA ARTTVFAYDK TKLTEDKLPK SMLDLADPAW KGRWGAAPAG ADFQAIVASL LQLKGEDATK AWLKGLKDNA TPYKGNSVAM KAVNAGEVEG AVIYHYYWFG DQAKTGENSK NVGLHYFKNQ DPGAFVSVSG GGILKSTQHM KQAQAFLKFV TSKAGQAVLK DGTSYEYAIG KDTPSNDKLT PLADLNAPKV EASTLDSKKV VELMTAAGLI
|
| |