Gene Information |
Locus tag | Rleg_4112 |
Symbol | |
ID | 8014910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4189770 |
End bp | 4190786 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644826682 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002977892 |
Protein GI | 241206796 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
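The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 4189770
end_bp = 4190786
gene_length_bp = 1017
protein_length_aa = 338

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose final codon is a stop codon
# that is not counted in the reported protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1017 339 338
```

Under these assumptions the reported gene length (1017 bp) and protein length (338 aa) agree with the coordinates.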
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.512643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTG CTCTTAACCG TTTTTCGCTC GCGCTTGCCG CTTCGCTTCT CTCCGCCACC GCGCTCGCAG GCAGCGCCCA AGCGCAGGAC GAAGGTCTCG TCGTTTACAA TGCCCAGCAC GAAAGCCTCG GCCGCGAATG GATCGACGCC TTCTCCAAGG AGACCGGCAT CAAGGTCACC ATGCGCCAGG GCAGCGACAT GCAGTTCGCC AACCAGATCA TCCAGGAAGG CGACGCCTCC CCGGCGGATG TCTTCCTGAC CGAAAATTCG CCGGCAATGA CGCTGGTCGA TGGCGCGGGC CTCTTCGCCC CCATCGAAAA GGAAACGCTG GATCAGGTTC CGGATCAGTA TCGCCCGGCT GACGGCATGT GGACCGGCAT TGCCGCCCGT ACCACCGTTT TTGCCTATGA CAAGACCAAG CTCACCGAAG ACAAGCTGCC GAAGTCGATG CTCGATCTGG CAGATCCCGC CTGGAAGGGC CGCTGGGGTG CAGCACCTGC CGGTGCAGAC TTCCAGGCCA TCGTCGCTGC CCTGCTGCAG CTCAAGGGCG AAGATGCCAC CAAGGCATGG CTGAAGGGCC TGAAAGACAA TGCCACGCCC TACAAGGGCA ACAGCGTCGC GATGAAGGCC GTCAATTCAG GCGAAGTCGA AGGCGCTGTT ATCTATCACT ATTATTGGTT CGGCGATCAG GCCAAAACAG GCGAGAACAG CAAGAATGTT GGTCTGCACT ACTTCAAGAA CCAGGATCCG GGCGCTTTCG TCAGCGTTTC GGGTGGCGGC ATCCTGAAGT CCACGCAGCA CATGAAGGAA GCCCAGGCTT TCCTGAAGTT CCTGACAAGC AAGGCCGGCC AGGCAGTCCT CAAGGCCGGC GATTCCTATG AATATGCCAT CGGCAAGGAT GCGCCCTCCA ACGACAAGCT CACTCCGCTT GCCGATCTCA ATGCCCCGAA GGTCGAAGCT TCGACCCTCG ACAGCAAGAA AGTCGTGGAG CTGATGACGG CTGCCGGCCT GATCTAA
|
Protein sequence | MKIALNRFSL ALAASLLSAT ALAGSAQAQD EGLVVYNAQH ESLGREWIDA FSKETGIKVT MRQGSDMQFA NQIIQEGDAS PADVFLTENS PAMTLVDGAG LFAPIEKETL DQVPDQYRPA DGMWTGIAAR TTVFAYDKTK LTEDKLPKSM LDLADPAWKG RWGAAPAGAD FQAIVAALLQ LKGEDATKAW LKGLKDNATP YKGNSVAMKA VNSGEVEGAV IYHYYWFGDQ AKTGENSKNV GLHYFKNQDP GAFVSVSGGG ILKSTQHMKE AQAFLKFLTS KAGQAVLKAG DSYEYAIGKD APSNDKLTPL ADLNAPKVEA STLDSKKVVE LMTAAGLI
|
| |
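The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a hedged sketch using only the Python standard library: the codon table is built from the standard genetic code, whose amino-acid assignments translation table 11 shares (table 11 differs in which start codons are permitted). The helper name `translate` and the two excerpts are illustrative; the excerpts are the first thirty and last nine bases of the gene sequence as listed, which begins with ATG and ends with TAA and so appears to be given in coding orientation even though the gene lies on the minus strand.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments of the standard genetic code in NCBI TCAG order.
# Translation table 11 uses the same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate coding-strand DNA, ignoring spaces, up to the first stop codon."""
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the gene sequence above.
gene_start = "ATGAAAATTG CTCTTAACCG TTTTTCGCTC"  # first 30 bases
gene_end = "CTGATCTAA"                          # last 9 bases, including the stop codon

print(translate(gene_start))  # MKIALNRFSL
print(translate(gene_end))    # LI
```

The two results match the first ten residues (MKIALNRFSL) and the last two residues (LI) of the protein sequence listed above.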