Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4368 |
Symbol | |
ID | 8015142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4496148 |
End bp | 4497101 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826944 |
Product | aliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein |
Protein accession | YP_002978146 |
Protein GI | 241207050 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.178631 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA CAAGGCGAGC TTTCACTGCA GTGGTGGCAG GTGCCATCGC GACACCTTTG ACCCACGTTC GGCTGGCCGG CGCCGCTGAC AAGGCCGTGC GTATCGGCTA CCAGAAATAT GGAACGCTGG TGCTTCTCAA GGGCAAAGGC ACGCTGGAGA AGAAACTGGA ACCCATCGGC TACACCGTTG AATGGACCGA GTTTCCGGGC GGTCCTCAAT TGCTCGAGGC CCTGAACGCT GGTGCTGTCG ATTTCGGATC GACCGGCGAA ACGCCGCCGA TCTTCGCCCA GGCCGCAAAC GCTCCGCTTG TCTATATCGC ACACGAGCCG CCCGCCCCGC GCGGAGAAGC CATACTGGTC CCGAAGGACA GCCCCATAAA GTCCGTCGCG GAACTGAAAG GCAAGAAGGT CGCCTTCAAC AAGGGCTCGA ACGTGCATTA CCTGCTCGTC AAGGCGCTCG AAGAAGCCGG CTTGACCTAT GAGGACGTCG AATCATCCTT CCTCGCTCCG GCCGACGGCC GCGCAGCCTT CGAAAAGGGC GCGGTTGATG CTTGGGTCAT CTGGGATCCC TTCCAGGCCG CGGCGGAAGT CGCGGTCGAA GCGCGAGAAC TCAGAAACGG CGAAGGCATC GTCCCCAATC ACCAGTTCTA TCTCGGCACC AAGTCGCTGG TCGACGGTCA CGCCGAGGCG ATCGATGTCG TGATCGACGC CATTTCCGAG ATCGACGAAT GGACCAAGTC CGACACCGCG GCTGCGGCTG CGGAGCTCTC ACCATCGGTC GGCATCCCCG AACCTGTCCT CGTCAAGGCG CTCGAGCGCC AGTCCTACGG GGTCAAGAGC CTGGACGACA CCGTCGTCGC GCAGCAGCAG AACATCGCCG ACACCTTTTT CAAACTGAAG CTCATTCCCA AGGAGGTGAC GATTGCCGAC GTTGTCCGCA AGGGCAAGGC GTGA
|
Protein sequence | MKITRRAFTA VVAGAIATPL THVRLAGAAD KAVRIGYQKY GTLVLLKGKG TLEKKLEPIG YTVEWTEFPG GPQLLEALNA GAVDFGSTGE TPPIFAQAAN APLVYIAHEP PAPRGEAILV PKDSPIKSVA ELKGKKVAFN KGSNVHYLLV KALEEAGLTY EDVESSFLAP ADGRAAFEKG AVDAWVIWDP FQAAAEVAVE ARELRNGEGI VPNHQFYLGT KSLVDGHAEA IDVVIDAISE IDEWTKSDTA AAAAELSPSV GIPEPVLVKA LERQSYGVKS LDDTVVAQQQ NIADTFFKLK LIPKEVTIAD VVRKGKA
|
| |