Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5843 |
Symbol | |
ID | 6977232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 255379 |
End bp | 256404 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393298 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_002278116 |
Protein GI | 209546226 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.302205 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACAC AACGGCTCAC CCGGCTCCTC GCGGCCGCGG TCATGGCAGG CAGCTTCGCG ATCGGGAGCA TTGCTCCTGC TTTCGCGGAT CAGACGCTGC TCAACGTTTC CTATGATCCG ACTCGCGAAT TGTACAAGGA TTTCAACGCC GCCTTTGCCG CGAAGTGGAA AAAGGACAAC GGCGAAGCCG TAACGATCCA GGCCTCCCAT GGCGGTTCCG GCGCCCAGGC CCGCTCGGTC ATCGACGGCC TCGACGCCGA TGTCGTGACG CTGGCCCTCG AAGGCGATAT CGACGCCATT GCCAAAGCCA CCGGCAAGAT CCCGGCCGAC TGGAAGACGA AATTCCCCAA CAATTCGACG CCTTATACGT CGACGATCGT CTTCCTGGTC CGCAAGGGCA ACCCGAAGGG CATCAAGGAT TGGGGCGACC TGGTCAAGGA CGACGTGCAG GTGATCACGC CGAACCCGAA GACCTCGGGC GGCGCCCGCT GGAACTTCCT CGCCGCCTGG GCATGGGCCA AGCAGGCGAA TGGCGGCGAT GAAGCCAAGG CGCAGGATTA TGTCGCGAAA CTGCTGCAGC ACGTTCCGGT TCTCGATACC GGCGCGCGCG GCGCCACGAC CACCTTCGTC CAGCGCGGCC TCGGCGACGT GCTGCTTGCC TGGGAAAACG AGGCCTATCT TTCGCTTGAA GAACTCGGCC CCGACCAGTT CGAGATCGTC ACCCCGAGCT TCTCCATCCG CGCCGACCCG CCGGTCGCCG TCGTCGACGG CAATGTCGAC AAGAAGGGCA CGCGCAAGGT CGCCGAAGCC TATCTCAACT ACCTCTATTC GGATGAAGGC CAGAAGATCG CCGCCAAGCA TTATTACCGG CCGTTCAAGC CTGAAGCCGC CGATCCGGCC GATATCGCCC GCTTCCCGAA GCTGACGCTC GCGACCATCG ACGACTTCGG CGGCTGGAAA GAAGCCCAGC CGAAATTCTT CGGCGACGGC GGGGTATTTG ACCAGATCTA TAAGCCGGCC CAATAA
|
Protein sequence | MQTQRLTRLL AAAVMAGSFA IGSIAPAFAD QTLLNVSYDP TRELYKDFNA AFAAKWKKDN GEAVTIQASH GGSGAQARSV IDGLDADVVT LALEGDIDAI AKATGKIPAD WKTKFPNNST PYTSTIVFLV RKGNPKGIKD WGDLVKDDVQ VITPNPKTSG GARWNFLAAW AWAKQANGGD EAKAQDYVAK LLQHVPVLDT GARGATTTFV QRGLGDVLLA WENEAYLSLE ELGPDQFEIV TPSFSIRADP PVAVVDGNVD KKGTRKVAEA YLNYLYSDEG QKIAAKHYYR PFKPEAADPA DIARFPKLTL ATIDDFGGWK EAQPKFFGDG GVFDQIYKPA Q
|
| |