Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2919 |
Symbol | |
ID | 6981663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2975543 |
End bp | 2976544 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643397629 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_002282413 |
Protein GI | 209550496 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0603439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TGCTCGCATC TACATGTCTG CTGGCCGGCC TTGCCGGCGG CGCATCGCTG TCTCATGCGG CCGAATGCGG CGACGTCGTG CTCGCCGTGC ACAATGTGCA AAGCGCCGAG GCCCTGACCT TCGTCGACAA GTTCATCCTC GAGAACGGTT ACGGCTGCAA TGTCGAAACC GTTCCTGGCG ATACGGTGCC GACCACCACG TCGATGGTGG AGAAGAGCCA GCCCGACGTC TCGTCCGAAA CCTGGGTCGA CCTGCTGCCG GAGATCGTTC CGCGTGGCGT TGAGGAAGGC AAGATCGTCT TCGGCGCCCC GGCGCTTCCC GATGGCGGCA TCCAGGGCTG GTGGATCCCG AAATATCTGG CCGACGCCCA CCCCGACATC AAGACCGTCG AGGACGCGCT TGCCCATCCC GACCTCTTTC CCGACCAGGA AGACTCGAGC AAGGGCGTGA TCTTCAACGG CGCCGAAGGC TGGGGCGCAA CCGTCGTCAC GACGCAGCTG TTCAAGGCCT ACAAGGCGGC CGATAAGGGC TTCACTCTGA TGAACCCGGG TTCGGCCGCC GGCCTCGACG GCGCGATCGC CAAGGCCTAT GAACGCAAGC AGGGCTTCAT CACCTATTAT TGGGCGCCGA CGGCGCTGCT CGGCAAATAT GAAATGGTGA GGCTCAAGCA GAACGGCGCA CCGGATGCCG TCGAATGGAA GCGCTGCATC ACCAATCTTG CCTGCCCCGA TCCGAAGGTC GCCGACTGGC CGGTCGACAA GGTGATGACC GTCGTGACCA AGAAGTTCGC CGATCGCACC AGCCCCGACG TCATGGCCTA TTTCGGCAAA CGCGGCTGGA GCAACGACAC GGTCGGCAAG CTCATGGCCT GGGAAACGGA GAACCAGGCC ACCGGCGAAG ACGGCGCCAA GCACTTCCTC CAGGAAAACG AGACGATCTG GTCGCAATGG GTGCCGGCCG ATGTCGCCGA GAAAATCAAG GCCGCCCTTT AA
|
Protein sequence | MKKLLASTCL LAGLAGGASL SHAAECGDVV LAVHNVQSAE ALTFVDKFIL ENGYGCNVET VPGDTVPTTT SMVEKSQPDV SSETWVDLLP EIVPRGVEEG KIVFGAPALP DGGIQGWWIP KYLADAHPDI KTVEDALAHP DLFPDQEDSS KGVIFNGAEG WGATVVTTQL FKAYKAADKG FTLMNPGSAA GLDGAIAKAY ERKQGFITYY WAPTALLGKY EMVRLKQNGA PDAVEWKRCI TNLACPDPKV ADWPVDKVMT VVTKKFADRT SPDVMAYFGK RGWSNDTVGK LMAWETENQA TGEDGAKHFL QENETIWSQW VPADVAEKIK AAL
|
| |