Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3121 |
Symbol | |
ID | 6981866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3190788 |
End bp | 3192026 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643397831 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002282614 |
Protein GI | 209550697 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAT CCTTTACCAA GACGCTTCTG GGTGCGGCCT TGATCGGCGC ATCTCTTGCG CCGCATGCTT TCGCCGAAAC GACACTGAAC GCGCTTTTCA TGGCGCAGGC CGCTTATAGC GAGGCCGATG TGCGCGCGAT GACCGACGCC TTCGCCAAGG CAAACCCCGA TATCAAGGTC AATCTCGAAT TCGTTCCCTA TGAGGGCCTG CACGACAAGA CGGTGCTGGC GCAGGGTTCC GGCGGCGGTT ACGACGTCGT GCTCTTCGAC GTCATCTGGC CGGCCGAATA CGCCACCAAC AAGGTGCTGG TCGACGTCTC CTCCCGCATC ACCGACGACA TGAAGAAGGG CGTGCTGCCG GGCGCCTGGA CCACCGTGCA ATATGACGGC AAATATTACG GCATGCCGTG GATCCTCGAT ACCAAATACC TGTTCTACAA CAAGGAAATC CTCGAAAAAG CCGGCATCAA GGCGCCGCCG AAGACCTGGG ACGAGCTGAC TGAGCAGGCG AAGGCGATCA AGGACAAGGG CTTGCTCGCC ACCCCGATCG CCTGGAGCTG GTCGCAGGCC GAAGCCGCGA TCTGCGACTA CACCACGCTC GTCAGCGCCT ATGGCGGTGA TTTCCTCAAG GACGGCAAGC CGGCCTTCCA GAGCGGCGGC GGCCTCGATG CGCTGAAATA TATGGTGGCA AGCTATTCGT CCGGCCTGAC CAACCCGAAC TCCAAGGAAT TTCTGGAAGA GGATGTCCGC AAGGTCTTTG AAAACGGCGA TGCCGCTTTC GCTTTGAATT GGACCTACAT GTACAACATG GCCAACGATC CGAAGGACAG CAAGGTGGCG GGCAAGGTCG GCGTCGTGCC GGCACCGGGT GTTGCCGGTA AAAGCCAGGC GTCTGCCGTC AACGGCTCGA TGGGCCTCGG CATCACTTCC GCCAGCCAGC ATCCCGATGA GGCCTGGAAA TACATCACCT TCATGACCTC GCAGGCGACA CAGAACGCCT ATGCGAAGCT CAGCCTGCCG ATCTGGGCCT CCTCCTACGA GGACCCTGCC GTCACCAAGG GCCAGGAAGA ACTGATCTCG GCCGCCAAGG TCGGATTGGC CGCCATGTAT CCGCGCCCGA CGACGCCGAA ATATCAGGAG CTTTCGACCG CGCTGCAGCA GGCGATCCAG GAATCCCTGC TCGGCCAATC CTCTCCGGAG GATGCGTTGA AATCGGCGGC CGACAATAGC GGCCTCTGA
|
Protein sequence | MLKSFTKTLL GAALIGASLA PHAFAETTLN ALFMAQAAYS EADVRAMTDA FAKANPDIKV NLEFVPYEGL HDKTVLAQGS GGGYDVVLFD VIWPAEYATN KVLVDVSSRI TDDMKKGVLP GAWTTVQYDG KYYGMPWILD TKYLFYNKEI LEKAGIKAPP KTWDELTEQA KAIKDKGLLA TPIAWSWSQA EAAICDYTTL VSAYGGDFLK DGKPAFQSGG GLDALKYMVA SYSSGLTNPN SKEFLEEDVR KVFENGDAAF ALNWTYMYNM ANDPKDSKVA GKVGVVPAPG VAGKSQASAV NGSMGLGITS ASQHPDEAWK YITFMTSQAT QNAYAKLSLP IWASSYEDPA VTKGQEELIS AAKVGLAAMY PRPTTPKYQE LSTALQQAIQ ESLLGQSSPE DALKSAADNS GL
|
| |