Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1378 |
Symbol | |
ID | 6980106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1398742 |
End bp | 1400004 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396099 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002280898 |
Protein GI | 209548981 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGGT TTTTGACCAC GACGGCGATG ATCGCATTGG CTCTGGCCGG CGCGCATGTC TCCGCCCGCG GAGCCGACGT GAAGGAAGTT CAGATGCTGC ATTGGTGGAC GTCTGGCGGC GAGGCGGCGG CTTTGAACGT GCTGAAGCAG GATCTGTCGA AGGAAGGTTT TGCCTGGAAG GACGTTCCAG TGGCCGGCGG CGGCGGTGAT GCGGCGATGA CGGCACTGAA GGCGATGGTT GCGGCCGGCA CCTATCCGAC GGCCTCGCAG ATGCTGGGCT ATACCGTGCT CGATTATGCC CAGGCCGGCG TCATGGGCGA CCTGACCGAG ACGGCGAAGA AGGAAGGCTG GGACAAGTCG GTGCCGGCGG CGCTGCAGAA GTTCTCGGTC TATGACGGCA AGTGGGTCGC AGCCCCTGTT AACGTGCACT CGGTCAACTG GCTGTGGATC AACAAGGCGG TGATGGACAA GATCGGCGGC ACCCAGCCGA AGACCTTCGA CGATCTGATC GCGCTGCTCG ACAAGGCCAA GGCCGCAGGT GTCATCCCCT TGGCGCTCGG CGGTCAGAAC TGGCAGGAAG CGACGATGTT CGATTCCATC GTGCTGTCGA CCGGCGGGCC GGAATTCTAC AAGAAGGCCT TCAACGATCT CGATGAGGAG TCGCTGAAGT CGGACACGAT GAAGAAGTCC TTCGACAATC TGGCGACGAT CATCAAATAT GTCGATCCGA ACTTCTCCGG CCGCGACTGG AACCTGGCGA CCGCCATGGT CATCAAGGGT GATGCGCTGG TGCAGGTGAT GGGCGACTGG GCCAAGGGCG AATTCGTCGC CGCCAAGAAG ACCCCGGATA CCGACTTCCT CTGCTACCGC TTCCCCGGCA CCGAAGGCAG CGTCGTCTAT AACTCCGACA TGTTCGGCAT GTTCAACGTT CCCGATGACC GCAAGGCCGC TCAGGTGGCG CTGGCAACCG CGACGCTGTC GAAGAGCTTC CAGTCGGCCT TCAACGTCGT CAAGGGTTCG GTGCCGGCCC GCACCGACGT TCCCGACACC GACTTCGATG CCTGCGGCAA GAAGGGCATC GCCGATCTGA AGGCGGCCAA TGAGGGCGGC ACGCTGTTCG GCTCGCTGGC CCAGGGCTAT GGCGCGCCTC CGGCCATCGC CAATGCCTAT AAGGACGTGG TCTCGAAGTT CGTCCACGGC CAGATCAAGA GCTCCGACGA AGCCGTCAAG CAGCTCGTCC AGGCGATCGA CGACGCTCGC TGA
|
Protein sequence | MNRFLTTTAM IALALAGAHV SARGADVKEV QMLHWWTSGG EAAALNVLKQ DLSKEGFAWK DVPVAGGGGD AAMTALKAMV AAGTYPTASQ MLGYTVLDYA QAGVMGDLTE TAKKEGWDKS VPAALQKFSV YDGKWVAAPV NVHSVNWLWI NKAVMDKIGG TQPKTFDDLI ALLDKAKAAG VIPLALGGQN WQEATMFDSI VLSTGGPEFY KKAFNDLDEE SLKSDTMKKS FDNLATIIKY VDPNFSGRDW NLATAMVIKG DALVQVMGDW AKGEFVAAKK TPDTDFLCYR FPGTEGSVVY NSDMFGMFNV PDDRKAAQVA LATATLSKSF QSAFNVVKGS VPARTDVPDT DFDACGKKGI ADLKAANEGG TLFGSLAQGY GAPPAIANAY KDVVSKFVHG QIKSSDEAVK QLVQAIDDAR
|
| |