Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6276 |
Symbol | |
ID | 6983349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011370 |
Strand | + |
Start bp | 225056 |
End bp | 226546 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643399284 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002284040 |
Protein GI | 209552124 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0695772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCTG CATTCGGTGG AGCTGTAAAG GCTGCTCCCA AGAGAGGCGG TAACTTACGC ATTGGCATCG CCGGAGGCAC TTCCGGAGAC AGCCTCGATC CGACGTCAAC GCCGGTCGAC GCGGGCTTTC TCACGCTCAA TACCATGCGC AGTACGCTTG TCGGGATGAA TGCGAAAGGC GAGCCCACTC CTTTGTTGGC GGAGAGCTGG GAGCCATCGA ATGATCTGAC CAAATGGTAT TTTAACATCA GGAAAGGCGC GACCTTCCAC AGCGGCAAAT CGGTGACGGC TGACGATGTG GTCGCATCGC TCAATCTGCA TCGCGGTGAC AAGACAACTT CGCCGGCAAA AGCTCTCCTT GATCCAGTTA CAAATGTCGA GGCCGATGGT CCTAACCGGG TCGTCATCAC GCTCAACCGT GCGAACATTG AGTTTGTTAG CCTTTTCAAA ACGGATTTCC TGGTCATACT GCCTTCTAAG GATGGCGTAA TCGACCGTGC TAGCAAGGAT GGGACAGGTC CATATGCACT TGAAAGCTTT GAACCTGGTC AGCACCTCAG TTTCAAGCGA AATCCAAACT ATTGGGATCT TGACAATTAC GGCTTCTTCG ACTCAGCGGA AGTCGTTGTC ATTGCGGACC CAGCCGCCCG CATGAACGCC CTGCGCTCGG GTCGGGTCGA TCTTGTTAAC TCGGTAGATC TAAAGACCGC CGCGATGCTG AAGCGCGTTG CAGGCCTTAA GCTAGAAAAC ATTCCGAGCG GATTGTACTA CGGCATGCCG ATGCTTGTCG ACGTTGCTCC GTTCAATGAT AATAATGTCC GTATGGCTCT GAAGTACGCG ATCAAACGGC AGGAAATCGT TGACAAGGTC CTTCTTGGCC ACGGAACTGT GGGCAACGAC CAGCCAATCT TCAAGAACGT CAAGTTTGCC GCGACGGACC TCCCGCAACG GGAATACGAT CCGGACAAAG CCCGTCATTA CCTGAAGCAG GCGGGCTATG ATTCAATCGA TCTACCGCTG AACGTCGCGG AAATTGGCTT CCCGGGCGCG ACCGCTGTTA GTCAGCTGTT CGCCGCTTCG GCCAAGGCAG CTGGGATCAA CCTCAACGTT ACGAGAGAGC CCGACGACGG TTACTTCGAA CGCGTATGGA TGAAGCAACC GTTTACAACC GCCTATTGGC ATCAGGCCGT GACGGCTGAC TCCCGCTTTA CCGAGGCTTT CCTGCCTGGT GCTGCTTGGA ACGAAACTCA CTTCAACAAC CCACGCTTCA ACGAGCTTGC CGTAAAGGCG CGCGAGACCG TGGACGAGAA CACCCGAGCC GGTATGTATC ATGAAATGCA GCGCATCATA TACGACGAAG GCGGCCTCTT GAACCCTGTG TTCGCCAACT ACGTCTGGGC AATGAAGGAT AACGTTCACC GCCCCGACGA TGTGACCACT CTTGGAGACT TGGACTCGTT CCAATGCATT TCTCGCTGGT GGATGGCTTA A
|
Protein sequence | MTAAFGGAVK AAPKRGGNLR IGIAGGTSGD SLDPTSTPVD AGFLTLNTMR STLVGMNAKG EPTPLLAESW EPSNDLTKWY FNIRKGATFH SGKSVTADDV VASLNLHRGD KTTSPAKALL DPVTNVEADG PNRVVITLNR ANIEFVSLFK TDFLVILPSK DGVIDRASKD GTGPYALESF EPGQHLSFKR NPNYWDLDNY GFFDSAEVVV IADPAARMNA LRSGRVDLVN SVDLKTAAML KRVAGLKLEN IPSGLYYGMP MLVDVAPFND NNVRMALKYA IKRQEIVDKV LLGHGTVGND QPIFKNVKFA ATDLPQREYD PDKARHYLKQ AGYDSIDLPL NVAEIGFPGA TAVSQLFAAS AKAAGINLNV TREPDDGYFE RVWMKQPFTT AYWHQAVTAD SRFTEAFLPG AAWNETHFNN PRFNELAVKA RETVDENTRA GMYHEMQRII YDEGGLLNPV FANYVWAMKD NVHRPDDVTT LGDLDSFQCI SRWWMA
|
| |