Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3852 |
Symbol | |
ID | 6982615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3993824 |
End bp | 3995467 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643398574 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002283340 |
Protein GI | 209551423 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.479782 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATAA CCAAGCTTAG CCGCAATTTC CGCATGCTTT CCACGGGGGC TGCTCTTTCG CTCCTGATGA TGACCGCACC CTCCGCCTTC GCCGAGACGC CCAAGGATAC GCTGGTCGAG GGCTTTGCCA TCGACGACAT CATTACGATG GATCCGGGCG AGGCGTTCGA GCTTTCGACC GCCGAAATCA CCAGCAATAG CTACAGCCTG CTTGTCCGTC TCGACATGGA CGACACGTCC AAGGTCAAGG GCGATCTGGC CGACAGCTGG AGCGTTTCCG ACGACGGTCT GACCTATACG TTCAAGCTGA AATCCGGCAT GAAATTCGCC TCCGGCAACC CGATCACCGC CGAAGATGTT GCTTGGTCGT TCGAGCGCGC CGTCAAGCTC GACAAGAGCC CGGCCTTCAT CCTCACTCAG TTCGGCCTGA CCGGCGACAA CGTCAGCGAA AAGGCCAAGG CGGCCGATGC CGGCACTTTC GTCTTCACCG TCGACAAGGC CTATGCGCCG AGCTTCGTTC TCAACTGCCT GACGGCGACG GTTGCCTCCG TCGTCGACAA GAAGCTGGTG ATGGACCATG TGAAGGCGGT AACACCAGAT GCCGAGCACA AATACGACAA TGATTTCGGC AATGAATGGC TGAAGACCGG CTATGCCGGC TCCGGAGCCT TCAAGCTGCG CGAATGGCGC GCCAATGAAG TGGTCGTTCT CGAGCGCAAC GACAATTATT ACGGCGACAA GGCAAAGCTC AACCGCGTCA TCTACCGCTA CATGAAGGAA AGCTCGGCCC AGCGGCTGGC GCTCGAAGCC GGCGATATCG ATATCGCCCG CAACCTCGAG CCTGGCGACA TCGACGCCGT TTCGAAAAAT GCCGATCTCG CGACGACGAG TGCGCCGAAG GGCACGATCT ATTATGTCAG CCTCAACAAC AAGAACGAGA ACCTGAAGAA GCCCGAGGTG CAGGAAGCCT TCAAATATCT GGTCGACTAT GATGCGATCG GCGCGACCTT GATCAAGGGT ATCGGCGAAA TCCACCAGAC CTTCCTGCCG AAGGGCCAGC TCGGCGCGCT CGACGAAAAT CCCTACAAGC TCGATGTCGC CAAGGCCAAG GAACTGCTGG CCAAGGCCGG CGTGCCCGAC GGTTTCTCGA TCACCATGGA CGTGCGCAAC AGCCAGCCGG TGACCGGTAT CGCCGAATCC ATGCAGCAGA CGCTGGCGCA GGCCGGCGTG AAGATGGAAA TCATCCCGGG TGACGGCAAG CAGACGCTGA CCAAATACCG CGCCCGCACA CATGATATGT ATATCGGCCA GTGGGGTTCG GACTATTTCG ATCCGAATTC CAACGCCGAT ACCTTTACCG GCAATCCCGA CAATTCCGAT GCCGGCACGG TCAAGACGCT CGCATGGCGC AACACCTGGG AGGCGCCGGA GCTCGACAAG GAAGCCAAGG CAGCTCTTCT CGAACGTGAT GCCGCCAAAC GCGCCGCCAT ATATCAGGAC ATCCAGAAGA AATACCTGGC AAACAGCCCC TTCGTCTTTA TCTTCCAGCA GACCGAAGTG GCCGGTTACC GCAAGAACCT CAAGGACTTC AAGTTGGGTC CGAGCTTCGA TACCAATTTC GTCGGTCCGA TCGCCAAGGA ATAG
|
Protein sequence | MMITKLSRNF RMLSTGAALS LLMMTAPSAF AETPKDTLVE GFAIDDIITM DPGEAFELST AEITSNSYSL LVRLDMDDTS KVKGDLADSW SVSDDGLTYT FKLKSGMKFA SGNPITAEDV AWSFERAVKL DKSPAFILTQ FGLTGDNVSE KAKAADAGTF VFTVDKAYAP SFVLNCLTAT VASVVDKKLV MDHVKAVTPD AEHKYDNDFG NEWLKTGYAG SGAFKLREWR ANEVVVLERN DNYYGDKAKL NRVIYRYMKE SSAQRLALEA GDIDIARNLE PGDIDAVSKN ADLATTSAPK GTIYYVSLNN KNENLKKPEV QEAFKYLVDY DAIGATLIKG IGEIHQTFLP KGQLGALDEN PYKLDVAKAK ELLAKAGVPD GFSITMDVRN SQPVTGIAES MQQTLAQAGV KMEIIPGDGK QTLTKYRART HDMYIGQWGS DYFDPNSNAD TFTGNPDNSD AGTVKTLAWR NTWEAPELDK EAKAALLERD AAKRAAIYQD IQKKYLANSP FVFIFQQTEV AGYRKNLKDF KLGPSFDTNF VGPIAKE
|
| |