Gene Information |
Locus tag | Rleg2_4796 |
Symbol | |
ID | 6977890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 433456 |
End bp | 435054 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393959 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002278777 |
Protein GI | 209546859 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
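The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon. The record does not state either convention, but its numbers agree with both. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length counts the stop codon, which encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(433456, 435054)   # 1599
length_aa = protein_length(length_bp)     # 532
print(length_bp, length_aa)
```

With the values from this record, the result is 1599 bp and 532 aa, matching the Gene Length and Protein Length fields.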
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.478428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAC TGTCTAGATT ATCCGCTATC GCGCTTGGTG CCCTGCTGTC GACGGCCGCC GTTCCGGCTC TTGTCGTTTC GGGCGCGGCA ATCCAGGCTC AGGCAGCTAC GCTGTCCGGC GGCTTCGATG TCGGCCCCGG AGGCTTCCAG GGCAACTTCA ATCCGCTGGC GGCGACCGCC GGCTTCACCT GGCTCAGCAT CTACTACGAA CCGCTGATCA CTTATGACGA GAAGCTGCAG AAGGTCGTCG GCGCGCTGGC GGACGCCTAC GAGGTCAGTC CGGATCAGAT GACCTACACA TTCAAGCTCG CCGATGCCAA ATGGCATGAC GGCAAGCCCT TCACCGCCAA GGATGCAAAA TTCACCGTCG GCCTTGCGAT AGATGCAAAA ACCGGCTCGG TGCTCGCTGC CCGGCTGAAG GGCATATCAT CCGTCGAGAC GCCGGACGAT CACACCGTCG TCATCAAGCT CAGCGCACCC AGCAGCAGTT TCCTCGACAC GATGACCAAG GTGATGATGC TGCCCGAGCA TGCTCTCGCC TCGATACCGG CCGACCAGCT GGCAAAGAAC ACGTGGTGGT CCACCGCGCC GATCGGCACC GGCCCGTTCA AATTCACCAA ATACGTCTCC GATCAGTATG TCGAGCTTGC CGCAAACACG GACTATCGCG GCGGCAAACC GGCCCTGGAG CGCGTCATCA ACCGCTATTT CGCCAACCCG GCCGCAGCGA TCGCGGCGCT GAGATCCGGC GAAATCCAGT TCACCTATGT CGATTCCAAC GACGTGCCGA CCTTCAAGGA CAACAAGGAT TTCAAGGTCA TCGAAGGCAA CTCTTTCGTC GTCAACTATC TGGGCTTCAA CCATGATTCC CCGATCTGGA AGGATGTGCG CGTCCGCCAG GCGGTGATGT ATGCGATCAA TCGCGATACC ATCATCCAAA GCCTCTATGG CGGCGCGGCC AAACCGGCCA ACTGCGCCTA TGTCGCCGAA CAGCTGATAC CCCAGGGCAT CGACACCTAC GCCTACGATC CCGAAAAGGC CAAGAAACTG CTCAAGGAAG CCGGCTGGGA TCAGATCAAC GGCGGCAAGC CGATCACGCT TCTGACCTAT TACACCACGC CGCTTGCCAC CAACGTCCTT GCCGCCGTCC AGGCGATGCT TGCGGAGGTC GGCATCAACA TCGTGCCGCG CGCCGTCGAT GCGCCGACCT ATAACAGCAT CGTGCTGAAT GCGACGCCGG ATATCGCCCA GTTCCAGATG GTGTACGCCG GGCTGCAAAA CGGGCCGGAC GCCGGAAGCA TCAATGTCGG CCTCAACGAG AAGCAGATCC CTCCGGCCGG GCCGAACGTC GCCAGAGTTC GCATGCCCGA TCTCACCAAG GCACTCGATA GCGCGCTTGC CGAGCCCGAC AGCGCCAAGC GGGATGCGGC CTACCAGAAT GTCTGCAAGG TGATGAACAC GAACCTGCCC TGGGCGACGC TTTGGGTGGC GAACCGTTAC GGCATCGTCT CGACCAAAGC GAAGGATTTC GTCTGGACGC CGGCGCCGGG TGGCGGCCCC TACCAGGCCG CCCCGCAGAA ATGGTCGCTC GCCGAATAG
|
Protein sequence | MKRLSRLSAI ALGALLSTAA VPALVVSGAA IQAQAATLSG GFDVGPGGFQ GNFNPLAATA GFTWLSIYYE PLITYDEKLQ KVVGALADAY EVSPDQMTYT FKLADAKWHD GKPFTAKDAK FTVGLAIDAK TGSVLAARLK GISSVETPDD HTVVIKLSAP SSSFLDTMTK VMMLPEHALA SIPADQLAKN TWWSTAPIGT GPFKFTKYVS DQYVELAANT DYRGGKPALE RVINRYFANP AAAIAALRSG EIQFTYVDSN DVPTFKDNKD FKVIEGNSFV VNYLGFNHDS PIWKDVRVRQ AVMYAINRDT IIQSLYGGAA KPANCAYVAE QLIPQGIDTY AYDPEKAKKL LKEAGWDQIN GGKPITLLTY YTTPLATNVL AAVQAMLAEV GINIVPRAVD APTYNSIVLN ATPDIAQFQM VYAGLQNGPD AGSINVGLNE KQIPPAGPNV ARVRMPDLTK ALDSALAEPD SAKRDAAYQN VCKVMNTNLP WATLWVANRY GIVSTKAKDF VWTPAPGGGP YQAAPQKWSL AE
|
| |
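The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, how a GC fraction such as the one in the GC content field could be computed, and how the reading frame could be sanity-checked. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_in_frame_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the record (not the full gene).
toy = clean_sequence("GGCCAATTAT GTAG")
print(len(toy), gc_content(toy), ends_in_frame_with_stop(toy))
```

To apply this to the record, paste the full Gene sequence value into `clean_sequence` and compare `len(...)` and `gc_content(...)` against the Gene Length and GC content fields.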