Gene Information |
Locus tag | Rleg2_6279 |
Symbol | |
ID | 6983352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011370 |
Strand | + |
Start bp | 229203 |
End bp | 230741 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643399287 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002284043 |
Protein GI | 209552127 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
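
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the values listed here) and that the reported protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 229203, End bp 230741.
length = gene_length_bp(229203, 230741)
print(length)                     # 1539, matching "Gene Length"
print(protein_length_aa(length))  # 512, matching "Protein Length"
```

Because the gene is listed as unspliced and not a pseudogene, the simple "codons minus one stop" relation is expected to hold; it would not necessarily hold for spliced or frameshifted features.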
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.334528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTCT CTCGTAGAAC TTTCTTGCAG GGAACGACCG CCGTTGCCTT GGCTTCGTCG TTCGGGGTTA AGGCGATCGC CGCTCCCAAG CGTGGGGGAC ATTTGCGTGT CGCAGTCGCG ACCGGCTCGA CGACCGATAG CCTCGATCCA ACTAGCACTC CCGTAACGTG GGGGTTCATC AATCTTGCCA CCTCGCGTAA CACGCTGGTC GGCACCGATC ATGCCGGCAC CCTCATTCCA AAACTGGCGG AAAGCTGGGA GCCGTCGAGC GATCTCAAGA CCTGGGTGCT CAATCTCAGG AAAGGAGTGA CCTTCCACAG CGGAAAGACG ATGACAGCAG ACGACGTTGT CAATTCGTTG AACCTTCACA GGGGCGAAAA TACGGTCTCG CCCGCCAAAT CGCTCTTGAG TCCCGTCAGC AATGTTAAAG CGGACGGTGC CAACAGGGTG GTGATATCGC TGGAAAGTCC GAATGTGAAC TTTGTGAACC TTCTGAGGGA TGACTTCCTG GTTGTTGCGC CTTCCAAGGA CGGCGAGGTG GATCGTACTT CGACGGACGG CACCGGACCC TACGTGCTGG AGAGCTGGGA AGCTGGGCGA AGCCTCCGGT ATAAGCGCTA CGAGAATTTT TGGGACCTGA ACAATTACGG CTTCTTCGAC TCCGCCGAGG TGGTGGTTAT CCAGGACAAC GCAGCCCGGA TGAACGCACT TCGTTCCGGA CAGGTGGATC TCGTGAATTC AGTCGACCTC AAGACCGTCC CGATGCTGAA GCGCGTTCCG AAAATACGGG TTGAGGATAC CCCGAGCGGG ATGTACTACG GCCTGCCCAT GCTTACCGAC GTAGCGCCAT TCAACGACAA CAATGTTCGC CTAGCGCTTA AGTACGCGTT CAACCGGCAG GAAGCCGTCA ACAAGGTCCT CCTCGGGCAC GGCACGGCGG GCAACGACCA TCCGATTTTC GCGAATGACA AATTCAACGA CCCCACGATT CCGCAGCGCG AGTATGATCC AGACAAGGTT CGGTTCTACC TCGACAAAGC CGGTCTCCAA TCGCTTGAAA TCCCTCTGAA CGTGGCAGAG GCGGGTTTCC CAGGAGCCGT CGACACCGCT CAGCTTTACG CATCCTCGGC AGCAGCGGCA GGGATCAAGA TCAACGTAAC ACGCGAACCC GACGACGACT ACTATGAGCG CGTATGGCTC AAGAAGCCAT TCTGCGCAGC TTATTGGAAC CAAGCCATCA CCAACGACGC GCGGTTTACC GAAGCCTTCC TGCCGGACGC TCCGTGGAAT GAAACGCACT ATAACAATCC ACGCGTCACG GAGTTGGTCG TGAAGGCCAG GTCCACGCTG GACGAAGGGG CGCGCGCAAG CATCTATCAC GAGTTGCAGC GTATCATCCA TGACGACGGC GGCCTTCTCA ACCCGATGTT CGTGAATTAT GTCTGGGCAA TGAAAGACAA CGTGAAGAGG CCAGAGAAGG TGTCCACCTT GGGCGACCTC GACGGCTACG AGTGCATCGC CCGTTGGTGG ATGGAGTAG
|
Protein sequence | MTFSRRTFLQ GTTAVALASS FGVKAIAAPK RGGHLRVAVA TGSTTDSLDP TSTPVTWGFI NLATSRNTLV GTDHAGTLIP KLAESWEPSS DLKTWVLNLR KGVTFHSGKT MTADDVVNSL NLHRGENTVS PAKSLLSPVS NVKADGANRV VISLESPNVN FVNLLRDDFL VVAPSKDGEV DRTSTDGTGP YVLESWEAGR SLRYKRYENF WDLNNYGFFD SAEVVVIQDN AARMNALRSG QVDLVNSVDL KTVPMLKRVP KIRVEDTPSG MYYGLPMLTD VAPFNDNNVR LALKYAFNRQ EAVNKVLLGH GTAGNDHPIF ANDKFNDPTI PQREYDPDKV RFYLDKAGLQ SLEIPLNVAE AGFPGAVDTA QLYASSAAAA GIKINVTREP DDDYYERVWL KKPFCAAYWN QAITNDARFT EAFLPDAPWN ETHYNNPRVT ELVVKARSTL DEGARASIYH ELQRIIHDDG GLLNPMFVNY VWAMKDNVKR PEKVSTLGDL DGYECIARWW ME
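
The sequences above are printed in space-separated blocks of ten characters. The following sketch, which assumes the spaces are display formatting only, shows one way the gene sequence could be normalized before checking GC content or the terminal stop codon. The helper names are hypothetical, and the example uses only the first two and last two blocks of the gene sequence rather than the full record. The stop codons checked (TAA, TAG, TGA) are those of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence (whitespace is ignored)."""
    clean = normalize(seq)
    if not clean:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in clean) / len(clean)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return normalize(seq)[-3:] in STOP_CODONS


head = "ATGACTTTCT CTCGTAGAAC"  # first two blocks of the gene sequence
tail = "CCGTTGGTGG ATGGAGTAG"   # last two blocks of the gene sequence

print(len(normalize(head)))  # 20
print(gc_fraction(head))     # 0.4 for this 20-base fragment only
print(ends_with_stop(tail))  # True
```

Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field, although the exact rounding used by the source database is not stated in the record.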