Gene Information |
Locus tag | Rleg_3996 |
Symbol | |
ID | 8014806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4073267 |
End bp | 4074877 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826565 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002977776 |
Protein GI | 241206680 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
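The coordinates and lengths in the table above can be cross-checked with a short calculation. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(4073267, 4074877)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1611 536
```

Both results agree with the Gene Length (1611 bp) and Protein Length (536 aa) rows.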
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.70207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0901295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCAG ACAATCGTTC CAGAAGGTTG CAGAGCCTGT TCATGCGCGG CGCCGCCGTG TTGCTGTTGT CGCTCATGGC GGCCGCACCC GTGATGCTCG CCGCGTCTCA AGCCGCCGCG CAGACGGAAA AACCCGTTAG CGGCGGCGCG ATGACCATTA TCAACGGTTC GGACATCAAG AGCTGGGATC CGGCGATCTC CGCCGGCACC TATCCCGGCG GGCCGATGGA TGTGCTCGAC GCCGTCTACG GCTTTATCGT CTACGTCAAC GACAAGGGCG TCGTGACCGG CGGCATGGCC GAAAGCCTGA CCAGCACAGA CGCCGTGACC TGGACCTTGA AGCTTCGCAA GGACATGAAG TTCACAGACG GGACGCCCTA TGATGCGGAA GCCGTCAAAT ACAATTGGGA CCGCGCCGCC GATTCGGCCA CGCTTTCACC GGCGCAGCCG TTCATATCCT CATGGAACAA GGCGATCACC GTCGTCGATC CGCAGACGCT GACGATCAAG CTGTCCTCCC CGAACGCCAA TTTCGCAGCC CAGGTCGCCG AGCTGTGCCC CTTCATCGCC TCGCCGGCAG CATTGAAGGC CGCTAAGGAA AAGACCGACA TCAAGCCGGT GGGCGCCGGC GCCTTTACGC TGACCGAATG GAACCAGGGC ATATCGATGA CCATGGCCCG CAATCCCGGT TATTGGGATC AGCCGCGCCC CTATCTGGAG ACGATCAAGT TCGCGATCAT TCCCGAAACC AACAGCCGCA TCGCCACCGT CGTGCAGGGT GGTGCAACGA TGATGGCCGG TTATCCCTAT CAGTTCGGCT CGAACGCAAC GGCGCCGGGG GTTGCGACCC GCGAGATCCC GATCCGCGGC ATCAACCGCG CCTATCTCAA CCAGGCCAAG GGTATCTTCA CGGATGTTCG CGCCCGCGAG GCCTTCTATT CCGCCATCGA CCGCGCGCGG CTGATGCAGG CCTTCACGCA GATGCCCGGA TACAAGGCAC CCAGCAATTA CTTTGGAGAG AATTCGCCCT ATTTCGACAG CGCTTCATCT CTTCCGGCCT ATGATCCGAA GAAGGCGCAG GAACTGTTCG ACGCTCTGAA GGCGGACGGC AAGCCGTTCT CGATCAAGAT CGTCACCTAT ACCAACTCGG ACCTGAAGCG TCTTGCCGCC TACATCCAGC AGGTGCTGAC CGGCTATGAA GGCGCCTCGG CTGAGATCGT CGAGGTCGAC CAGGCCTCGC TGATCCAGCG ATGCAAGACG CAGCTCGACT TCGACATCTG CGTCGAAGGC GGGGTGCTGG TGTCGAACGG CGCCGAGCCG AACATTTCCA ATCTGCTGAG TTCCGGCGGT GCTTTCAACT GGGGACAGTA CAAGAGCGCC GAGATGGACG CTGCATTGAA GGAGGCCAGC TCCACCCTCG ACCCTGCCGC TGTCAAAGCC GCCTATGTCA AGGTTCAGAA GCTCGTCGCA ACCGAAATGC CGCTTTACAT CTTCGGTGAA CAGACGCGCT CGCTGCTGCT GCGCGACAAT ACCGGCGGCG TCGTTCCTTC GAACGGCGGC ATCCTGCAAA AGCAGTTCCT CTACGTCTGC ACGGATGTCT GCCAGAAATA G
|
Protein sequence | MLSDNRSRRL QSLFMRGAAV LLLSLMAAAP VMLAASQAAA QTEKPVSGGA MTIINGSDIK SWDPAISAGT YPGGPMDVLD AVYGFIVYVN DKGVVTGGMA ESLTSTDAVT WTLKLRKDMK FTDGTPYDAE AVKYNWDRAA DSATLSPAQP FISSWNKAIT VVDPQTLTIK LSSPNANFAA QVAELCPFIA SPAALKAAKE KTDIKPVGAG AFTLTEWNQG ISMTMARNPG YWDQPRPYLE TIKFAIIPET NSRIATVVQG GATMMAGYPY QFGSNATAPG VATREIPIRG INRAYLNQAK GIFTDVRARE AFYSAIDRAR LMQAFTQMPG YKAPSNYFGE NSPYFDSASS LPAYDPKKAQ ELFDALKADG KPFSIKIVTY TNSDLKRLAA YIQQVLTGYE GASAEIVEVD QASLIQRCKT QLDFDICVEG GVLVSNGAEP NISNLLSSGG AFNWGQYKSA EMDAALKEAS STLDPAAVKA AYVKVQKLVA TEMPLYIFGE QTRSLLLRDN TGGVVPSNGG ILQKQFLYVC TDVCQK
|
| |
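The gene and protein sequences above are related by translation table 11, and the GC content row can be recomputed from the gene sequence. The following is a minimal, standard-library Python sketch of both steps. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard genetic code, whose codon assignments table 11 shares (table 11 differs in which codons may serve as start codons). The sketch does not handle alternative start codons or ambiguous bases; this record begins with ATG, so that is not an issue here.

```python
from itertools import product

# Codon assignments of the standard genetic code, in NCBI order (T, C, A, G).
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace between the 10-character groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGTTGTCAG ACAATCGTTC CAGAAGGTTG"
print(translate(prefix))  # prints: MLSDNRSRRL
```

The printed residues match the first group of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence rows to be checked in the same way.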