Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5068 |
Symbol | |
ID | 8007661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 449589 |
End bp | 450518 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644821983 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_002973243 |
Protein GI | 241113408 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.449495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGG CCACGCTTGC ATCCATCGCC TTGGCGATAT CGATCTCTGC TGCCCATGCC CAGACGATCG GGGTTTCGAT GTCCGGTCTC GACAAATTCA GGACGGCGCT TCTAAACGGC GTTGTCTCGC ATGGGCAGAC GATATCCGGC CTCAAACTCG TCACCGAGAA TGCCAATGGC GACAAGGAGC TCCAGAAGCA GCAGGTGCAG AAGCTCATTG CCGACAAGGT CGACGCGATC ATCCTTGCCG TCTCCGATGG CGACCTCGGG CCGCAAATGA CCAAGATGGC GGCAGATGCC GGCATTCCGC TGGTGTACAT CAACAACGTT CCTTCCAACC TGCTGGACCT GCCTGACAAT CAGGTGGTGG TCGCCTCCAA CGAGAAGGAA TCCGGAACGC TGGAGACCAA GCAGGTCTGC GCGCTCCTTA AAGGCAAAGG CCGCGTCGTC GTGCTGATGG GCGAACCATT CCACGCCGCC GCGCGTGCCC GCACCCAGGA TATATCAGAC GTCATTGCCA CCCCGGATTG CAGGGGCCTT CAGATCGTCG AGCGGCAGGC GGCCTATTGG TCGAGCGATT ATGCCGACCA GCAGATGCAG GAATGGCTGT CGGCCGGCGT CAAGTTCGAC GCGGTCATCG CCAACAATGA CGAGATGGCG CTCGGCGCGA TCCGGGCCAT GAAGAAGGCC GGCATACCGA TGAAAAATGT CGTCGTCGCC GGCGTCGACG CGACCGACGA CGCGCTCGCA GCGATGGTCG CCGGCGATCT CGACGTGACC ATTCTCCAGA GCGCCGTCGG GCAGGGCGCT GCCGCTGTCG ACGCTGCCGT CAAGCTGATC CGCAAAGAGA AGGTGCCGCG CGAAAACAAC GTTCCCTTCG AACTCGTCAC ACCTGAGAAC ATTGCCACCT ATCTGCCGAA GAGCCAGTGA
|
Protein sequence | MKTATLASIA LAISISAAHA QTIGVSMSGL DKFRTALLNG VVSHGQTISG LKLVTENANG DKELQKQQVQ KLIADKVDAI ILAVSDGDLG PQMTKMAADA GIPLVYINNV PSNLLDLPDN QVVVASNEKE SGTLETKQVC ALLKGKGRVV VLMGEPFHAA ARARTQDISD VIATPDCRGL QIVERQAAYW SSDYADQQMQ EWLSAGVKFD AVIANNDEMA LGAIRAMKKA GIPMKNVVVA GVDATDDALA AMVAGDLDVT ILQSAVGQGA AAVDAAVKLI RKEKVPRENN VPFELVTPEN IATYLPKSQ
|
| |