Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4121 |
Symbol | |
ID | 8014918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4202839 |
End bp | 4205319 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644826691 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002977901 |
Protein GI | 241206805 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGGGC GTTTGGCGAG CATCGGTGCC GAGGAAAGTC TACTTTCCGA AGGCTGGAAC CTGATCCTGA CGGAGCCGGG CGCCTGCGCC GTGCCGCACG ACATCCCGCT CTCCGCACAA TTCATTCCAG CACCCGTTCC CGGCACCGTC GCCGCCGCGC TCGAAAAAGC CGGGCTCTTC GACCGGGAAA ATCCCGAGCC GCTGAACACG CGGGATGCCT GGTATCTCTG CCGGCTTTTC GATGCCGAGC CCGGCGACGC GATCCTGCGT TTCGCAGGGC TGGCAACGCT CTGCCATGTC TTCCTGAACG GCCAGGAAAT CCTGTTTTCC GAGAGCATGT TCACGGCGCA CGAGATTCCG GTGACGCTTT CGGGCGGCGA CGAACTGGCG CTGTGCTTCC GGGCGCTTGG GCCCCGCCTG TCGGAGCCAG GCCCGCGCGC GCGCTGGCGG CCGCAGATGA TCACGCCGGC GGGCTTGAAG AATTTCCGCA CGACGCTGCT CGGCCATATG CCGGGCTGGT GCCCCGATAT CCATGCCGTC GGGCCATGGC GGCCGATTTC GATGGTGCGG CGTCATCCCG TCTCGATCGA CAATGTCTCC ATCCGCGCCG TATTGGAGGA GAGTGGCGTC GGCCGTCTCA GCGTGTCCCT GCATAGCAAT GCCGAGGATC CGGCGATGCT GCTGCGCTGC GGCGGGATGG AGCAGCCCTT CGAGAAGGTC GGCGACAGTC ATTACTCGGC TATCCTCAAG CTTTCCGACA TCGAGGCCTG GTGGCCGCAT ACGCATGGTG CTCCGCGTCT CTACGCGCTG ACACTGGTTT CGGACGGGGT GGAATATCCG CTCGGCAGGA CCGGCTTTCG ACGTATCGAC GTCGACCGTG GTGCCGATGG CGACGACTTC GCGCTTCTCG TCAACGGCGA ACGCATCTTC TGCCGTGGTG CCGTGTGGAC GACGGCCGAT ATCGTGCGAT TGCCGGGCGG GCGGGCGGAT TATGAGCCGC TCCTGCGGCT TGCCGCTGCA GGCGGCATGA ACATGATCCG CATCGGCGGC ACCATGGCCT ACGAAACGCC TGATTTCTTC GCACTCTGCG ACGAGCTCGG TCTGCTCGTC TGGCAGGACT TCATGTTCGC CAATTTTGAT TATCCACGCA ACGACAAGGC TTTTCTTGGT CACGTGCATG CGGAGGTCGA GGAATTCCTC CACGGCGTGC AGGCGTCGCC TTCGTTGGCC GCGCTCTGCG GCGGCAGCGA AGTCCACCAG CAGGCGGCAA TGCTCGGCCT GCCGGCGGAA TTCTGGAGCG GGCCGGTCAC CGATGAAATC ATCCCGGCGG TCGTCGCGCG CATGCGCCCC GATGTGCCCT ATGTGCCGAA CTCGCCCTAT GGCGGAGCGA TGCCCTTTTC GCCGAATGCC GGTATTGCCC ATTATTACGG CGTCGGCGCC TATATGCGGC CAATTGCCGA TGCGCGCCGC GCCGATGTGC GTTTTGCCTC CGAAAGCCTC GCCTTCGCGC ATGTGCCGCA GCAAAGGACG CTGCAGCGTC ATCTCGATGT GCCCGCCGTC CACAGTCCGC TGTGGAAGGC CCGCGTGCCC CGCGACCGCA GCGCATCGTG GGATTTCGAG GATGTTCGCG ATTTCTACCT GCAGCTTCTC TACGGTTTCG ATCCGGCTGA GCTGCGCCGC GAAGATCCGG AACGCTATCT CGATCTCTCC CGCGCCGTTA CCGGCGAGGT GATCGAGGAG ACTTTTGCCG AATGGCGGCG CAAGGGCTCC GCCTGCAACG GCGCACTCGT CTGGACGCTG CAGGACCTGT TGCCCGGTCC CGGCTGGGGA GTGATCGATT CCACCGGAGA GCCGAAGCCT GTCTGGTATG CGATGCGCCG TGCATTCCGG CCGGTGCAAG TGGTCTTCAC CGACGAGGGA ACGAACGGTC TCGACGTGCA TGTCGTCAAC GAGACGGATG CCGCGCTTGA CGTGGAGCTC GAGGTCGTCT GCCTGCGCGG CGGAAAACAG CAGGTCGTCA GCGGCAGCAG GGCCTTCAAG CTGGCGGCAA GAGACACGGA GCGTCTTGCC TGCACCGCGC TGTTCGGCGC CTTCTTCGAT ACGACCTATG CCTTCCGTTT CGGGCCGCCG GCGCATGATG CCAGTGTGGC GCGCCTGCGC TCGCTGGCAG ACGGCGCCAT TCTCGCAGAG AGCTTCCACT TCCCGTGCGG ACGGGGGAAG GCGCTGCATG ACGCAGGGAT CGAAGCATCA TTCACCAGAG ACGGCGACGA CTGGTTCGTC GATCTCAGGA CCGACCGGCT GGCGCAATCG GTGCATATCG ACGTCGAAGG CTATCGGGCC GACGACGACT GGTTCCACCT TGCCCCCGGC GCGATGCGGC GTGTGCAGCT CCACGCGCTG TCCGGCGTGG AAAGCGATAC TCCGCCTGCG GGCGAGATCA GAAGTCTAGG CAGTTCGCAT CGTGTCGCAA TCGAGGGCTG A
|
Protein sequence | MRGRLASIGA EESLLSEGWN LILTEPGACA VPHDIPLSAQ FIPAPVPGTV AAALEKAGLF DRENPEPLNT RDAWYLCRLF DAEPGDAILR FAGLATLCHV FLNGQEILFS ESMFTAHEIP VTLSGGDELA LCFRALGPRL SEPGPRARWR PQMITPAGLK NFRTTLLGHM PGWCPDIHAV GPWRPISMVR RHPVSIDNVS IRAVLEESGV GRLSVSLHSN AEDPAMLLRC GGMEQPFEKV GDSHYSAILK LSDIEAWWPH THGAPRLYAL TLVSDGVEYP LGRTGFRRID VDRGADGDDF ALLVNGERIF CRGAVWTTAD IVRLPGGRAD YEPLLRLAAA GGMNMIRIGG TMAYETPDFF ALCDELGLLV WQDFMFANFD YPRNDKAFLG HVHAEVEEFL HGVQASPSLA ALCGGSEVHQ QAAMLGLPAE FWSGPVTDEI IPAVVARMRP DVPYVPNSPY GGAMPFSPNA GIAHYYGVGA YMRPIADARR ADVRFASESL AFAHVPQQRT LQRHLDVPAV HSPLWKARVP RDRSASWDFE DVRDFYLQLL YGFDPAELRR EDPERYLDLS RAVTGEVIEE TFAEWRRKGS ACNGALVWTL QDLLPGPGWG VIDSTGEPKP VWYAMRRAFR PVQVVFTDEG TNGLDVHVVN ETDAALDVEL EVVCLRGGKQ QVVSGSRAFK LAARDTERLA CTALFGAFFD TTYAFRFGPP AHDASVARLR SLADGAILAE SFHFPCGRGK ALHDAGIEAS FTRDGDDWFV DLRTDRLAQS VHIDVEGYRA DDDWFHLAPG AMRRVQLHAL SGVESDTPPA GEIRSLGSSH RVAIEG
|
| |