Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4785 |
Symbol | |
ID | 6977879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 419766 |
End bp | 422792 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643393948 |
Product | glycoside hydrolase family 38 |
Protein accession | YP_002278766 |
Protein GI | 209546848 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.470761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCTCA CCATTGCCCA ACGCCTCGAC CGCCTGAAAG TCCGTATCAC CGAGCTGGCG CATTGGCGGG ATCGGCAGAG TAGCCCGATC GACGGCTGGA CATTCGAGGG TGAGCCGATC GGCCATCAGC AGGACTGGCC GCATCGCCAA GGCGTCGTGC ATTTTGCCGT AAGTGCCGCA GCGCCGGAGG GCTGGCCCCT CGAAGATATC CGTCTGCAGC TCGATCTCGG CGGCGAGAGC CTGATCACTC TGAGCTATCC CGATGGCAAA AGCGAAACAT TCGGTCTCGA TCCCTATCAT CAGGAATTCC TGGTGAAGGG CCGCCGTTTC TCGATTGCGA CGGAAAGCGT CGCGCGCTTT CCGTTCGGCG AGCCGAACCG GGCCCCGCGG CTGAACAAGG CCCGGTTCAT CTGGCTCGAC GGCCCGGCTC ATCGCCTGCA TCTTCTGTTG AAGCAGGTTT CCGAGGCGAT CGATGTGCTC GGCGAGCATG AAGTCGTGCC GCATCTGATG GATGCCGCCG AACAGGCCCT GCGCAGCCTC GACTGGCCGT CGGATACGGC GGCTTACATC TCCCGGACCG CCGATGCCGT GATGCAGCAG AAAATCTGGG AGCTGCCTGA GCTTGCGGCA AACCCGGCAG GACTGAGCGA TGAGCAGAGC GCCTCGGCAG CCTCGGCCTT CGAGGTTTTG ACGGCGCGGC TGAAGGAGCT GCAGAAGCGC TTTCCGCCGA ATGGCGAGCT GCTGCTGACC GGCCATGCGC ATATCGACCT CGCCTGGCTT TGGCCCTATC GCGAGACGCG GCGGAAGATG CGGCGCACCT TCAATACGGC GCTGTCGCTG ATGGAGCGCT CCGACGATTT CCGCTTCAAC CAGTCGACCG CCCATTATTA CGCACAGATG GAAGAGGAAG ACCCGGAGCT TCTCGAACGC ATCAAGGAGA AGGTCGCGGA AGGAAAATGG GAAACCGTCG GCGGCATGTG GGTGGAGCCC GATACCAATA TGCCCACAGG CGAAAGCCTC GTCCGCCAGG TTCTCTACGG CCAGCGTTAT TTCGAAAAGA CCTTCGGCAC GCGCCATACG GTGTGCTGGC TTCCGGATTG CTTCGGTTTT TCCGGCGCGC TGCCGCAGAT CCTGAGACAG GGCGGCATCG ACAGCTTCTT CACCATCAAG GTCAACTGGA GCGAGACCAA CCACATCCCT TCCGATCTCT TCTGGTGGAA GGGCCTCGAC GGCAGCCAGG TGCTGACCCA CACTTTCGAC AATCCGATGC AGGGATATAA CGGCTTCGTG CAGGCCGATT GTTATGTGCC GACATGGAAG AATTTCCGCG GCAAGGTGCA GCACGATACT TCGCTGCTGG CGGTCGGCTA CGGCGACGGC GGCGGCGGCG TGACGCCCGA AATGGTGGAA CGCGAAGTGC AATTGCGCGA TTTCCCGGCC ATCCCGCAGG CGCGCTGGGG CACCGTCAAA AGCTTCTATG AGAAGGCGCA TCGCACCGCT AGGGAAAAGA ACCTTCCGGT CTGGGACGGC GAAATTTACC TCGAACTGCA CCGCGCGACG CTGACGAGCC AAAGCGGCGT CAAGCGCAAA CATCGCCAGG CCGAACGGGC GTTGATCACC GCAGAAACGC TCGCTTCACT TGCCCATATG CTGGGTGCCG ACAAGCCGAA AAGCCTCGAA GCGGATTGGC GCGTGGTGCT GAAGAACGAG TTTCACGATA TCCTGCCGGG ATCGAGCATC CGCGAGGTCT ATCAGGATGC GGAGCAGGAA CTCGGCGGCG TCATCGACAA TGCCAAGTCC GAGCAGGCAA AGGCCGTGCA GGCGCTCTCG GCCCATCTGC CGAAGGGTGG GGTCGGCGAT GCGCTCGTCG TCGTCAATCC GTCGCTTATG CCGCGGCCGG TGAGCGCCAC GCTTGCCGAC GGCACGGTCA TCTCGGCCGG CGATATCGTC GCTCCTCTGT CCGTGGCGGT CTTCGACAGG CAGTCCCTGC AGCCGGCCGG CGGGCTGAAA GCAAGTTCCG ATCGTCTCGA AAACGATCAT CTCGCCGTCG TCATCGGCAA GGACGGGGCG GTTGCGAGCC TCGTCCACAA GGCGAGCGGC CGCGAAGCGG TCGATGGTTC GGCCAACCAG CTCTGGGTCT ATCCGGCTGA CAAGCCGCGC AATTGGGACG CATGGGATAT CGATGCGGAT TATGCCGAAA AGGCCGTCCG TCTCGAGGCG CCGGAGAGCA TCTCGCTCGT CGAAAGCGGC CCGCACCGCG CGGCAATCCG CGTCATCCAT CGCTACCGGA ATTCGAGCGT CACCCAGACC TATGTGCTGA CCGCCAATTC CAAGCGGCTC GATATCGAAA CGACGATCGA CTGGCACGAC CGCCGCACCC TGCTGCGCAC CCTCAACCCG GTCGATGTAA AGGCCCGGAA GGCAACGTTC GAATGCGCCT TCGGCATCGT CGAGCGGGCC ACACACACGA ACACTTCCTG GGAGCAGGCA ATGTTCGAGG CCGTCGCCCA TCGCTTCGTC GATATCAGCG AGCCGGGTTT CGGCGTGGCC CTGCTCAACA ATGCCAAATA TGGTCACAGC GCCCGCGGCA ATGTGATCGG CATGAGCCTG GTGCGCGGGC CGATCTATCC GGATCCGCTG GCCGATGAAG GCGAGCAGAG CTTCACTTAT GCGCTGCTGC CGCATGACGG CGCCTGGCAT GAAGGCGGCG TGCTCGACGA GGCGCTCGAT CTCAACCAGC CGCTCGTCGC AGTGGAAGCC AAGGGCCTCT CCGCCGGCAC GTTCGCGCCG CTTGCGGTCA CCGGCACCCC CGTCGCCTTT TCCGGCCTGA AACCGGCGGA AGAGGGCGAC GGCCTCATTC TGCGCCTCTA CGAACCCGCC GGCCGGCGCG GCCGCCTCTC GCTTGCCCTG CCGCCCGGAT GGAAGGCGTC GCCGCCGCTC AATATTCTCG AAGAGCCGAT GGAGCGGAAG GGGCCGGCCG AGATCATGCC GTTTGAGATC AGGACGTGGC GGCTGCAAAA GGCCTGA
|
Protein sequence | MPLTIAQRLD RLKVRITELA HWRDRQSSPI DGWTFEGEPI GHQQDWPHRQ GVVHFAVSAA APEGWPLEDI RLQLDLGGES LITLSYPDGK SETFGLDPYH QEFLVKGRRF SIATESVARF PFGEPNRAPR LNKARFIWLD GPAHRLHLLL KQVSEAIDVL GEHEVVPHLM DAAEQALRSL DWPSDTAAYI SRTADAVMQQ KIWELPELAA NPAGLSDEQS ASAASAFEVL TARLKELQKR FPPNGELLLT GHAHIDLAWL WPYRETRRKM RRTFNTALSL MERSDDFRFN QSTAHYYAQM EEEDPELLER IKEKVAEGKW ETVGGMWVEP DTNMPTGESL VRQVLYGQRY FEKTFGTRHT VCWLPDCFGF SGALPQILRQ GGIDSFFTIK VNWSETNHIP SDLFWWKGLD GSQVLTHTFD NPMQGYNGFV QADCYVPTWK NFRGKVQHDT SLLAVGYGDG GGGVTPEMVE REVQLRDFPA IPQARWGTVK SFYEKAHRTA REKNLPVWDG EIYLELHRAT LTSQSGVKRK HRQAERALIT AETLASLAHM LGADKPKSLE ADWRVVLKNE FHDILPGSSI REVYQDAEQE LGGVIDNAKS EQAKAVQALS AHLPKGGVGD ALVVVNPSLM PRPVSATLAD GTVISAGDIV APLSVAVFDR QSLQPAGGLK ASSDRLENDH LAVVIGKDGA VASLVHKASG REAVDGSANQ LWVYPADKPR NWDAWDIDAD YAEKAVRLEA PESISLVESG PHRAAIRVIH RYRNSSVTQT YVLTANSKRL DIETTIDWHD RRTLLRTLNP VDVKARKATF ECAFGIVERA THTNTSWEQA MFEAVAHRFV DISEPGFGVA LLNNAKYGHS ARGNVIGMSL VRGPIYPDPL ADEGEQSFTY ALLPHDGAWH EGGVLDEALD LNQPLVAVEA KGLSAGTFAP LAVTGTPVAF SGLKPAEEGD GLILRLYEPA GRRGRLSLAL PPGWKASPPL NILEEPMERK GPAEIMPFEI RTWRLQKA
|
| |