Gene Rleg2_4785 details


Gene Information

Locus tag: Rleg2_4785
Symbol:
ID: 6977879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 419766
End bp: 422792
Gene Length: 3027 bp
Protein Length: 1008 aa
Translation table: 11
GC content: 63%
IMG OID: 643393948
Product: glycoside hydrolase family 38
Protein accession: YP_002278766
Protein GI: 209546848
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0383] Alpha-mannosidase
TIGRFAM ID:
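
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields.
length = gene_length_bp(419766, 422792)
print(length, protein_length_aa(length))
```

Under those assumptions the result agrees with the reported Gene Length (3027 bp) and Protein Length (1008 aa).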


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.470761
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCTCA CCATTGCCCA ACGCCTCGAC CGCCTGAAAG TCCGTATCAC CGAGCTGGCG 
CATTGGCGGG ATCGGCAGAG TAGCCCGATC GACGGCTGGA CATTCGAGGG TGAGCCGATC
GGCCATCAGC AGGACTGGCC GCATCGCCAA GGCGTCGTGC ATTTTGCCGT AAGTGCCGCA
GCGCCGGAGG GCTGGCCCCT CGAAGATATC CGTCTGCAGC TCGATCTCGG CGGCGAGAGC
CTGATCACTC TGAGCTATCC CGATGGCAAA AGCGAAACAT TCGGTCTCGA TCCCTATCAT
CAGGAATTCC TGGTGAAGGG CCGCCGTTTC TCGATTGCGA CGGAAAGCGT CGCGCGCTTT
CCGTTCGGCG AGCCGAACCG GGCCCCGCGG CTGAACAAGG CCCGGTTCAT CTGGCTCGAC
GGCCCGGCTC ATCGCCTGCA TCTTCTGTTG AAGCAGGTTT CCGAGGCGAT CGATGTGCTC
GGCGAGCATG AAGTCGTGCC GCATCTGATG GATGCCGCCG AACAGGCCCT GCGCAGCCTC
GACTGGCCGT CGGATACGGC GGCTTACATC TCCCGGACCG CCGATGCCGT GATGCAGCAG
AAAATCTGGG AGCTGCCTGA GCTTGCGGCA AACCCGGCAG GACTGAGCGA TGAGCAGAGC
GCCTCGGCAG CCTCGGCCTT CGAGGTTTTG ACGGCGCGGC TGAAGGAGCT GCAGAAGCGC
TTTCCGCCGA ATGGCGAGCT GCTGCTGACC GGCCATGCGC ATATCGACCT CGCCTGGCTT
TGGCCCTATC GCGAGACGCG GCGGAAGATG CGGCGCACCT TCAATACGGC GCTGTCGCTG
ATGGAGCGCT CCGACGATTT CCGCTTCAAC CAGTCGACCG CCCATTATTA CGCACAGATG
GAAGAGGAAG ACCCGGAGCT TCTCGAACGC ATCAAGGAGA AGGTCGCGGA AGGAAAATGG
GAAACCGTCG GCGGCATGTG GGTGGAGCCC GATACCAATA TGCCCACAGG CGAAAGCCTC
GTCCGCCAGG TTCTCTACGG CCAGCGTTAT TTCGAAAAGA CCTTCGGCAC GCGCCATACG
GTGTGCTGGC TTCCGGATTG CTTCGGTTTT TCCGGCGCGC TGCCGCAGAT CCTGAGACAG
GGCGGCATCG ACAGCTTCTT CACCATCAAG GTCAACTGGA GCGAGACCAA CCACATCCCT
TCCGATCTCT TCTGGTGGAA GGGCCTCGAC GGCAGCCAGG TGCTGACCCA CACTTTCGAC
AATCCGATGC AGGGATATAA CGGCTTCGTG CAGGCCGATT GTTATGTGCC GACATGGAAG
AATTTCCGCG GCAAGGTGCA GCACGATACT TCGCTGCTGG CGGTCGGCTA CGGCGACGGC
GGCGGCGGCG TGACGCCCGA AATGGTGGAA CGCGAAGTGC AATTGCGCGA TTTCCCGGCC
ATCCCGCAGG CGCGCTGGGG CACCGTCAAA AGCTTCTATG AGAAGGCGCA TCGCACCGCT
AGGGAAAAGA ACCTTCCGGT CTGGGACGGC GAAATTTACC TCGAACTGCA CCGCGCGACG
CTGACGAGCC AAAGCGGCGT CAAGCGCAAA CATCGCCAGG CCGAACGGGC GTTGATCACC
GCAGAAACGC TCGCTTCACT TGCCCATATG CTGGGTGCCG ACAAGCCGAA AAGCCTCGAA
GCGGATTGGC GCGTGGTGCT GAAGAACGAG TTTCACGATA TCCTGCCGGG ATCGAGCATC
CGCGAGGTCT ATCAGGATGC GGAGCAGGAA CTCGGCGGCG TCATCGACAA TGCCAAGTCC
GAGCAGGCAA AGGCCGTGCA GGCGCTCTCG GCCCATCTGC CGAAGGGTGG GGTCGGCGAT
GCGCTCGTCG TCGTCAATCC GTCGCTTATG CCGCGGCCGG TGAGCGCCAC GCTTGCCGAC
GGCACGGTCA TCTCGGCCGG CGATATCGTC GCTCCTCTGT CCGTGGCGGT CTTCGACAGG
CAGTCCCTGC AGCCGGCCGG CGGGCTGAAA GCAAGTTCCG ATCGTCTCGA AAACGATCAT
CTCGCCGTCG TCATCGGCAA GGACGGGGCG GTTGCGAGCC TCGTCCACAA GGCGAGCGGC
CGCGAAGCGG TCGATGGTTC GGCCAACCAG CTCTGGGTCT ATCCGGCTGA CAAGCCGCGC
AATTGGGACG CATGGGATAT CGATGCGGAT TATGCCGAAA AGGCCGTCCG TCTCGAGGCG
CCGGAGAGCA TCTCGCTCGT CGAAAGCGGC CCGCACCGCG CGGCAATCCG CGTCATCCAT
CGCTACCGGA ATTCGAGCGT CACCCAGACC TATGTGCTGA CCGCCAATTC CAAGCGGCTC
GATATCGAAA CGACGATCGA CTGGCACGAC CGCCGCACCC TGCTGCGCAC CCTCAACCCG
GTCGATGTAA AGGCCCGGAA GGCAACGTTC GAATGCGCCT TCGGCATCGT CGAGCGGGCC
ACACACACGA ACACTTCCTG GGAGCAGGCA ATGTTCGAGG CCGTCGCCCA TCGCTTCGTC
GATATCAGCG AGCCGGGTTT CGGCGTGGCC CTGCTCAACA ATGCCAAATA TGGTCACAGC
GCCCGCGGCA ATGTGATCGG CATGAGCCTG GTGCGCGGGC CGATCTATCC GGATCCGCTG
GCCGATGAAG GCGAGCAGAG CTTCACTTAT GCGCTGCTGC CGCATGACGG CGCCTGGCAT
GAAGGCGGCG TGCTCGACGA GGCGCTCGAT CTCAACCAGC CGCTCGTCGC AGTGGAAGCC
AAGGGCCTCT CCGCCGGCAC GTTCGCGCCG CTTGCGGTCA CCGGCACCCC CGTCGCCTTT
TCCGGCCTGA AACCGGCGGA AGAGGGCGAC GGCCTCATTC TGCGCCTCTA CGAACCCGCC
GGCCGGCGCG GCCGCCTCTC GCTTGCCCTG CCGCCCGGAT GGAAGGCGTC GCCGCCGCTC
AATATTCTCG AAGAGCCGAT GGAGCGGAAG GGGCCGGCCG AGATCATGCC GTTTGAGATC
AGGACGTGGC GGCTGCAAAA GGCCTGA
 
Protein sequence
MPLTIAQRLD RLKVRITELA HWRDRQSSPI DGWTFEGEPI GHQQDWPHRQ GVVHFAVSAA 
APEGWPLEDI RLQLDLGGES LITLSYPDGK SETFGLDPYH QEFLVKGRRF SIATESVARF
PFGEPNRAPR LNKARFIWLD GPAHRLHLLL KQVSEAIDVL GEHEVVPHLM DAAEQALRSL
DWPSDTAAYI SRTADAVMQQ KIWELPELAA NPAGLSDEQS ASAASAFEVL TARLKELQKR
FPPNGELLLT GHAHIDLAWL WPYRETRRKM RRTFNTALSL MERSDDFRFN QSTAHYYAQM
EEEDPELLER IKEKVAEGKW ETVGGMWVEP DTNMPTGESL VRQVLYGQRY FEKTFGTRHT
VCWLPDCFGF SGALPQILRQ GGIDSFFTIK VNWSETNHIP SDLFWWKGLD GSQVLTHTFD
NPMQGYNGFV QADCYVPTWK NFRGKVQHDT SLLAVGYGDG GGGVTPEMVE REVQLRDFPA
IPQARWGTVK SFYEKAHRTA REKNLPVWDG EIYLELHRAT LTSQSGVKRK HRQAERALIT
AETLASLAHM LGADKPKSLE ADWRVVLKNE FHDILPGSSI REVYQDAEQE LGGVIDNAKS
EQAKAVQALS AHLPKGGVGD ALVVVNPSLM PRPVSATLAD GTVISAGDIV APLSVAVFDR
QSLQPAGGLK ASSDRLENDH LAVVIGKDGA VASLVHKASG REAVDGSANQ LWVYPADKPR
NWDAWDIDAD YAEKAVRLEA PESISLVESG PHRAAIRVIH RYRNSSVTQT YVLTANSKRL
DIETTIDWHD RRTLLRTLNP VDVKARKATF ECAFGIVERA THTNTSWEQA MFEAVAHRFV
DISEPGFGVA LLNNAKYGHS ARGNVIGMSL VRGPIYPDPL ADEGEQSFTY ALLPHDGAWH
EGGVLDEALD LNQPLVAVEA KGLSAGTFAP LAVTGTPVAF SGLKPAEEGD GLILRLYEPA
GRRGRLSLAL PPGWKASPPL NILEEPMERK GPAEIMPFEI RTWRLQKA
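
Both sequences are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, and the codon table is built by hand on the understanding that translation table 11 assigns the same amino acids as the standard genetic code (it differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Codons enumerated in TCAG order, paired with the amino acids of the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Join the 10-character blocks by dropping spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCTCTCA CCATTGCCCA ACGCCTCGAC CGCCTGAAAG TCCGTATCAC CGAGCTGGCG"
print(translate(clean_sequence(first_row)))
```

Translating the first row this way gives the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` is one way to compare against the reported GC content.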