Gene Rleg2_4753 details

Gene Information

Locus tag: Rleg2_4753
Symbol:
ID: 6977847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 384092
End bp: 385945
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 58%
IMG OID: 643393920
Product: glycoside hydrolase 15-related
Protein accession: YP_002278738
Protein GI: 209546820
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
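
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a final stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(384092, 385945)
print(length)                     # 1854
print(protein_length_aa(length))  # 617
```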


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.392129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTTC GGACAAAAGC CAGTCCTGAA GCCGGGATCT CGACTATAGG CAGCGCAATT 
CCTCGCCCTA TCGCAGACCA TGGAATAATT GGCGATCTCG CCACGCTTGC CCTCGTCGCC
AAGGACGGCG CAATTGACTT CATGTGCTGG CCGAACTTCG ATAGCCCGAC GATTTTCGCA
GGGCTTCTCG ATCCAGAGCG CGGGGGCGCC TTTGAACTGG CCCCCGATAT CCCCGATGCG
CGGGTGGTCC AGCAATATCT GCCGGACACG AACGTGCTTC TCACACGCTG GATGGGCGAT
CACGTAAGCG CTGAGCTGAT TGACTTCATG GTCGTCAAAG AAGGCAGGGG GGATATTCCG
ACAACGGTAG CCAGAAGGCT GAGGGTGACC CGTGGTGAGG CAAGCTTCAT CATGCGCTGC
GCCCCGCGCT TTGATTACGC CCGGGAAACC GTGACTGCGC ATGTGACGGC GAACCAGGCG
GTGTGGCAGC CAGTCAATGC ACATGGAATC CGATTGACGT CGAACATCCC GCTTGCCGGC
GACACCGACG GCGCTCGTGC GACCGCTCGC CTCGCAGCTG GCGAAACCGC TGATTTCCTC
CTCGCGGGAA TTGATGAAGA GCCGATCGAG CTTCCGGATA TTCACGAATT TGAGAAAGCG
ACGATCGAAT ACTGGCAAGA TTGGGCAGCG CAGTCATCGT ACCAAGGTCG TTGGCGCGAA
ATGGTGACGC GGTCTGCGTT GACACTCAAG CTCATGACAT CCCGTCGGCA TGGTTCCGTC
ATCGCGGCGG GAACGTTTGG CCTGCCGGAA ACGCCAGGCG GCGCGCGCAA TTGGGATTAC
CGCGCGACAT GGATCAGGGA CGCATCTTTT ACAGTTTATG CTCTCATGCG CCTTGGCTAT
CAGCAGGAAG CCCAGGCATT TAACACATGG ATCGGCAAGC GCGCCACGAA GTGCCAGGGC
AGCGGAAAGC TCGACATCAT GTATACGGTC GACGGACGAA GGGTGCCCGA CGAAGTCGCC
CTTGAACACT TCGCTGGCTA CGGCGGTGCC CGCCCGGTCC GCATCGGAAA CGACGCGGTG
GAACAAATCC AGCTCGATAT CTACGGCGAG CTCATGGACG CGGTCTATCT CAGCAACAAA
TATGGCCATG CCATCTCTCA CGATGGCTGG AACGGCGTGC GTGACGTCAT CAATTATGTC
TGTGATCACT GGGAAGAGCC AGATGCTGGG ATTTGGGAAA TGCGCCGCGA GCCTCAGCAT
TTCCTGCATT CGCGCTTGAT GTGCTGGGTC GCAGTCGACA GGGCGATCAG GCTTGCAAAT
AAACGATCAC TCGCGGCCCC GTTCAGTCGA TGGATCGATG TCCGAAACCT GATCTACGAG
GACATCTGGG CAAACTTCTG GGATGAACAG ACCGGGCATT TCGTGCAGAC AAGGGGCGGG
AAGAGCCTCG ACGCCTCCTT GCTGTTGATG CCGCTGGTCA GGTTCGTCAG TGCAACTGAC
CCCCGATGGC TCTCTACTCT CGAGGCAATC GGGATGGCTC TCGTAGATGA CGGCCTTGTC
TATCGTTATA GGAGCGACGA TGGATTACAC GGTCAGGAGG GCGCATTCTC GGCCTGTTCT
TTCTGGTACG CGGAATGTCT GGCGCGGGCT GGCGATATTG CGAAAGCCCG ACGCGTTTTT
GAGGCCGTAC TGACTTATGC CAACCATCTC GGCCTTTACG CAGAGGAGTT CGACACTCGC
GCAAGCCTGT CGGGAAACTT CCCACAGGCC TTCACGCACA TGGCGCTGAT AAGTGCAGCC
TACTATCTCG ATCGCGAGTT GGACGGGAAA AAGGCTCAGG AATGGCGACC TTAA
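
A GC content figure such as the one listed for this gene can be recomputed from the sequence itself. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper name. It ignores the spaces and line breaks used to lay out the sequence in 10-base blocks.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, spaced as in the record.
print(gc_content("ATGAATTTTC GGACAAAAGC"))  # 35.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the rounded percentage in the record.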
 
Protein sequence
MNFRTKASPE AGISTIGSAI PRPIADHGII GDLATLALVA KDGAIDFMCW PNFDSPTIFA 
GLLDPERGGA FELAPDIPDA RVVQQYLPDT NVLLTRWMGD HVSAELIDFM VVKEGRGDIP
TTVARRLRVT RGEASFIMRC APRFDYARET VTAHVTANQA VWQPVNAHGI RLTSNIPLAG
DTDGARATAR LAAGETADFL LAGIDEEPIE LPDIHEFEKA TIEYWQDWAA QSSYQGRWRE
MVTRSALTLK LMTSRRHGSV IAAGTFGLPE TPGGARNWDY RATWIRDASF TVYALMRLGY
QQEAQAFNTW IGKRATKCQG SGKLDIMYTV DGRRVPDEVA LEHFAGYGGA RPVRIGNDAV
EQIQLDIYGE LMDAVYLSNK YGHAISHDGW NGVRDVINYV CDHWEEPDAG IWEMRREPQH
FLHSRLMCWV AVDRAIRLAN KRSLAAPFSR WIDVRNLIYE DIWANFWDEQ TGHFVQTRGG
KSLDASLLLM PLVRFVSATD PRWLSTLEAI GMALVDDGLV YRYRSDDGLH GQEGAFSACS
FWYAECLARA GDIAKARRVF EAVLTYANHL GLYAEEFDTR ASLSGNFPQA FTHMALISAA
YYLDRELDGK KAQEWRP
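
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own method. It relies on translation table 11 assigning the same amino acid to each codon as the standard code, and it does not handle alternative start codons or ambiguous bases, which would raise a `KeyError`. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("ATGAATTTTC GGACAAAAGC CAGTCCTGAA"))  # MNFRTKASPE
```

The output matches the first ten residues of the protein sequence above.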