Gene Rleg_3654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3654 
Symbol 
ID8014501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3699155 
End bp3701131 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content63% 
IMG OID644826217 
Productglycogen debranching enzyme GlgX 
Protein accessionYP_002977436 
Protein GI241206340 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases 
TIGRFAM ID[TIGR02100] glycogen debranching enzyme GlgX 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.328672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGGGA ACAGTTCGGC GCAAGCCGGC GCGATCGTCT TCGAGACCGG CGTCGAATTT 
GCCGTATGGT CGCATCATGC CGCACAGATC GAACTCTGCC TCTTCGAGGA TGACGGCAAC
AGGGAATTCG CGCGCCTGCC GATGGCGCGC GACAGCAACC ACATCCATCG ACTGTTTGTC
GACGGATTGA AGGCGGGCGC GCGCTACGGC TATCGCGCCG ACGGTATTTA TGCGCCCGAT
AACGGCCTCT GGTTCGATCC CTCCAAACTG CTGATCGATC CCTACGCCAA GGAGATCGAT
AGGCCGTTCC GCTACGATCC CCGCCTCGGC ATCTATGGCG AGGACAGCCA GGATCTGATG
CCGAAGGCGA TTGTCACCAC CGATACCCGA GCCGCGATCA GCAAGCCGCT CTTCAAACCG
GGCGGCTTTA TCTATGAGGT GGCGGTACGG CCCTTCACCA TTCTCCATCC CGACGTGCCG
GAGGCTGAGC GCGGCACGGT CGCAGCGCTT GCCCATCCCT CCGTCGTCGC ACATCTGAAG
CGGATCGGTG TCGATGCCGT CGAACTGATG CCGATCACCG CCTGGATCGA CGAACGCCAC
CTTCCGCCGC TCGGCCTCAC CAACGGCTGG GGCTACAATC CCGTCGCCTT CATGGCGCTC
GACCCGCGGC TCGTGCCTGG CGGCATGACC GAGTTGCGCC AGACGGTCGC GGCCCTCCAT
GCCGAAGGCA TCGCCGTCAT CCTCGACCTC GTCTTCAACC ATACCGGCGA GAGCGACCGT
TACGGCGCGA CGCTGTCGCT GCGCGGCCTC GACAACCTGC ATTATTATCG CCACGCCCAG
AATTGCCCGG GCGAACTCGT CAACGACACA GGCACCGGCA ACACGCTCGC CTGCGATCAT
CCTGAGGTTC GCCGCCTCGT CATCGACAGC CTACGCCATT TCGTGCTCAA CGCCGGCGTC
GACGGTTTTC GCTTCGATCT CGCCCCGGTA CTCGGCCGCA CCGCGACGGG CTTCGAACGC
GACGGAACAC TGGCCTCGAT CCTCTCCGAC GATGTGCTTG CCGACCGGAT CATGATCGCC
GAACCCTGGG ATATCGGCCC GGGCGGTTAC CAGCTCGGCA ATTTCCCGCC GCCCTTCCTT
GAATGGAACG ACCGGGTTCG CGATGATCTG CGCTGCTACT GGCGCGGCGA CGATTGGAAG
ACCGGCGCGC TGGCAACCGC ACTTGCCGGC TCCTCCGACA TCTTCTCCCG CAACGACGGC
AACGAGACGC GCAGCGTCAA TTTTCTCGCC GCCCATGACG GCTTCACGCT GATCGATCTC
GTCTCCTATG CCGCAAAGCA CAACGACGCC AACGGCGAAC ACAATCGCGA CGGCCATAAC
GAGAATCATT CCTGGAACAA CGGCGTCGAG GGGGAAACCG TCTATCCGAC GATCCGCAAG
CGTCGCCGGG ACGATGTGAT GGCGCTGATC TCAACGCTTT TTGCCACCCG CGGCAGCATC
ATGCTGACGG CGGGCGACGA GGGCGGCCGC AGCCAGCACG GCAACAACAA CGCCTATTGC
CAGGACAACG AGATCACCTG GCTGGACTGG AAGGCGTTGG ACGAGGGTCT CATCGCCCAT
ACCGCCTTCG TTGCAGGGTT ACGTCGCCGT TTCACCGTTT TCTCCGAAAC GGGCTTCCTG
GCGGGAAATG GCGATGTCGA ATGGATTTCG CTTTCCGGCG AACCGATGAG CGTTGCCGAA
TGGGAGACGC CGTCGCTCTC CACCCTCGGC ATGCTGTTAT CGACCGGTGA CCGCTCCTCT
CGCGGCAGGC AGACCAGGCT TGGTGTGCTT TTCAATCGCT CGGGGAGCCG CCAATTTTTC
ACGCTGCCTT CTCAGAGCGA ACCGGGCTGG CGCCAGTTGA CCCCGGATGG AGCGAAGAAA
ACCGGTGGCC GTGCAACCGT CGAGCCACGC TCGATTGCCT TTTTCGTAGA AAATTGA
 
Protein sequence
MRGNSSAQAG AIVFETGVEF AVWSHHAAQI ELCLFEDDGN REFARLPMAR DSNHIHRLFV 
DGLKAGARYG YRADGIYAPD NGLWFDPSKL LIDPYAKEID RPFRYDPRLG IYGEDSQDLM
PKAIVTTDTR AAISKPLFKP GGFIYEVAVR PFTILHPDVP EAERGTVAAL AHPSVVAHLK
RIGVDAVELM PITAWIDERH LPPLGLTNGW GYNPVAFMAL DPRLVPGGMT ELRQTVAALH
AEGIAVILDL VFNHTGESDR YGATLSLRGL DNLHYYRHAQ NCPGELVNDT GTGNTLACDH
PEVRRLVIDS LRHFVLNAGV DGFRFDLAPV LGRTATGFER DGTLASILSD DVLADRIMIA
EPWDIGPGGY QLGNFPPPFL EWNDRVRDDL RCYWRGDDWK TGALATALAG SSDIFSRNDG
NETRSVNFLA AHDGFTLIDL VSYAAKHNDA NGEHNRDGHN ENHSWNNGVE GETVYPTIRK
RRRDDVMALI STLFATRGSI MLTAGDEGGR SQHGNNNAYC QDNEITWLDW KALDEGLIAH
TAFVAGLRRR FTVFSETGFL AGNGDVEWIS LSGEPMSVAE WETPSLSTLG MLLSTGDRSS
RGRQTRLGVL FNRSGSRQFF TLPSQSEPGW RQLTPDGAKK TGGRATVEPR SIAFFVEN