Gene Rleg2_5398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5398 
Symbol 
ID6978492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1039800 
End bp1041692 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content62% 
IMG OID643394500 
Product4-alpha-glucanotransferase 
Protein accessionYP_002279318 
Protein GI209547400 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.229574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00287392 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGAAAT CCGCAGAGCT CGACAAACTC GCCCGCCGCC ACGGCATCAG CCTGACAAGG 
CCTAGCCCCG AGAACCGGGA AGTGGTGATC TCAGCCGCGA CTAAGCGCAA AATACTCTCG
GCATTGAATA TTGAACTGAC GGAGGATCAA GAGCCTGGTG AGCCGCGGCG GAAGGCTAAG
CCGGATGGCA GGAAGATCCC GGTGTCGTTT CTGCCGGATT TCCTATCCGA CACACGGGTC
TGGGGCGTGA GCTTGCAGCT TTACGAGCTC CGTTCGGCAC GCAACTGGGG CATAGGAGAC
TTCCAGGATC TCGCCGATTT GGCCGATCTG GCGGGATCGC TGGGGGCGGA TTTCATCGGT
CTCAATCCGC TTCACGCGCC GTTCCTCGCC GATCCTGATC GCTGCAGCCC CTATGAACCC
TCAAGCCGCC AGCATCTCAA CCCGCTCTAT ATCGCGGTCG ACCAGGTGCC GGGCTTTGCT
GGCAATCCCA AGCTGGAACA GGAATTGGAG CGCCTTCGCC AATCCGATCT CGTCGACTAC
ATCGGTGTCG CGCGGGCCAA GCTTGGAGCC CTTCGTGATC TCTGGTCGGC GCGGCGACAA
TGCCGTGTTG GCGACGAGGC CGATTTCGAC GCATTTGTCG CGCAAGGCGG CGACAGCCTG
CGGCTGCATG CGCTGTTCGA ATGCCTCTCC GCTTTCATGG TCGAGCGCGG GGCGGGCGCC
GGCTGGCAGC GGTGGCCGGC CGAGTTGCAG CGCTTCGACA GCGCCGCTGT CGGCGATTTC
GAACGCGAGC ATGCAGATGA CGTTCGCTTT CACATGTGGC TGCAATGGCT CGCCCACCGC
CAGCTGATGC AGGCGGCAGA TCGGGCGCGC AAGGCCGGCC TCAGGATAGG GCTCTATCTC
GATCTTGCCG TCGGGGAGGC GGTCGACGGC TCGGCGACAT GGAGCGAGCC GGATATCTAT
GTCTCGCAGG CGACGATCGG TAGTCCTCCG GATCCATTCG CCGTCGATGG GCAGGATTGG
CACCTTGCCG GATACCTGCC ATCCGAAATT GCCGGAGGGG AGATGTCGCC TTACCGGCGC
ATGGTTGGCA CCGCCATGCG CTACGCGGGC GCCATTCGTA TTGATCACGC ACCGGCGATC
CGCCGCCTTT TCCTGGTTCC GTTAGGCAGC AGGCCGGATG GCGGCGCCTA CGTCCGCTAT
CCCGAGGACC GGCTGTTGCA GATCCTCGCC GAGGTTTCCG CTGAACATCG ATGCCTTGTC
ATCGGGGAGT CCCTCGGAAT GATTCCTGAA GGCTTGCAAG AGGATCTGGC TACTGCCGGC
ATTCTCTCCT ACCGGATCCT TTCCTATGAA CAGGATGAGA AGGGCTTCAA GCCCGCCGAT
GCCTATCCGG TCCTCGCGCT CGCCTGCATT TCGACGCATG ACCACCAGAC GCTTGCCGGC
TGGTGGCGCG GCGCCGACAT TCAGGATCGC TGTGAACACG GTATCGTGCC GCCCGATCTC
ACCGAAGAAC ATCTCAAATA CCGCAAGCGC GAGCGGAGGT ATCTGAAAGC GGTCTTCAAC
GCCGCTGGCC TCGACGTGCC GCCCCGGCTC ACGGCGGCGC GGGCAAGCCA GGAAGCGTTG
CAAGATCTGA CGGTGAGCGC TTATCGTTTC ATTGCTCGCA CGCCGTCGTT GCTGACATCG
GTGCGGCTTG CCGATCTCAC CGACGAGAAA GCGCCGACCA ATATTCCGGG CACCAGCGAC
AGCTATCCGA ACTGGAAGCC GAAGCTTTCG GTTTTGCTGG AGGATCTGCT GTCGGTCCTG
CTGCTCAAGC GCGTAACGGC GGCGATGCGG GAGGAAAGGC CGCGCTACGC CTCCGCGACG
CGAATGGAAA TCGATCGGGG ACGGAACGAA TAG
 
Protein sequence
MMKSAELDKL ARRHGISLTR PSPENREVVI SAATKRKILS ALNIELTEDQ EPGEPRRKAK 
PDGRKIPVSF LPDFLSDTRV WGVSLQLYEL RSARNWGIGD FQDLADLADL AGSLGADFIG
LNPLHAPFLA DPDRCSPYEP SSRQHLNPLY IAVDQVPGFA GNPKLEQELE RLRQSDLVDY
IGVARAKLGA LRDLWSARRQ CRVGDEADFD AFVAQGGDSL RLHALFECLS AFMVERGAGA
GWQRWPAELQ RFDSAAVGDF EREHADDVRF HMWLQWLAHR QLMQAADRAR KAGLRIGLYL
DLAVGEAVDG SATWSEPDIY VSQATIGSPP DPFAVDGQDW HLAGYLPSEI AGGEMSPYRR
MVGTAMRYAG AIRIDHAPAI RRLFLVPLGS RPDGGAYVRY PEDRLLQILA EVSAEHRCLV
IGESLGMIPE GLQEDLATAG ILSYRILSYE QDEKGFKPAD AYPVLALACI STHDHQTLAG
WWRGADIQDR CEHGIVPPDL TEEHLKYRKR ERRYLKAVFN AAGLDVPPRL TAARASQEAL
QDLTVSAYRF IARTPSLLTS VRLADLTDEK APTNIPGTSD SYPNWKPKLS VLLEDLLSVL
LLKRVTAAMR EERPRYASAT RMEIDRGRNE