Gene Rleg2_5397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5397 
Symbol 
ID6978491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1037986 
End bp1039776 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content60% 
IMG OID643394499 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_002279317 
Protein GI209547399 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.334686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00587548 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTCTTCA CCGAAGGCTT TCCCTTGACC AATATGACGT TCGGCCCCGC ATTTACTGAA 
GACGGTATTC TGTTTCGGCT CTGGGCTCCC CTGCATGAAA GCGTGTCGTT GAAGATCGAA
GGCACCGATC CGCGGCCGAT GCAGGCGGCG GAAGATGGCT GGCATCGATG CACGGTACCG
AATGCCCATG CCGGCACGCG CTATCGCTTC GTTCTGCCGG ACGGTCTTGA AATTCCCGAT
CCTGCGTCGC GGTTCCAGCC ACAGGATGTG CACGGTCCGA GCGAAGTGGT CGACCTGTCC
TATCGCTGGA AGACGAGTGA CTGGACCGGC CGGCCATGGG AGGAGATGGT CGTCTACGAG
ATGCATATCG GCTGCTTCAC GCCGGAGGGG ACTTTCGAAG CTGCGATCGA GCGGCTCGAT
CATCTGCGGG AGCTGGGAGT CACGGCGTTG CAGATCATGC CGGTCAGCGA ATTCCCCGGC
CGTTACAGCT GGGGTTATGA CGGCGTGCTG CCCTATGCTC CTGACAGCAG CTATGGCCGG
CCGGAGGATT TCATGGCGTT GGTCGATGCA GCGCATCAGC GCGGCATCTC GGTGTTCCTC
GATGTGGTCT ACAATCACTT CGGGCCTGAC GGGAATTATA TCCCTTCCTA TGCGCCGCTC
TTTACCGACC GTCACAAGAC ACCCTGGGGC CACGGTATCA ACTATGACGG CGACGGATCG
CAGATGATCC GCGAATTTGT CATTGAGAAT GCCATCTACT GGGTCACCGC ATTTAGGCTC
GACGGCTTCC GCTTGGATGC CGTTCACGCG ATCAAAGACA ATAGCGACGA GCATCTGCTT
CACGAGCTTG CCCGCCGCGT CAGGGCTGCG GCTGTCGACC GGCACATTCA TCTGATCGTC
GAAAACGAGG AGAATGACAG CGACTTATTG AAGCGTGACG AAAAGGGCGC AGCGACGCTG
TTCACCGCCC AGTGGAACGA CGACGTGCAC CACGTGCTGC ATATCACTGC GACCGGCGAA
ACCTTCGGCT ATTATGCCGA TTACGCCGGT GACGCCGGAA AGCTCGGTCG GGCGCTGGCG
GAAGGCTTCG TGTTCCAGGG AGAACATATG CCCTATCGCG GCGGGAACAG GGGCCGGCCG
AGCGCTCACC TGCCGCCAAC GGCTTTCATC TCCTTCATCC AGAACCATGA TCAGATCGGC
AATCGGGCGC TAGGCGACCG GGTCATGGCG TCGAGCCCGG CCGAAGCCGT CCAAGCCGTC
ATATCAATCT ATCTGCTGGC GCCCGAGATT CCGATGCTGT TCATGGGCGA GGAATGGGGC
GCTGCAGAGC CTTTCCCGTT TTTCTGCGAT TTCGACGAGG GGTTGAATGA GAAGGTCAGG
AAAGGCCGTC GTGAGGAGCT TTCGCGTCTG CCGGGCTTCG ATGCCGACGA CCTTCTCGAC
CCGACGGCGC CATCTACCTT TGCTGCGGCC AAACTGGATT GGTCGAAACG CGCTTCCTCC
GAGATCGTCG ATTTCTACAA AATGTTGCTC GACCTCCGGC ACCGGAAGAT CGTTCCTTTG
CTGAAAGGTG TGACGTCTGG AAACGCCGTT TACCGCTCGA CGGGAAATGC GATCGCAGTG
GATTGGTCCC TGGCGGAAGG CCGGCATCTT CATCTGCGTG CCAACCTCGG CGACGAGGCG
GCGGCGCTCG ATTCACAACA GCAGGACGAT GAGACGATAT TCCATCTCGG CGGACGCGAC
GGCGGCGATC TGGCGCCCTG GGCAGTGATC TGGAGCCTGA GCCAGGCGTA A
 
Protein sequence
MLFTEGFPLT NMTFGPAFTE DGILFRLWAP LHESVSLKIE GTDPRPMQAA EDGWHRCTVP 
NAHAGTRYRF VLPDGLEIPD PASRFQPQDV HGPSEVVDLS YRWKTSDWTG RPWEEMVVYE
MHIGCFTPEG TFEAAIERLD HLRELGVTAL QIMPVSEFPG RYSWGYDGVL PYAPDSSYGR
PEDFMALVDA AHQRGISVFL DVVYNHFGPD GNYIPSYAPL FTDRHKTPWG HGINYDGDGS
QMIREFVIEN AIYWVTAFRL DGFRLDAVHA IKDNSDEHLL HELARRVRAA AVDRHIHLIV
ENEENDSDLL KRDEKGAATL FTAQWNDDVH HVLHITATGE TFGYYADYAG DAGKLGRALA
EGFVFQGEHM PYRGGNRGRP SAHLPPTAFI SFIQNHDQIG NRALGDRVMA SSPAEAVQAV
ISIYLLAPEI PMLFMGEEWG AAEPFPFFCD FDEGLNEKVR KGRREELSRL PGFDADDLLD
PTAPSTFAAA KLDWSKRASS EIVDFYKMLL DLRHRKIVPL LKGVTSGNAV YRSTGNAIAV
DWSLAEGRHL HLRANLGDEA AALDSQQQDD ETIFHLGGRD GGDLAPWAVI WSLSQA