Gene Rleg_5112 details


Gene Information

Locus tag: Rleg_5112
Symbol:
ID: 8006973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 512457
End bp: 514217
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 62%
IMG OID: 644822026
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_002973286
Protein GI: 241113451
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID: [TIGR02402] malto-oligosyltrehalose trehalohydrolase
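
The coordinates and lengths listed above agree with each other if the start and end positions are read as 1-based and inclusive, and if the CDS ends in a single stop codon. A minimal Python sketch of that check follows; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(512457, 514217)
print(length)                  # 1761
print(protein_length(length))  # 586
```

Both values match the Gene Length and Protein Length fields of the record.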


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.413399
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0894541
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTTCG GACCGGCATC CACTGAAGAG GGTTTTCTAT TCCGCCTCTG GGCTCCCTTG 
CATGAAAGCG TGTTGTTGAA GATTGAAGGC GATGATCCGC GGCCGATGCA GGCGGTCGGG
GACGGCTGGC ACCACTCTAC AGTCGCGGAT GCCAATGTCA GTACGCGCTA CTGCTTCGTC
CTGCCGGACG GTCTCGAAAT ACCCGATCCC GCCTCGCGGT TTCAGCCGCA GGATGTGCAC
GGACCGAGCG AAGTGGTCGA CCTTTCCTTC TATCGCTGGA AGACGAGCGA CTGGACGGGA
CGGCCCTGGG AAGAGATGGT GATCTATGAG ATGCATATCG GCTGCTTCAC GCCGGAGGGT
ACTTTCAAGG CCGCGATCGA GCGGCTCGAC CATCTGCAGG CGCTGGGCGT TACGGCGCTG
CAGATCATGC CGCTGAGTGA ATTTCCCGGC CGTTACAGCT GGGGTTATGA CGGCGTGCTT
CCCTATGCTC CGGACAGCAG CTACGGCCGG CCGGAAGATT TCATGGCGTT GGTGGACGCA
GCGCACCAGC ACGGCATCTC GGTTTTCCTG GACGTGGTCT ACAATCACTT CGGGCCGGAC
GGCAATTACA TCCCGGCCTA TGCACCGCTC TTTACCGATC ATCACAAGAC ACCCTGGGGC
AACGGTATCA ATTACGACGG CGACGGGTCG GAGATGATCC GCGAATTCAT CATCGAGAAC
GCCATCTATT GGATCACCGA GTTCAGGCTC GACGGTTTCC GCTTCGATGC CGTGCATGCC
ATCAAGGACG ATAGCTCCGA GCATCTGCTT CACGCGCTTG CCCGCCGCGT CAGGGCCGCG
GCCGGCGACC GGCATGTTCA TCTGATCGTC GAAAACGAGG AGAACGACAG CGACCTGTTG
CAGCGTGACG AAAACGGGGA AGTGAAGCTG TTCACCGCCC AGTGGAACGA CGACGTGCAC
CATGTGCTGC ATATCACCGC CACCGGCGAA ACCTTCGGCT ATTATGCCGA TTACGCTGGT
GACGCCGGCA AGCTCGGCCG GGCGCTGGCG GAAGGTTTCG TGTTTCAGGG GGAACACATG
CCCTATCGCG GCGGAAGCCG CGGCAAGCCG AGCGGCCATC TGCCGCCCAC CGCTTTTATC
TCCTTCATCC AGAACCATGA CCAGATCGGC AACAGGGCGC TGGGGGATAG GGTTCTGGCT
TCGAGCCCGG CTGATGTCGT CAAGGCAGTC GCCGCCATCT ACCTGCTGGC GCCTGAGATC
CCGATGCTGT TCATGGGGGA GGAATGGGGC GCCAGAGAAC CTTTCCCCTT CTTCTGCGAT
TTCGACGAGG ATCTGAACGA GAAGGTCAGG AAAGGCCGTC GCGAGGAGCT TTCCCGTCTC
CCGGGCTTCG ACGCCGACGA CCTTCTCGAC CCGACGGCGC CATCGACCTT TGCCGCCGCC
AAGCTGGACT GGTCGAGACT CGCCTCTTCC GAGTTACTCG GTTTTTACAG GATGCTTCTC
GACCTCCGGC ACCGCAGGAT CGTGCCTTTG CTGAAAGGCG CTGGCGCCGG AACCGCGGTC
TATCGCTCGG CGGGAAGCGC GCTCGCGGTG GATTGGACCC TGGCGCAGAA CCGGCGTCTT
CATCTGCAGG CCAACCTCGG CGCCGAGGCG GTGCCGCTCG TCTCGCCGCA GGACGACGGC
GAGACGATCT TCGGTCTCGG CGGAAGCGAC GGCGGCGATC TCGCACCCTG GACGGTGATT
TGGAATATCA GCGAGGCGTA A
 
Protein sequence
MTFGPASTEE GFLFRLWAPL HESVLLKIEG DDPRPMQAVG DGWHHSTVAD ANVSTRYCFV 
LPDGLEIPDP ASRFQPQDVH GPSEVVDLSF YRWKTSDWTG RPWEEMVIYE MHIGCFTPEG
TFKAAIERLD HLQALGVTAL QIMPLSEFPG RYSWGYDGVL PYAPDSSYGR PEDFMALVDA
AHQHGISVFL DVVYNHFGPD GNYIPAYAPL FTDHHKTPWG NGINYDGDGS EMIREFIIEN
AIYWITEFRL DGFRFDAVHA IKDDSSEHLL HALARRVRAA AGDRHVHLIV ENEENDSDLL
QRDENGEVKL FTAQWNDDVH HVLHITATGE TFGYYADYAG DAGKLGRALA EGFVFQGEHM
PYRGGSRGKP SGHLPPTAFI SFIQNHDQIG NRALGDRVLA SSPADVVKAV AAIYLLAPEI
PMLFMGEEWG AREPFPFFCD FDEDLNEKVR KGRREELSRL PGFDADDLLD PTAPSTFAAA
KLDWSRLASS ELLGFYRMLL DLRHRRIVPL LKGAGAGTAV YRSAGSALAV DWTLAQNRRL
HLQANLGAEA VPLVSPQDDG ETIFGLGGSD GGDLAPWTVI WNISEA
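
Both sequences above are printed in blocks of ten characters. The following is a minimal Python sketch of stripping that layout, computing GC content, and translating a nucleotide sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which is assumed to be adequate here because translation table 11 uses the same amino acid assignments and differs only in the permitted start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks of the blocked layout."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGACGTTCG GACCGGCATC CACTGAAGAG")
print(translate(first_blocks))  # MTFGPASTEE
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` gives a value that can be compared with the GC content field of the record.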