Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5397 |
Symbol | |
ID | 6978491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1037986 |
End bp | 1039776 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643394499 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_002279317 |
Protein GI | 209547399 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.334686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00587548 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCTTCA CCGAAGGCTT TCCCTTGACC AATATGACGT TCGGCCCCGC ATTTACTGAA GACGGTATTC TGTTTCGGCT CTGGGCTCCC CTGCATGAAA GCGTGTCGTT GAAGATCGAA GGCACCGATC CGCGGCCGAT GCAGGCGGCG GAAGATGGCT GGCATCGATG CACGGTACCG AATGCCCATG CCGGCACGCG CTATCGCTTC GTTCTGCCGG ACGGTCTTGA AATTCCCGAT CCTGCGTCGC GGTTCCAGCC ACAGGATGTG CACGGTCCGA GCGAAGTGGT CGACCTGTCC TATCGCTGGA AGACGAGTGA CTGGACCGGC CGGCCATGGG AGGAGATGGT CGTCTACGAG ATGCATATCG GCTGCTTCAC GCCGGAGGGG ACTTTCGAAG CTGCGATCGA GCGGCTCGAT CATCTGCGGG AGCTGGGAGT CACGGCGTTG CAGATCATGC CGGTCAGCGA ATTCCCCGGC CGTTACAGCT GGGGTTATGA CGGCGTGCTG CCCTATGCTC CTGACAGCAG CTATGGCCGG CCGGAGGATT TCATGGCGTT GGTCGATGCA GCGCATCAGC GCGGCATCTC GGTGTTCCTC GATGTGGTCT ACAATCACTT CGGGCCTGAC GGGAATTATA TCCCTTCCTA TGCGCCGCTC TTTACCGACC GTCACAAGAC ACCCTGGGGC CACGGTATCA ACTATGACGG CGACGGATCG CAGATGATCC GCGAATTTGT CATTGAGAAT GCCATCTACT GGGTCACCGC ATTTAGGCTC GACGGCTTCC GCTTGGATGC CGTTCACGCG ATCAAAGACA ATAGCGACGA GCATCTGCTT CACGAGCTTG CCCGCCGCGT CAGGGCTGCG GCTGTCGACC GGCACATTCA TCTGATCGTC GAAAACGAGG AGAATGACAG CGACTTATTG AAGCGTGACG AAAAGGGCGC AGCGACGCTG TTCACCGCCC AGTGGAACGA CGACGTGCAC CACGTGCTGC ATATCACTGC GACCGGCGAA ACCTTCGGCT ATTATGCCGA TTACGCCGGT GACGCCGGAA AGCTCGGTCG GGCGCTGGCG GAAGGCTTCG TGTTCCAGGG AGAACATATG CCCTATCGCG GCGGGAACAG GGGCCGGCCG AGCGCTCACC TGCCGCCAAC GGCTTTCATC TCCTTCATCC AGAACCATGA TCAGATCGGC AATCGGGCGC TAGGCGACCG GGTCATGGCG TCGAGCCCGG CCGAAGCCGT CCAAGCCGTC ATATCAATCT ATCTGCTGGC GCCCGAGATT CCGATGCTGT TCATGGGCGA GGAATGGGGC GCTGCAGAGC CTTTCCCGTT TTTCTGCGAT TTCGACGAGG GGTTGAATGA GAAGGTCAGG AAAGGCCGTC GTGAGGAGCT TTCGCGTCTG CCGGGCTTCG ATGCCGACGA CCTTCTCGAC CCGACGGCGC CATCTACCTT TGCTGCGGCC AAACTGGATT GGTCGAAACG CGCTTCCTCC GAGATCGTCG ATTTCTACAA AATGTTGCTC GACCTCCGGC ACCGGAAGAT CGTTCCTTTG CTGAAAGGTG TGACGTCTGG AAACGCCGTT TACCGCTCGA CGGGAAATGC GATCGCAGTG GATTGGTCCC TGGCGGAAGG CCGGCATCTT CATCTGCGTG CCAACCTCGG CGACGAGGCG GCGGCGCTCG ATTCACAACA GCAGGACGAT GAGACGATAT TCCATCTCGG CGGACGCGAC GGCGGCGATC TGGCGCCCTG GGCAGTGATC TGGAGCCTGA GCCAGGCGTA A
|
Protein sequence | MLFTEGFPLT NMTFGPAFTE DGILFRLWAP LHESVSLKIE GTDPRPMQAA EDGWHRCTVP NAHAGTRYRF VLPDGLEIPD PASRFQPQDV HGPSEVVDLS YRWKTSDWTG RPWEEMVVYE MHIGCFTPEG TFEAAIERLD HLRELGVTAL QIMPVSEFPG RYSWGYDGVL PYAPDSSYGR PEDFMALVDA AHQRGISVFL DVVYNHFGPD GNYIPSYAPL FTDRHKTPWG HGINYDGDGS QMIREFVIEN AIYWVTAFRL DGFRLDAVHA IKDNSDEHLL HELARRVRAA AVDRHIHLIV ENEENDSDLL KRDEKGAATL FTAQWNDDVH HVLHITATGE TFGYYADYAG DAGKLGRALA EGFVFQGEHM PYRGGNRGRP SAHLPPTAFI SFIQNHDQIG NRALGDRVMA SSPAEAVQAV ISIYLLAPEI PMLFMGEEWG AAEPFPFFCD FDEGLNEKVR KGRREELSRL PGFDADDLLD PTAPSTFAAA KLDWSKRASS EIVDFYKMLL DLRHRKIVPL LKGVTSGNAV YRSTGNAIAV DWSLAEGRHL HLRANLGDEA AALDSQQQDD ETIFHLGGRD GGDLAPWAVI WSLSQA
|
| |