Gene Information |
Locus tag | Rleg2_5383 |
Symbol | |
ID | 6978477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1016306 |
End bp | 1019587 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394485 |
Product | trehalose synthase |
Protein accession | YP_002279303 |
Protein GI | 209547385 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
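The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; both are assumptions about how this record is laid out, not something the record states. The function names are illustrative.

```python
# Values copied from the Gene Information table above.
START_BP = 1016306
END_BP = 1019587


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 3282 1093
```

Under those assumptions the result agrees with the reported Gene Length (3282 bp) and Protein Length (1093 aa).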
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0116468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.915163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACGA TGAACCCCAA TGGCATTGAG CAGCCGCTCT GGTACAAGGA TGCGATCATC TACCAGCTGC ACATCAAGTC GTTCTACGAC GCCAATGGCG ACGGCGTCGG CGACTTTGCC GGCCTGCACG CCAAGCTCGA TCACATCGCC GCGCTCGGCG TCAATGCCAT CTGGCTGCTG CCTTTCTTTC CCTCGCCGCG CCGCGACGAC GGCTACGACA TCGCCGACTA CGGCAATGTC AGCCCCGACT ACGGCACCTT GGAGGACTTT CGCGCCTTCG TCGACGCCGC CCACCAGCGC AATATCCGCG TCATCATCGA GCTCGTCATC AACCACACCT CGGATCAGCA TCCCTGGTTC CAGCGTGCCC GTCAGGCCCC CGCCGGATCA CCGGAGCGCG ATTTTTACGT CTGGTCGGAT ACCGATCAGA AATTCCCGGA AACGCGCATC ATTTTCATCG ATACGGAAAA ATCAAACTGG ACATGGGATG CCGTGGCCGG CGCCTATTAC TGGCATCGCT TTTATTCCCA CCAGCCTGAC CTCAACTTCG ACAGCCCCCT GGTCATGGAG GAACTGCTGA AGGTGATGCG CTTCTGGCTG GAAACCGGTA TCGACGGCTT CCGCCTCGAC GCGATCCCCT ACCTTGTCGA ACGCGAGGGG ACGATCAACG AAAACCTGCC CGAAACCCAT GCGATCCTCA AGCGCATCCG TGCCGCCCTC GACGCCACCC ATCCCGGCGT GATGTTGCTC GCCGAGGCCA ATCAATGGCC GGAGGATACG CGCGAATATT TCGGCGACGG CGACGAATGC CACATGGCCT TCCACTTCCC GCTGATGCCG CGCATGTACA TGGCGATCGC CAAGGAGGAT CGGTTTCCGA TCACCGATAT CCTGCGCCAG ACGCCGGAGA TCCCGGAGAA TTGCCAATGG GCGATCTTCC TGCGCAATCA CGACGAACTG ACGCTCGAAA TGGTGACGGA CGCCGAGCGT GATTATCTCT GGGAAACCTA CGCATCCGAT AAGCGCGCCC GCATCAATCT CGGCATCAGG CGGCGCCTGG CGCCGCTGAT GGAGCGCGAC CGCCGGCGGA TCGAGCTGAT GAATGCGCTG CTGCTCTCGA TGCCGGGAAC GCCGGTCATC TATTACGGCG ACGAGATCGG CATGGGCGAC AATATCTATC TCGGCGACCG GGATGGGGTG AGGACGCCGA TGCAATGGTC TCCCGACCGC AATGGCGGCT TCTCCAGAGT AGACCCGGCG CGACTCGTCC TGCCGCCGGT CGCCGATCCG CTCTACGGTT TCGAGGCCGT CAACGTCGAG GCGCAGAGCA CGGACGCGCA TTCGCTGCTC AACTGGACGC GCAAGATGCT GGCGCTGCGC GGGCGGCATC CGGCCTTCGG CCGCGGTTCG TTGCGGTTTC TCGCGCCGGA GAACCGCAAG ATCCTCGCCT ATCTCAGGGA GTATGAAGGC GAGACCCTGA TGTGTGTCGC AAATCTCTCG CGGCTGCCCC AGGCCGTCGA ACTCGACCTC TCAGCCTTTG AGGGCCGCGT GCCGATCGAA TTGACCGGCA TGTCGCCCTT TCCGCCGATC GGCCAGCTGA CCTATCTGCT GACACTGCCG CCATACGGAT TTTTCTGGTT CCAGCTGGAG GCCGATGCCG ACCCGCCGGC ATGGCGCACC GCGCCGCCGG AGCAGCTTCC CGATCTGGTG ACGATGGTCA CCCGCCGCGG CCTGCTCGAT CTTGTCGATG AACCCAAGCT TGCACGCGTT CTCAGCAATG AAATTCTGCC GGCCTATCTG 
GCGAAGCGGC GGTGGTTCGG GGCGAAGGAC CAGCCGCTGC AGGCGGCGCG TCTGATTTCC GCAACGCCGA TCCCCTTTGC CGACGGGATC GTTCTCGGCG AACTGGAGGC CGTGCTGCCG AACCATAGCG AATCCTACCA ACTGCCGCTG GCGGTCGCCT GGGACGATGC GCAGCCATCG GTGCTCAGCC AGCAGCTTGC GCTCGGGCGG GTCCGCCAGG GCCGGCGCGT CGGCTTCCTG ACGGACGGAT TTGCCGTGGA GGCGATGGCG CGCGGCATCC TGCGCGGGCT TGGCGACCGC TCGCGCACCA CCGGCCGCAC CGGCACGCTT GAATTTCTCG GCACGGAACG GCTCGATCGT CTCGCCGTCA CCGACGACCT TCCGGTCCAT TGGCTCTCGG CTGAACAGTC CAACAGTTCG CTGATCGTCG GCGATCTCGC GATGATCAAG CTGATCAGGC ACATCTTTTC TGGCATTCAT CCGGAGGTCG AGATGACGCG TTATCTCACC CGTGCCGGCT ACGATCACAC GGCGCCCTTG CTCGGCGAGG TCGCGCACAC GGACTCCAGT GGACGCCGCT CGACATTGAT CATCGTCCAA GGCGCGATCC GCAATCAGGG CGACGCCTGG AACTGGATGC TGAACAACCT GCGCCGCGGG GCAGACGAAC TCGTGCTGAA CGACCCGGCC GTCCAGCCGG GCGACGACGT CTTCCAGCCA CTGATCAGCT TCGTTGCGAT GGTCGGCATG CGGCTCGGAG AACTGCATGT CGTGCTCGCC GGGGAGACTG CGGACGAAGC CTTCAGCCCG GTAATCGCCG GCGACAGCGA AGTCGAGGCG ATGAAGAAGG CGGTTGCCGG CGAAGTCGCC TATGCCATGT CGAAACTTGC CGAACGGGAG GAGAATGCCG ATCCGGCGAT CGACCTGCTC GCTGCTCCGT TGCTCAAGCG CCGCTCCGAA CTCGTCGAGC TTGCCGCGAC GCTGACCGAG GCCGCGCGCC ATACGCTGAT GACGCGCACG CATGGCGATT TTCACCTTGG TCAGATCCTC GTCAGCGAGG GAGATGCCGT CATTATCGAT TTCGAGGGCG AGCCGGCGAA AAACCTGGCC GAACGCCGCG CCAAGACCAA TCCGCTGCGT GATGTCGCCG GGCTTTTGAG GTCACTGAGT TATCTCGTCG CCACCGCCCA GCTTGATAAC GACGCCGTCA TCGAACACGA GAACGAAGTC CGACGCGAGG CCATCGCCCG CTTCGGACGC CAGGCGGAAG AGGCGTTTCT CGATGCCTAT TCGCAGGCGG TTTCGGCATC GAAGGCGCTA GATATGCCAC CCGATCAACG ACGGAGGGTC CTCGATGCCT TTCTTCTCGA AAAGGCCGCT TATGAAATCG CCTATGAAGC CCGCAACCGG CCGAAATGGC TTCCGATCCC GCTTTCCGGC CTCACCGAAA TCGTCTCGCG TCTGGCGGGG GTCCCGGCAT GA
|
Protein sequence | MDTMNPNGIE QPLWYKDAII YQLHIKSFYD ANGDGVGDFA GLHAKLDHIA ALGVNAIWLL PFFPSPRRDD GYDIADYGNV SPDYGTLEDF RAFVDAAHQR NIRVIIELVI NHTSDQHPWF QRARQAPAGS PERDFYVWSD TDQKFPETRI IFIDTEKSNW TWDAVAGAYY WHRFYSHQPD LNFDSPLVME ELLKVMRFWL ETGIDGFRLD AIPYLVEREG TINENLPETH AILKRIRAAL DATHPGVMLL AEANQWPEDT REYFGDGDEC HMAFHFPLMP RMYMAIAKED RFPITDILRQ TPEIPENCQW AIFLRNHDEL TLEMVTDAER DYLWETYASD KRARINLGIR RRLAPLMERD RRRIELMNAL LLSMPGTPVI YYGDEIGMGD NIYLGDRDGV RTPMQWSPDR NGGFSRVDPA RLVLPPVADP LYGFEAVNVE AQSTDAHSLL NWTRKMLALR GRHPAFGRGS LRFLAPENRK ILAYLREYEG ETLMCVANLS RLPQAVELDL SAFEGRVPIE LTGMSPFPPI GQLTYLLTLP PYGFFWFQLE ADADPPAWRT APPEQLPDLV TMVTRRGLLD LVDEPKLARV LSNEILPAYL AKRRWFGAKD QPLQAARLIS ATPIPFADGI VLGELEAVLP NHSESYQLPL AVAWDDAQPS VLSQQLALGR VRQGRRVGFL TDGFAVEAMA RGILRGLGDR SRTTGRTGTL EFLGTERLDR LAVTDDLPVH WLSAEQSNSS LIVGDLAMIK LIRHIFSGIH PEVEMTRYLT RAGYDHTAPL LGEVAHTDSS GRRSTLIIVQ GAIRNQGDAW NWMLNNLRRG ADELVLNDPA VQPGDDVFQP LISFVAMVGM RLGELHVVLA GETADEAFSP VIAGDSEVEA MKKAVAGEVA YAMSKLAERE ENADPAIDLL AAPLLKRRSE LVELAATLTE AARHTLMTRT HGDFHLGQIL VSEGDAVIID FEGEPAKNLA ERRAKTNPLR DVAGLLRSLS YLVATAQLDN DAVIEHENEV RREAIARFGR QAEEAFLDAY SQAVSASKAL DMPPDQRRRV LDAFLLEKAA YEIAYEARNR PKWLPIPLSG LTEIVSRLAG VPA
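The relationship between the gene sequence, the protein sequence, and the GC content can be sketched in a few lines. This is a minimal illustration, not the pipeline that produced the record. It assumes the sequence is pasted as space-separated 10-base blocks, as shown above. The codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names `clean`, `gc_percent`, and `translate` are illustrative.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGACACGA TGAACCCCAA TGGCATTGAG"))  # prints: MDTMNPNGIE
# Last twelve bases of the gene sequence above (ends in the TGA stop codon).
print(translate("GTCCCGGCAT GA"))  # prints: VPA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence once its spaces are removed, and `gc_percent` on the full sequence can be compared with the reported 63%.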