Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5098 |
Symbol | |
ID | 8007690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 492008 |
End bp | 495289 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644822012 |
Product | trehalose synthase |
Protein accession | YP_002973272 |
Protein GI | 241113437 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.436768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00978496 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACACGA TGAATGCAGA TAGCGCATCG CAGCCGCTCT GGTACAAGGA TGCAATCATC TACCAGCTGC ACATCAAGTC GTTCTACGAC GCCAATGGTG ACGGGGTCGG CGACTTTGCC GGCTTGCACC AGAAGCTCGA TCACATCGCA GCCCTCGGCG TCAATGCCAT CTGGCTTTTG CCTTTCTTTC CCTCTCCGCG TCGAGACGAC GGCTACGACA TCGCCGACTA TGGCAGCGTC AGCCCCGATT ACGGGACGGT GGAGGATTTC CGGGCTTTCG TCGACGCCGC CCACCAGCGC AATATCCGCG TCATCATCGA GCTCGTCATC AACCACACTT CCGATCAGCA TCCCTGGTTC CAGCGCGCCC GCCAGTCGCC GGCAGGCTCG CCCGAGCGCG ACTTCTATGT CTGGTCCGAT ACCGATCAGA AATTTCCGGA AACGCGCATT ATTTTCATCG ATACGGAAAA ATCCAACTGG ACATGGGACG CGGTCGCCGG CGCCTACTAC TGGCACCGCT TCTATTCCCA TCAACCCGAC CTCAATTTCG ACAGCCCTCT CGTCATGGAG GAATTGCTGA GGGTGATGCG TTTCTGGCTG GAAACCGGCA TCGACGGGTT TCGTCTCGAC GCTATACCTT ACCTCGTCGA GCGCGAGGGG ACGATCAACG AAAATCTGCC GGAAACCCAC GCGATCCTCA AACGCATACG CGCCGCGCTC GATGCCACCC ATCCCGGCGT GATGCTGCTT GCCGAGGCCA ATCAATGGCC GGAGGACACG CGCGAATATT TCGGAGACGG CGACGAATGC CACATGGCCT TCCACTTTCC GCTGATGCCG CGCATGTATA TGGCGATCGC CAAGGAGGAC CGGTTTCCGA TCACCGATAT CCTGCGCCAG ACGCCGGAGA TTCCCGACAA CTGCCAATGG GCGATCTTCC TTCGCAACCA CGACGAGCTG ACGCTTGAAA TGGTGACCGA CGCAGAGCGG GATTATCTCT GGGAGACCTA TGCATCCGAC AAACGTGCCC GCATCAATCT CGGCATAAGG CGGCGCCTAG CGCCGTTGAT GGAGCGCGAC CGCCGGCGGA TCGAGCTGAT GAACGCGCTT CTTCTGTCGA TGCCGGGAAC GCCGGTGATC TATTATGGCG ACGAGATCGG CATGGGCGAC AATATCTATC TCGGCGACCG GGATGGAGTG AGGACGCCGA TGCAATGGTC TCCGGACCGC AATGGCGGTT TCTCCAGGGC AGATCCGGCG CGTCTCGTCC TGCCGCCCGT CGCTGACCCG CTCTACGGTT TCGAGGCCGT CAACGTCGAG GCGCAGAGCA CGGACGCGCA TTCGCTGCTC AATTGGACGC GCAGAATGTT GGCGTTGCGC GGCAGGCATC CGGCCTTCGG ACGTGGCACG CTGCGGTTTC TTTCACCGGA AAATCGCAAG ATCCTTGCCT ATCTCAGGGA GTATGAAGGC GAGGTATTGC TTTGTGTTGC AAGTCTCTCG CGCCTGCCCC AGGCCGTCGA ACTCGACTTG TCGAGCTTCG AGGGGCGCGT GCCCATTGAA CTGACCGGCA TGTCGCCGTT TCCGCCGATC GGCCAGTTGA CCTATCTTCT GACCCTGCCG CCCTACGGTT TCTTCTGGTT CCAGCTGACG GCTGATGCCG ACCCGCCGGC GTGGCGCACC GCACCGCCCG AACAGCTTCC TGATCTGTTG ACGATGGTCA TCCGGCGTAG CCTGCTCGAC CTCGTGGACG AACCGGGCCA TGCACGCATC CTGAGCGGCG AAATCTTGCC CGCCTATCTC TCCAGGCGGC GATGGTTCGG GGCAAAGGAC CAGCCGCTTC AGGCCGCCCG ACTGGTCTCC GCGACACCAA TCCCATTCGC CGACGGCGTC GTCCTCGGCG AGCTGGAGGT CGTGCTGCCG AACCACAGCG AATCCTACCA ACTGCCGCTC ACGGTCGCCT GGGACGATGC CCACCCTTCC GCGCTTGCCC AGCAGCTTGC GCTTGGGAGG ATTCGCCAAG GTAGACGCGT CGGTTTCCTC ACAGATGGAT TTGCCGTGGA GGCAATGGCG CGCGGCATTC TGCACGGACT TCGCGACCGC TCGCGCACCA CCGGCCGGAC CGGCACGCTC GAATTCCTTG GGACAGAACA GCTCGACAGT CTCGATATTT CAGACGAGCT GCCGGTGCAT TGGCTATCGG CTGAACAATC CAACAGCTCG CTGCTCGTCG GCGACGTAGC GATGATCAAA CTGATCAGGC ACATCTTCCC GGGCATCCAT CCGGAAGTGG AGATGACGCG CTATCTCACC CGCGCCGGCT ATGACCACAC AGCGCCCCTG CTCGGCGAGG TCGCGCATAC CGATTCCAGC GGACGCCGTT CGACCTTGAT CATCGTCCAG GGCGCGATCC GCAATCAGGG CGACGCCTGG AACTGGATGC TGAACAATCT GCGCCGCGGG GCCGACGAAC TGGTGCTGAA CGATCCGGCG GTCCAACCAG ACGACGACGT TTTCCAGTCG CTGATCAGCT TCGTCGCGAT GGTCGGCCTC AGGCTCGGTG AGCTGCATGT CGTGCTCGCT GCGAAGACCG GGGACGAGGC CTTCAGCCCG GTGGTATCAG GCGACAAAGA GGTCGAGGCG ATGAAGAAGG CCGTTTCCGG CGAGGTAGCC TATGCCATGT CGAAGCTTGA CGAACGCGAG CAGAATGCCG ACCCTGCAAT CGACCTGCTG GCGGCCCCGC TTCTCGAGCG CCGCTCCGAA CTCGCAGAGC TCGCCGGGAC GCTGGCGGAG AGCTGCCGCC ATACGCTGAT GACACGCACG CATGGCGACT TCCATCTTGG CCAGATCCTT GTCAGCGAGG GTGATGCCGT CATCATCGAC TTCGAGGGCG AGCCTGCGAA AAATCTGACC GAGCGCCGCG CTAAGACAAA CCCGCTGCGC GATGTCGCCG GGCTTTTGAG GTCGCTGAGC TACCTCGTGG CCACCGCGCA GCTCGATAAT GACGCGGTCA TCGAACACGA CAACGAGGTC CGCCGCAAGG CGATCGCCCG CTTCGGGCGC AATGCCGAGG AGGCCTTTCT GGATGCCTAC TGGCAGGCGG TCTCCGTATC GAAGGAGCTT GACATGCCAC CCGATCAAAG ACGCAGGGTT CTCGATGCCT TTCTTCTCGA AAAGGCCGCC TACGAGATCG CCTATGAGGC TCGCAACAGG CCGAAATGGC TGCCGATCCC GCTATCTGGC CTTACCGAAA TCGTATCGCG TCTGGCGGGG GTCACGGCAT GA
|
Protein sequence | MDTMNADSAS QPLWYKDAII YQLHIKSFYD ANGDGVGDFA GLHQKLDHIA ALGVNAIWLL PFFPSPRRDD GYDIADYGSV SPDYGTVEDF RAFVDAAHQR NIRVIIELVI NHTSDQHPWF QRARQSPAGS PERDFYVWSD TDQKFPETRI IFIDTEKSNW TWDAVAGAYY WHRFYSHQPD LNFDSPLVME ELLRVMRFWL ETGIDGFRLD AIPYLVEREG TINENLPETH AILKRIRAAL DATHPGVMLL AEANQWPEDT REYFGDGDEC HMAFHFPLMP RMYMAIAKED RFPITDILRQ TPEIPDNCQW AIFLRNHDEL TLEMVTDAER DYLWETYASD KRARINLGIR RRLAPLMERD RRRIELMNAL LLSMPGTPVI YYGDEIGMGD NIYLGDRDGV RTPMQWSPDR NGGFSRADPA RLVLPPVADP LYGFEAVNVE AQSTDAHSLL NWTRRMLALR GRHPAFGRGT LRFLSPENRK ILAYLREYEG EVLLCVASLS RLPQAVELDL SSFEGRVPIE LTGMSPFPPI GQLTYLLTLP PYGFFWFQLT ADADPPAWRT APPEQLPDLL TMVIRRSLLD LVDEPGHARI LSGEILPAYL SRRRWFGAKD QPLQAARLVS ATPIPFADGV VLGELEVVLP NHSESYQLPL TVAWDDAHPS ALAQQLALGR IRQGRRVGFL TDGFAVEAMA RGILHGLRDR SRTTGRTGTL EFLGTEQLDS LDISDELPVH WLSAEQSNSS LLVGDVAMIK LIRHIFPGIH PEVEMTRYLT RAGYDHTAPL LGEVAHTDSS GRRSTLIIVQ GAIRNQGDAW NWMLNNLRRG ADELVLNDPA VQPDDDVFQS LISFVAMVGL RLGELHVVLA AKTGDEAFSP VVSGDKEVEA MKKAVSGEVA YAMSKLDERE QNADPAIDLL AAPLLERRSE LAELAGTLAE SCRHTLMTRT HGDFHLGQIL VSEGDAVIID FEGEPAKNLT ERRAKTNPLR DVAGLLRSLS YLVATAQLDN DAVIEHDNEV RRKAIARFGR NAEEAFLDAY WQAVSVSKEL DMPPDQRRRV LDAFLLEKAA YEIAYEARNR PKWLPIPLSG LTEIVSRLAG VTA
|
| |