Gene Rleg_5098 details

Gene Information

Locus tag: Rleg_5098
Symbol:
ID: 8007690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 492008
End bp: 495289
Gene Length: 3282 bp
Protein Length: 1093 aa
Translation table: 11
GC content: 62%
IMG OID: 644822012
Product: trehalose synthase
Protein accession: YP_002973272
Protein GI: 241113437
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
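
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(492008, 495289)
print(length)                  # 3282
print(protein_length(length))  # 1093
```

Both values agree with the Gene Length (3282 bp) and Protein Length (1093 aa) fields of the record.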


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.436768
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00978496
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGACACGA TGAATGCAGA TAGCGCATCG CAGCCGCTCT GGTACAAGGA TGCAATCATC 
TACCAGCTGC ACATCAAGTC GTTCTACGAC GCCAATGGTG ACGGGGTCGG CGACTTTGCC
GGCTTGCACC AGAAGCTCGA TCACATCGCA GCCCTCGGCG TCAATGCCAT CTGGCTTTTG
CCTTTCTTTC CCTCTCCGCG TCGAGACGAC GGCTACGACA TCGCCGACTA TGGCAGCGTC
AGCCCCGATT ACGGGACGGT GGAGGATTTC CGGGCTTTCG TCGACGCCGC CCACCAGCGC
AATATCCGCG TCATCATCGA GCTCGTCATC AACCACACTT CCGATCAGCA TCCCTGGTTC
CAGCGCGCCC GCCAGTCGCC GGCAGGCTCG CCCGAGCGCG ACTTCTATGT CTGGTCCGAT
ACCGATCAGA AATTTCCGGA AACGCGCATT ATTTTCATCG ATACGGAAAA ATCCAACTGG
ACATGGGACG CGGTCGCCGG CGCCTACTAC TGGCACCGCT TCTATTCCCA TCAACCCGAC
CTCAATTTCG ACAGCCCTCT CGTCATGGAG GAATTGCTGA GGGTGATGCG TTTCTGGCTG
GAAACCGGCA TCGACGGGTT TCGTCTCGAC GCTATACCTT ACCTCGTCGA GCGCGAGGGG
ACGATCAACG AAAATCTGCC GGAAACCCAC GCGATCCTCA AACGCATACG CGCCGCGCTC
GATGCCACCC ATCCCGGCGT GATGCTGCTT GCCGAGGCCA ATCAATGGCC GGAGGACACG
CGCGAATATT TCGGAGACGG CGACGAATGC CACATGGCCT TCCACTTTCC GCTGATGCCG
CGCATGTATA TGGCGATCGC CAAGGAGGAC CGGTTTCCGA TCACCGATAT CCTGCGCCAG
ACGCCGGAGA TTCCCGACAA CTGCCAATGG GCGATCTTCC TTCGCAACCA CGACGAGCTG
ACGCTTGAAA TGGTGACCGA CGCAGAGCGG GATTATCTCT GGGAGACCTA TGCATCCGAC
AAACGTGCCC GCATCAATCT CGGCATAAGG CGGCGCCTAG CGCCGTTGAT GGAGCGCGAC
CGCCGGCGGA TCGAGCTGAT GAACGCGCTT CTTCTGTCGA TGCCGGGAAC GCCGGTGATC
TATTATGGCG ACGAGATCGG CATGGGCGAC AATATCTATC TCGGCGACCG GGATGGAGTG
AGGACGCCGA TGCAATGGTC TCCGGACCGC AATGGCGGTT TCTCCAGGGC AGATCCGGCG
CGTCTCGTCC TGCCGCCCGT CGCTGACCCG CTCTACGGTT TCGAGGCCGT CAACGTCGAG
GCGCAGAGCA CGGACGCGCA TTCGCTGCTC AATTGGACGC GCAGAATGTT GGCGTTGCGC
GGCAGGCATC CGGCCTTCGG ACGTGGCACG CTGCGGTTTC TTTCACCGGA AAATCGCAAG
ATCCTTGCCT ATCTCAGGGA GTATGAAGGC GAGGTATTGC TTTGTGTTGC AAGTCTCTCG
CGCCTGCCCC AGGCCGTCGA ACTCGACTTG TCGAGCTTCG AGGGGCGCGT GCCCATTGAA
CTGACCGGCA TGTCGCCGTT TCCGCCGATC GGCCAGTTGA CCTATCTTCT GACCCTGCCG
CCCTACGGTT TCTTCTGGTT CCAGCTGACG GCTGATGCCG ACCCGCCGGC GTGGCGCACC
GCACCGCCCG AACAGCTTCC TGATCTGTTG ACGATGGTCA TCCGGCGTAG CCTGCTCGAC
CTCGTGGACG AACCGGGCCA TGCACGCATC CTGAGCGGCG AAATCTTGCC CGCCTATCTC
TCCAGGCGGC GATGGTTCGG GGCAAAGGAC CAGCCGCTTC AGGCCGCCCG ACTGGTCTCC
GCGACACCAA TCCCATTCGC CGACGGCGTC GTCCTCGGCG AGCTGGAGGT CGTGCTGCCG
AACCACAGCG AATCCTACCA ACTGCCGCTC ACGGTCGCCT GGGACGATGC CCACCCTTCC
GCGCTTGCCC AGCAGCTTGC GCTTGGGAGG ATTCGCCAAG GTAGACGCGT CGGTTTCCTC
ACAGATGGAT TTGCCGTGGA GGCAATGGCG CGCGGCATTC TGCACGGACT TCGCGACCGC
TCGCGCACCA CCGGCCGGAC CGGCACGCTC GAATTCCTTG GGACAGAACA GCTCGACAGT
CTCGATATTT CAGACGAGCT GCCGGTGCAT TGGCTATCGG CTGAACAATC CAACAGCTCG
CTGCTCGTCG GCGACGTAGC GATGATCAAA CTGATCAGGC ACATCTTCCC GGGCATCCAT
CCGGAAGTGG AGATGACGCG CTATCTCACC CGCGCCGGCT ATGACCACAC AGCGCCCCTG
CTCGGCGAGG TCGCGCATAC CGATTCCAGC GGACGCCGTT CGACCTTGAT CATCGTCCAG
GGCGCGATCC GCAATCAGGG CGACGCCTGG AACTGGATGC TGAACAATCT GCGCCGCGGG
GCCGACGAAC TGGTGCTGAA CGATCCGGCG GTCCAACCAG ACGACGACGT TTTCCAGTCG
CTGATCAGCT TCGTCGCGAT GGTCGGCCTC AGGCTCGGTG AGCTGCATGT CGTGCTCGCT
GCGAAGACCG GGGACGAGGC CTTCAGCCCG GTGGTATCAG GCGACAAAGA GGTCGAGGCG
ATGAAGAAGG CCGTTTCCGG CGAGGTAGCC TATGCCATGT CGAAGCTTGA CGAACGCGAG
CAGAATGCCG ACCCTGCAAT CGACCTGCTG GCGGCCCCGC TTCTCGAGCG CCGCTCCGAA
CTCGCAGAGC TCGCCGGGAC GCTGGCGGAG AGCTGCCGCC ATACGCTGAT GACACGCACG
CATGGCGACT TCCATCTTGG CCAGATCCTT GTCAGCGAGG GTGATGCCGT CATCATCGAC
TTCGAGGGCG AGCCTGCGAA AAATCTGACC GAGCGCCGCG CTAAGACAAA CCCGCTGCGC
GATGTCGCCG GGCTTTTGAG GTCGCTGAGC TACCTCGTGG CCACCGCGCA GCTCGATAAT
GACGCGGTCA TCGAACACGA CAACGAGGTC CGCCGCAAGG CGATCGCCCG CTTCGGGCGC
AATGCCGAGG AGGCCTTTCT GGATGCCTAC TGGCAGGCGG TCTCCGTATC GAAGGAGCTT
GACATGCCAC CCGATCAAAG ACGCAGGGTT CTCGATGCCT TTCTTCTCGA AAAGGCCGCC
TACGAGATCG CCTATGAGGC TCGCAACAGG CCGAAATGGC TGCCGATCCC GCTATCTGGC
CTTACCGAAA TCGTATCGCG TCTGGCGGGG GTCACGGCAT GA
 
Protein sequence
MDTMNADSAS QPLWYKDAII YQLHIKSFYD ANGDGVGDFA GLHQKLDHIA ALGVNAIWLL 
PFFPSPRRDD GYDIADYGSV SPDYGTVEDF RAFVDAAHQR NIRVIIELVI NHTSDQHPWF
QRARQSPAGS PERDFYVWSD TDQKFPETRI IFIDTEKSNW TWDAVAGAYY WHRFYSHQPD
LNFDSPLVME ELLRVMRFWL ETGIDGFRLD AIPYLVEREG TINENLPETH AILKRIRAAL
DATHPGVMLL AEANQWPEDT REYFGDGDEC HMAFHFPLMP RMYMAIAKED RFPITDILRQ
TPEIPDNCQW AIFLRNHDEL TLEMVTDAER DYLWETYASD KRARINLGIR RRLAPLMERD
RRRIELMNAL LLSMPGTPVI YYGDEIGMGD NIYLGDRDGV RTPMQWSPDR NGGFSRADPA
RLVLPPVADP LYGFEAVNVE AQSTDAHSLL NWTRRMLALR GRHPAFGRGT LRFLSPENRK
ILAYLREYEG EVLLCVASLS RLPQAVELDL SSFEGRVPIE LTGMSPFPPI GQLTYLLTLP
PYGFFWFQLT ADADPPAWRT APPEQLPDLL TMVIRRSLLD LVDEPGHARI LSGEILPAYL
SRRRWFGAKD QPLQAARLVS ATPIPFADGV VLGELEVVLP NHSESYQLPL TVAWDDAHPS
ALAQQLALGR IRQGRRVGFL TDGFAVEAMA RGILHGLRDR SRTTGRTGTL EFLGTEQLDS
LDISDELPVH WLSAEQSNSS LLVGDVAMIK LIRHIFPGIH PEVEMTRYLT RAGYDHTAPL
LGEVAHTDSS GRRSTLIIVQ GAIRNQGDAW NWMLNNLRRG ADELVLNDPA VQPDDDVFQS
LISFVAMVGL RLGELHVVLA AKTGDEAFSP VVSGDKEVEA MKKAVSGEVA YAMSKLDERE
QNADPAIDLL AAPLLERRSE LAELAGTLAE SCRHTLMTRT HGDFHLGQIL VSEGDAVIID
FEGEPAKNLT ERRAKTNPLR DVAGLLRSLS YLVATAQLDN DAVIEHDNEV RRKAIARFGR
NAEEAFLDAY WQAVSVSKEL DMPPDQRRRV LDAFLLEKAA YEIAYEARNR PKWLPIPLSG
LTEIVSRLAG VTA
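
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not the database's own tooling, of how the blocks could be joined, the GC fraction computed, and the DNA translated. It assumes the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in which codons may act as start codons); since this gene starts with ATG, no special start handling is included. All function names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGGACACGA TGAATGCAGA TAGCGCATCG CAGCCGCTCT GGTACAAGGA TGCAATCATC"
print(translate(first_line))  # MDTMNADSASQPLWYKDAII
```

The printed peptide matches the first twenty residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.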