Gene Rleg2_5383 details


Gene Information

Locus tag: Rleg2_5383
Symbol:
ID: 6978477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 1016306
End bp: 1019587
Gene Length: 3282 bp
Protein Length: 1093 aa
Translation table: 11
GC content: 63%
IMG OID: 643394485
Product: trehalose synthase
Protein accession: YP_002279303
Protein GI: 209547385
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases; [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase
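The reported gene and protein lengths follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the reported 3282 bp) and that the CDS ends in a single stop codon. The function names are illustrative only and are not part of the source database.

```python
# Coordinates copied from the record above.
start_bp = 1016306
end_bp = 1019587


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 3282
print(protein_length(length_bp))  # 1093
```

Both values agree with the Gene Length and Protein Length fields of the record.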


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0116468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.915163
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACGA TGAACCCCAA TGGCATTGAG CAGCCGCTCT GGTACAAGGA TGCGATCATC 
TACCAGCTGC ACATCAAGTC GTTCTACGAC GCCAATGGCG ACGGCGTCGG CGACTTTGCC
GGCCTGCACG CCAAGCTCGA TCACATCGCC GCGCTCGGCG TCAATGCCAT CTGGCTGCTG
CCTTTCTTTC CCTCGCCGCG CCGCGACGAC GGCTACGACA TCGCCGACTA CGGCAATGTC
AGCCCCGACT ACGGCACCTT GGAGGACTTT CGCGCCTTCG TCGACGCCGC CCACCAGCGC
AATATCCGCG TCATCATCGA GCTCGTCATC AACCACACCT CGGATCAGCA TCCCTGGTTC
CAGCGTGCCC GTCAGGCCCC CGCCGGATCA CCGGAGCGCG ATTTTTACGT CTGGTCGGAT
ACCGATCAGA AATTCCCGGA AACGCGCATC ATTTTCATCG ATACGGAAAA ATCAAACTGG
ACATGGGATG CCGTGGCCGG CGCCTATTAC TGGCATCGCT TTTATTCCCA CCAGCCTGAC
CTCAACTTCG ACAGCCCCCT GGTCATGGAG GAACTGCTGA AGGTGATGCG CTTCTGGCTG
GAAACCGGTA TCGACGGCTT CCGCCTCGAC GCGATCCCCT ACCTTGTCGA ACGCGAGGGG
ACGATCAACG AAAACCTGCC CGAAACCCAT GCGATCCTCA AGCGCATCCG TGCCGCCCTC
GACGCCACCC ATCCCGGCGT GATGTTGCTC GCCGAGGCCA ATCAATGGCC GGAGGATACG
CGCGAATATT TCGGCGACGG CGACGAATGC CACATGGCCT TCCACTTCCC GCTGATGCCG
CGCATGTACA TGGCGATCGC CAAGGAGGAT CGGTTTCCGA TCACCGATAT CCTGCGCCAG
ACGCCGGAGA TCCCGGAGAA TTGCCAATGG GCGATCTTCC TGCGCAATCA CGACGAACTG
ACGCTCGAAA TGGTGACGGA CGCCGAGCGT GATTATCTCT GGGAAACCTA CGCATCCGAT
AAGCGCGCCC GCATCAATCT CGGCATCAGG CGGCGCCTGG CGCCGCTGAT GGAGCGCGAC
CGCCGGCGGA TCGAGCTGAT GAATGCGCTG CTGCTCTCGA TGCCGGGAAC GCCGGTCATC
TATTACGGCG ACGAGATCGG CATGGGCGAC AATATCTATC TCGGCGACCG GGATGGGGTG
AGGACGCCGA TGCAATGGTC TCCCGACCGC AATGGCGGCT TCTCCAGAGT AGACCCGGCG
CGACTCGTCC TGCCGCCGGT CGCCGATCCG CTCTACGGTT TCGAGGCCGT CAACGTCGAG
GCGCAGAGCA CGGACGCGCA TTCGCTGCTC AACTGGACGC GCAAGATGCT GGCGCTGCGC
GGGCGGCATC CGGCCTTCGG CCGCGGTTCG TTGCGGTTTC TCGCGCCGGA GAACCGCAAG
ATCCTCGCCT ATCTCAGGGA GTATGAAGGC GAGACCCTGA TGTGTGTCGC AAATCTCTCG
CGGCTGCCCC AGGCCGTCGA ACTCGACCTC TCAGCCTTTG AGGGCCGCGT GCCGATCGAA
TTGACCGGCA TGTCGCCCTT TCCGCCGATC GGCCAGCTGA CCTATCTGCT GACACTGCCG
CCATACGGAT TTTTCTGGTT CCAGCTGGAG GCCGATGCCG ACCCGCCGGC ATGGCGCACC
GCGCCGCCGG AGCAGCTTCC CGATCTGGTG ACGATGGTCA CCCGCCGCGG CCTGCTCGAT
CTTGTCGATG AACCCAAGCT TGCACGCGTT CTCAGCAATG AAATTCTGCC GGCCTATCTG
GCGAAGCGGC GGTGGTTCGG GGCGAAGGAC CAGCCGCTGC AGGCGGCGCG TCTGATTTCC
GCAACGCCGA TCCCCTTTGC CGACGGGATC GTTCTCGGCG AACTGGAGGC CGTGCTGCCG
AACCATAGCG AATCCTACCA ACTGCCGCTG GCGGTCGCCT GGGACGATGC GCAGCCATCG
GTGCTCAGCC AGCAGCTTGC GCTCGGGCGG GTCCGCCAGG GCCGGCGCGT CGGCTTCCTG
ACGGACGGAT TTGCCGTGGA GGCGATGGCG CGCGGCATCC TGCGCGGGCT TGGCGACCGC
TCGCGCACCA CCGGCCGCAC CGGCACGCTT GAATTTCTCG GCACGGAACG GCTCGATCGT
CTCGCCGTCA CCGACGACCT TCCGGTCCAT TGGCTCTCGG CTGAACAGTC CAACAGTTCG
CTGATCGTCG GCGATCTCGC GATGATCAAG CTGATCAGGC ACATCTTTTC TGGCATTCAT
CCGGAGGTCG AGATGACGCG TTATCTCACC CGTGCCGGCT ACGATCACAC GGCGCCCTTG
CTCGGCGAGG TCGCGCACAC GGACTCCAGT GGACGCCGCT CGACATTGAT CATCGTCCAA
GGCGCGATCC GCAATCAGGG CGACGCCTGG AACTGGATGC TGAACAACCT GCGCCGCGGG
GCAGACGAAC TCGTGCTGAA CGACCCGGCC GTCCAGCCGG GCGACGACGT CTTCCAGCCA
CTGATCAGCT TCGTTGCGAT GGTCGGCATG CGGCTCGGAG AACTGCATGT CGTGCTCGCC
GGGGAGACTG CGGACGAAGC CTTCAGCCCG GTAATCGCCG GCGACAGCGA AGTCGAGGCG
ATGAAGAAGG CGGTTGCCGG CGAAGTCGCC TATGCCATGT CGAAACTTGC CGAACGGGAG
GAGAATGCCG ATCCGGCGAT CGACCTGCTC GCTGCTCCGT TGCTCAAGCG CCGCTCCGAA
CTCGTCGAGC TTGCCGCGAC GCTGACCGAG GCCGCGCGCC ATACGCTGAT GACGCGCACG
CATGGCGATT TTCACCTTGG TCAGATCCTC GTCAGCGAGG GAGATGCCGT CATTATCGAT
TTCGAGGGCG AGCCGGCGAA AAACCTGGCC GAACGCCGCG CCAAGACCAA TCCGCTGCGT
GATGTCGCCG GGCTTTTGAG GTCACTGAGT TATCTCGTCG CCACCGCCCA GCTTGATAAC
GACGCCGTCA TCGAACACGA GAACGAAGTC CGACGCGAGG CCATCGCCCG CTTCGGACGC
CAGGCGGAAG AGGCGTTTCT CGATGCCTAT TCGCAGGCGG TTTCGGCATC GAAGGCGCTA
GATATGCCAC CCGATCAACG ACGGAGGGTC CTCGATGCCT TTCTTCTCGA AAAGGCCGCT
TATGAAATCG CCTATGAAGC CCGCAACCGG CCGAAATGGC TTCCGATCCC GCTTTCCGGC
CTCACCGAAA TCGTCTCGCG TCTGGCGGGG GTCCCGGCAT GA
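The gene sequence above is printed in blocks of 10 bases, 60 per line, so it needs the whitespace removed before any analysis. The following is a minimal sketch of how the sequence could be cleaned and its GC content computed for comparison with the reported 63%. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGACACGA TGAACCCCAA TGGCATTGAG CAGCCGCTCT GGTACAAGGA TGCGATCATC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

Passing the full 3282-base sequence to `gc_fraction` gives the whole-gene value; a single line can differ noticeably from the gene-wide figure.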
 
Protein sequence
MDTMNPNGIE QPLWYKDAII YQLHIKSFYD ANGDGVGDFA GLHAKLDHIA ALGVNAIWLL 
PFFPSPRRDD GYDIADYGNV SPDYGTLEDF RAFVDAAHQR NIRVIIELVI NHTSDQHPWF
QRARQAPAGS PERDFYVWSD TDQKFPETRI IFIDTEKSNW TWDAVAGAYY WHRFYSHQPD
LNFDSPLVME ELLKVMRFWL ETGIDGFRLD AIPYLVEREG TINENLPETH AILKRIRAAL
DATHPGVMLL AEANQWPEDT REYFGDGDEC HMAFHFPLMP RMYMAIAKED RFPITDILRQ
TPEIPENCQW AIFLRNHDEL TLEMVTDAER DYLWETYASD KRARINLGIR RRLAPLMERD
RRRIELMNAL LLSMPGTPVI YYGDEIGMGD NIYLGDRDGV RTPMQWSPDR NGGFSRVDPA
RLVLPPVADP LYGFEAVNVE AQSTDAHSLL NWTRKMLALR GRHPAFGRGS LRFLAPENRK
ILAYLREYEG ETLMCVANLS RLPQAVELDL SAFEGRVPIE LTGMSPFPPI GQLTYLLTLP
PYGFFWFQLE ADADPPAWRT APPEQLPDLV TMVTRRGLLD LVDEPKLARV LSNEILPAYL
AKRRWFGAKD QPLQAARLIS ATPIPFADGI VLGELEAVLP NHSESYQLPL AVAWDDAQPS
VLSQQLALGR VRQGRRVGFL TDGFAVEAMA RGILRGLGDR SRTTGRTGTL EFLGTERLDR
LAVTDDLPVH WLSAEQSNSS LIVGDLAMIK LIRHIFSGIH PEVEMTRYLT RAGYDHTAPL
LGEVAHTDSS GRRSTLIIVQ GAIRNQGDAW NWMLNNLRRG ADELVLNDPA VQPGDDVFQP
LISFVAMVGM RLGELHVVLA GETADEAFSP VIAGDSEVEA MKKAVAGEVA YAMSKLAERE
ENADPAIDLL AAPLLKRRSE LVELAATLTE AARHTLMTRT HGDFHLGQIL VSEGDAVIID
FEGEPAKNLA ERRAKTNPLR DVAGLLRSLS YLVATAQLDN DAVIEHENEV RREAIARFGR
QAEEAFLDAY SQAVSASKAL DMPPDQRRRV LDAFLLEKAA YEIAYEARNR PKWLPIPLSG
LTEIVSRLAG VPA
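The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons) and that the gene begins with ATG, as it does here. The names `CODON_TABLE` and `translate` are illustrative; the code does not handle ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGGACACGA TGAACCCCAA TGGCATTGAG"))  # MDTMNPNGIE
print(translate("GTCCCGGCAT GA"))                     # VPA
```

The two outputs match the first ten and last three residues of the protein sequence listed in the record.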