Gene Rleg2_0136 details


Gene Information

Locus tag: Rleg2_0136
Symbol:
ID: 6978846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 128036
End bp: 129451
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 62%
IMG OID: 643394847
Product: alpha,alpha-trehalose-phosphate synthase (UDP-forming)
Protein accession: YP_002279664
Protein GI: 209547747
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0380] Trehalose-6-phosphate synthase
TIGRFAM ID: [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming]
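
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the record; it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 128036
END_BP = 129451

def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1416 bp) and Protein Length (471 aa).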


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.502961
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGTC TGATCGTCGT TTCCAATCGT GTTCCCGTGC CGAGCAAGGA TGGGGGTGCT 
GCTGCCGGCG GCCTGGCCGT TGCGCTGCAG GCAGCCCTGC AGGAGCGCGG CGGCATCTGG
ATGGGGTGGT CGGGAGAGTC GAGCGGCGAT CGGGAACCGG GCCCGCTGTC GCAACAGCAG
AAGGGCAACA TCACCTATGC GCTGACCGAC CTCACCGACA CCGACGTCGA GGAATATTAC
CGCGGTTTTG CCAATCGCGT TCTCTGGCCG ATCTGCCACT ATCGCCTGGA CCTTGCCGAA
TATGGCCGGA AGGAAATGGC CGGCTATTTC CGCGTCAACC GCTTCTTCGC GCACCGGCTG
GCGCCGCTGA TCGAGCCGGA TGATATTATC TGGGTTCACG ATTACCACCT CATTCCGCTG
GCGGCGGAGC TGCGGCAGAT GGGGCTGAAG AACCGGATCG GCTTCTTTCT GCATATCCCC
TGGCCGCCGG CCGACATCCT CGTCACCATG CCGGTCCACG AAGAAATCAT GCGTGGGCTT
TCGCATTACG ATCTCGTCGG CTTCCAGACC GACTACGACC TGCAGAACTT CGCCGGCTAT
CTCAGAAGGG AAGGCATTGG CGACGACCTC GGCAATGGTT TGTTCGATTC GCACGGGCGG
GTCTTCAAGG CCGGCGCCTA TCCGATCGGC ATCGAAACCG CTGGATTCGC AGCATTCGCC
GAAAAAGCCG CAAACAACGT CATGGTCCAG AAGACCCGGC GCAGCATTGA AGGCCGCGAT
ATGATCATCG GGGTCGACCG GCTCGATTAT TCGAAGGGCA TCATTGAGCG GCTGGAGGCG
TTCGAGCGCT TCATCACCAG CAATCCGGCC TACCAGAACA AGGTGACCTT TCTGCAGGTC
ACGCCGAAAT CGCGATCGGA AGTTCCCGAA TACGAGCAGA TGCAGAAGAT GGTCGCCGAA
CAGGCAGGCC GCGTGAACGG CGCCATCGGC ACGGTCGACT GGGTGCCGAT CCGCTATGTC
AACCGATCGA TCAGCCGCAA CGTGCTCGCC GGCCTTTTCC GGCTGGCGAC GATCGGGCTG
GTCACGCCGC TGCGCGACGG CATGAACCTC GTCGCCAAGG AATATGTCGC CGCCCAGGAT
CCGGATCGCC CCGGCGTATT GGTGCTATCC CGCTTCGCCG GCGCGGCCCG CGAGTTGAAG
GGTGCGCTGC TGGTCAATCC TTATGACGTC GAAGGCACGG CCAATGCCGT CGCCAAGGGC
CTTGCCATGT CGCTTGCCGA ACGCCGCGAC CGCTGGAGGA TGATGATGGA CCATCTGCTC
TCGCACGACG TCTCGCTCTG GTGCAAAAAT TTCTTGGGGG ATCTTGTGGC TGGTCCCGAG
CTTCGGCCGG AGCATAGCTC CCGGACAGTC TCCTGA
 
Protein sequence
MSRLIVVSNR VPVPSKDGGA AAGGLAVALQ AALQERGGIW MGWSGESSGD REPGPLSQQQ 
KGNITYALTD LTDTDVEEYY RGFANRVLWP ICHYRLDLAE YGRKEMAGYF RVNRFFAHRL
APLIEPDDII WVHDYHLIPL AAELRQMGLK NRIGFFLHIP WPPADILVTM PVHEEIMRGL
SHYDLVGFQT DYDLQNFAGY LRREGIGDDL GNGLFDSHGR VFKAGAYPIG IETAGFAAFA
EKAANNVMVQ KTRRSIEGRD MIIGVDRLDY SKGIIERLEA FERFITSNPA YQNKVTFLQV
TPKSRSEVPE YEQMQKMVAE QAGRVNGAIG TVDWVPIRYV NRSISRNVLA GLFRLATIGL
VTPLRDGMNL VAKEYVAAQD PDRPGVLVLS RFAGAARELK GALLVNPYDV EGTANAVAKG
LAMSLAERRD RWRMMMDHLL SHDVSLWCKN FLGDLVAGPE LRPEHSSRTV S
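
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is not the tool that generated the record. It assumes the standard codon assignments, which translation table 11 shares for elongation codons, and it only exercises the first and last lines of the gene sequence rather than the full 1416 bp. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()

def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First 30 bases and last 36 bases of the gene sequence listed above.
first_30 = "ATGAGCCGTC TGATCGTCGT TTCCAATCGT"
last_36 = "CTTCGGCCGG AGCATAGCTC CCGGACAGTC TCCTGA"

print(translate(first_30))   # MSRLIVVSNR
print(translate(last_36))    # LRPEHSSRTVS
```

The two fragments translate to the first ten and last eleven residues of the protein sequence shown above. Passing the whole gene sequence to `gc_content` would give the figure to compare against the reported GC content.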