Gene Rleg_0456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0456 
Symbol 
ID8011656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp474468 
End bp475568 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content56% 
IMG OID644823050 
Productglycosyl transferase group 1 
Protein accessionYP_002974304 
Protein GI241203208 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAATTA GTATTGACGC AACCGGATTG GGCGGCCCCA AGACAGGAAC ATCAGTCTAT 
CTTATAGAGA TTTTGTCGCG CTGGAGCCGC AATACGTCCA TCAACCACGA GTTCACGATC
TTCGCGAGCG AGAAGGCCGT TTCCCTCTGC TCGGAAGCCG GATTGGACCA TCGGTTTCGT
TTCGTCCGCG CGCCCAACAA CCGCCATATC AGAGTGATCT GGCAGCAGCT AATGATTCCG
TGGCATATGC GCCGACTTGG AATCGATGTG CATTGGGGGA CGGCCTTCGT ATTACCGGTG
GCTTCGCAAA GGCCAATGGC CGTTACAATA CATGACCTAA CCTTCCAACT GTTTCCCGAG
GTGCACGAGC GCTTAAAGCG CTTTTACTTT CCGGCTATTA TGCAACGTTC AGTGGCAAAG
GCGCAGGCTG TATTTGCGGT GTCTCGGACC ACAGAAACGG ACCTAAAACG CATCATTCCA
GAGAGTAGAG GAAAGACAAC CGTCACGCTG CTGGCTGCAC GCAAGCTGGG CTCGGATTCG
CAGGCTCCCC GCGACCAACG TAACTCAGGC GACTACCTGC TCTTCGTCGG AACCTTAGAG
CCACGAAAGA ATCTTCCACG ATTGCTGGCC GCCTGGCAGA TGCTCGATGA TGCCACCCGG
GGCAACACGC GGCTTGTTAT CGTCGGCGCC ACGGGATGGA TGGTAAGCGA CTTGCTACAA
AGCCTCAAGA CGAACGATAC CATAGATTTT CTGGGGCACG TCAGCGATTC TTCTCTAGCA
GAACTGATGC AAGGCGCTAG GGCCCTTCTC TATCCATCAC TCTACGAGGG GTTTGGTTTG
CCGGTGGTTG AAGCGATGGC GCGCGGAATA CCGCTGTTGA CCAGCAATAC CGGCGCTACC
GCGGAGATCG CCGAAGGCGC GGCGATCCTT GTCGACCCGA CGAATGTGGA TGACATCCGT
GGCGGACTTG TGAGGCTGCT GACGGAACCA GAGCTGCTTG GCGCCCTGTC CGCCCAAGGC
CGCGAGCGGG CAAAATCATT CTCCTGGGAA CGCACGGCCC AACTGACATT GGAAACCCTG
GAAGGGTTGA AGCGAGCATG A
 
Protein sequence
MRISIDATGL GGPKTGTSVY LIEILSRWSR NTSINHEFTI FASEKAVSLC SEAGLDHRFR 
FVRAPNNRHI RVIWQQLMIP WHMRRLGIDV HWGTAFVLPV ASQRPMAVTI HDLTFQLFPE
VHERLKRFYF PAIMQRSVAK AQAVFAVSRT TETDLKRIIP ESRGKTTVTL LAARKLGSDS
QAPRDQRNSG DYLLFVGTLE PRKNLPRLLA AWQMLDDATR GNTRLVIVGA TGWMVSDLLQ
SLKTNDTIDF LGHVSDSSLA ELMQGARALL YPSLYEGFGL PVVEAMARGI PLLTSNTGAT
AEIAEGAAIL VDPTNVDDIR GGLVRLLTEP ELLGALSAQG RERAKSFSWE RTAQLTLETL
EGLKRA