Gene Rleg_4546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4546 
Symbol 
ID8015942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4673232 
End bp4674239 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content64% 
IMG OID644827123 
Productglycosyl transferase family 2 
Protein accessionYP_002978323 
Protein GI241207227 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.947898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.992745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGAT CGCGCACCGA TAATCTGAAT ATTGCCGTTC TGCTTCCCTG CTATAATGAA 
GCGGCGACGA TCAGCGCTGT CGTGCAGGGC TTCCGGGCGA CGCTGCCCGA TGCTGCGATC
CACGTCTACG ACAACAATTC CACCGACGGC ACCGCGCTGC AGGCAATGCT TGCCGGCGCG
CATGTCGTGC GCGAGCGGCG TCAGGGCAAG GGCCATGTCG TGCGCCGGAT GTTCGCCGAT
ATCGACGCCG ACATCTACAT CATCGCCGAC GGCGACGGCA CCTATGCGCC TGAGGATGCC
GAAGAACTGG TGCGCACGCT GCTCACCGAG CGCGCCGACA TGGTGGTCGG CACCAGGCGC
GGCGTGCACG CCGATGCCGG CCGTCAGGGC CATGCGCTCG GCAACCGGCT TTTCAACCTG
CTCTACCGGA TGATCTTCGG CCCCGACTTC ACCGATATCT TTTCCGGCTA CCGCGCGTTT
TCGCGCCGCT TCGTCAAGAG TTTCCCGGCG GTATCCGGCG GCTTCGAGAT CGAAACCGAG
ATGTCGGTGC ATGCCTCCCG GCTGAAGCTG CCGGTCAGCG AGCTGGAGCT CGACTATGGC
CGCCGGCCGG AAGGCTCGCA TTCCAAGCTT TCGACATTCC GCGACGGCGC CAAGATCCTC
TGGATGTTCG CGATGCTGAT GAAGGAAACC CGGCCCTTCG CCTTTTTCAG CGCGATCAGC
GCCACCTTCA TGCTGGCGAG CCTCGGCTTC ATGGCGCCGG TGCTGGCGGA ATATTTCGAA
ACGGGTCTCG TCAGCCGCAT GCCGACCTGG GTGCTGTCGA CGGCGCTGCT GATGATCTCC
TTCATGCTAT TCACCGCCGG CGTCATTCTG GATTCCGTTG CGCGTGCCCG CGCCGAACAG
CTTCGTATCC ATTATATGGG CCTCGAAAGG CCGAGCGCGT TGAAGGCACC GCTCAGCGAC
GCAGGGCCGG TATCGCGTGC GCGTCCCGGC AAGGCGGATG CCGCATGA
 
Protein sequence
MARSRTDNLN IAVLLPCYNE AATISAVVQG FRATLPDAAI HVYDNNSTDG TALQAMLAGA 
HVVRERRQGK GHVVRRMFAD IDADIYIIAD GDGTYAPEDA EELVRTLLTE RADMVVGTRR
GVHADAGRQG HALGNRLFNL LYRMIFGPDF TDIFSGYRAF SRRFVKSFPA VSGGFEIETE
MSVHASRLKL PVSELELDYG RRPEGSHSKL STFRDGAKIL WMFAMLMKET RPFAFFSAIS
ATFMLASLGF MAPVLAEYFE TGLVSRMPTW VLSTALLMIS FMLFTAGVIL DSVARARAEQ
LRIHYMGLER PSALKAPLSD AGPVSRARPG KADAA