Gene Rleg2_4215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4215 
Symbol 
ID6982988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4392767 
End bp4394005 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content57% 
IMG OID643398946 
Productglycosyl transferase family 2 
Protein accessionYP_002283703 
Protein GI209551786 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCGGCA TCAAACTCAG CATTTGCATC CCGACCTACA ATCGTGAAGC CTATCTCAGA 
AACTCGCTGA CCTACTGCGA GAACGACTAC AGGTTCGACT TTCCCTTTGA AGTCGTCATT
TGCGACAATG CCTCCACGGA TGGTACGCAG CAGGTGGTCG AGGAATTCAT CAGCCGCGGG
CTGCCGATCC GCTACTACAA GCGTGAAACC AACGCCGGCG CCGCGGCGAA CGTCACGAGC
GCTCTGCGCC TCGGCAAGGG CGAATATCTC ATCTACCTGG CCGACGACGA TATCCTGATT
GCCGATGCGG TGGCCGATAC CGTCTTATAT CTCGACAACA ATCCGGAAGT GACCTGCGCC
CATGCGCCGT GGTTCCTCTA CGACGAAGTC GCTAAAACCG ACATCATGAA GTTCTACAAT
GTCGAGGAAG ATCGGAAATT TCAGCGCGGC AGCTTTGGCG ACGTCTTCCA ATATCTCTGC
GAACGCCACA TCTTTCCGGA AATCGCGATC TACCGCTCGT CAACGCTGCG GTCGGCCTGG
GTCCCGCGGG AGTTCTGCTT CTACCCGTTC GCGTTTTTTG CGCATTTTCT CGATCAGGGC
GCGGTTACTT TCCTGCAGCG CCCGTTCTAC CGCTCGATCG CCAATTCGGC GATCACCCGC
GATCGCCCGC AGGAAGGCAC CAATGACGTC ATGACGAGCT GGGATCGCTA TCGTGGCGGG
CTTGAATATT TCCTCTATAC GGGCGTCAGG CGCGGGGCGC TGGCCCTGAC GCCCGAGACG
CGTCTCAAAT ATGACGAGAT GTGCAGGATC TTCACGCTCA ATCGAATGGC GGTCGCCTTC
CGCTTCTGGG CAGAGCGCAA AAATTTCATC AAGGCCTATG AACTCTATAC CCGCATCATG
TGGGGCGGGA TGCTCGACCA CCCGGAAATC CGCGCCTTCC GTGAAAGGCT ACCCCTGATG
GTCGTCATTC AGACGCTGGT GAGCGAAGTG AATTCGGCAA TCGGTATCGA TACGCTGCTT
CTTGCCGGCT TCTCGGAAAT CGCGGTGCTC GAAGACCTGA TGCGCGAACT CGGTCTCAAT
GAAAAAGTAA GGTTTACCAC AGAACTCAGC GACCGCGCGC TCGATAGCAC CGCGGTCTTC
GTCACCGTCG ACAGAGACCG GGAATATTTC GTAGCCCTCG GCTACCTGCC CAATCTGGTG
TTCCACGAGC ACGATCTTGC CCGGCACATT ATCATGTGA
 
Protein sequence
MSGIKLSICI PTYNREAYLR NSLTYCENDY RFDFPFEVVI CDNASTDGTQ QVVEEFISRG 
LPIRYYKRET NAGAAANVTS ALRLGKGEYL IYLADDDILI ADAVADTVLY LDNNPEVTCA
HAPWFLYDEV AKTDIMKFYN VEEDRKFQRG SFGDVFQYLC ERHIFPEIAI YRSSTLRSAW
VPREFCFYPF AFFAHFLDQG AVTFLQRPFY RSIANSAITR DRPQEGTNDV MTSWDRYRGG
LEYFLYTGVR RGALALTPET RLKYDEMCRI FTLNRMAVAF RFWAERKNFI KAYELYTRIM
WGGMLDHPEI RAFRERLPLM VVIQTLVSEV NSAIGIDTLL LAGFSEIAVL EDLMRELGLN
EKVRFTTELS DRALDSTAVF VTVDRDREYF VALGYLPNLV FHEHDLARHI IM