Gene Rleg_4120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4120 
Symbol 
ID8014917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4201615 
End bp4202793 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content63% 
IMG OID644826690 
Productglycosyl transferase group 1 
Protein accessionYP_002977900 
Protein GI241206804 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGAACGA TATACCGCTT TCTGCGCGCC CATGTTCTGC AGCGGCTGAT CCCGCGGTCG 
CGTCTTGCCT TCAATCCGCG CAGGCCGGTG GAAATCGTCG GCTATCTCTC GATGGCGGTC
GGCGTTGGCG AATCGGCAAG GCTCTGCGCC GGCGCATTGA CGGAAGCTGG GCGGGCGATT
TCGCTCTCCG ACGTCAGCAC GCATCCTGAC GAGAATTCCT TCGCCGGATG GACACCGTCG
CATCTCTCCG CCGAACCCGC GGGGAGCCGG ATCTGGCATC TCAATCCGCC GATGCTGCCG
CGCGCGATCC TGAAGAAGGG CGTTGCGAAT TTCACCCGCG CTTTCAACAT CGGCTATTTC
GCCTGGGAGC TCGAAGTCGT GCCGGCGGAG TGGCGCAATG CGATGCATTA CATGAATGCC
GTCTTCGTGC CGTCGGAATT CACCAGGCGG GCGATTGCGC CCCTCACTGC GGCACCGGTC
ATCGTCGTTC CGCATCCCGT CACCGAGAAG CCGGCGACCG AAGGAATGCG CCAGAAATTC
GGCATCGAGG AAGACGCGTT TCTCGTCAGC TTCATCTTCA GCGCCGGGTC CTCGATCAAC
CGGAAGAATC CGCAGGCCGT CATCGAGGCC TTCAGGATAT TTGCCGCCGA ATGCCCCAGC
GCCTTCCTGT TGATGAAGGC CAGCGGCAAT ATCGACAAGG ATGAAGGCCT GCGAGAACTG
ATCGGCTCGG TCGCTGGCGA CAGCCGGATC AGGATCGTCA CCGACAGGCT GTCGGACTCC
GAGATCAACG GCCTGATCCG CTGTTCCGAC GCCTATCTTT CGCTGCATCG TTCCGAGGGT
TTCGGGCTGA CGGTGGCCGA GGCGATCATG CAGCGTACGC CGGTCGTTTC CACGGCCTGG
TCGGGCACGG TGGATTTCTG CGATCCCGAC AATAGCTGGC TGGTTGCCTC TCCCCTCATT
CCGGTGGTCG ATACCCATCC CGAATTTGCC GGGCTCGAAG GCGCGGTCTG GGCCGATCCC
TCACCCGAGG CAGCAGCCGG GCATCTGAAG GATATCTTCC TCGCGCCTGA GCGAGCGCGT
GAGAAGGCCG AGAAGGCGCG GGAGTTCCTG CTGCGCTACC TCGCCGAGAA CAGCTATGAC
AAGGCGCTCA AGGCGCTGGA GGCGATGCAG ACCGCCTAA
 
Protein sequence
MRTIYRFLRA HVLQRLIPRS RLAFNPRRPV EIVGYLSMAV GVGESARLCA GALTEAGRAI 
SLSDVSTHPD ENSFAGWTPS HLSAEPAGSR IWHLNPPMLP RAILKKGVAN FTRAFNIGYF
AWELEVVPAE WRNAMHYMNA VFVPSEFTRR AIAPLTAAPV IVVPHPVTEK PATEGMRQKF
GIEEDAFLVS FIFSAGSSIN RKNPQAVIEA FRIFAAECPS AFLLMKASGN IDKDEGLREL
IGSVAGDSRI RIVTDRLSDS EINGLIRCSD AYLSLHRSEG FGLTVAEAIM QRTPVVSTAW
SGTVDFCDPD NSWLVASPLI PVVDTHPEFA GLEGAVWADP SPEAAAGHLK DIFLAPERAR
EKAEKAREFL LRYLAENSYD KALKALEAMQ TA