Gene Rleg_6004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6004 
Symbol 
ID8016270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp31041 
End bp32315 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content56% 
IMG OID644827316 
ProductO-antigen polymerase 
Protein accessionYP_002978516 
Protein GI241258632 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.881514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATTATT TGAGTCAACT CAAGATGGCA GTGCTTTATC CGGAGGATGA CGCCGGGAGC 
GCGCTCGACC GAAACAACCG CATTGCCGTC TTTCTCTTTG CGATCCTTCC GGGGCTTTCG
CCGAACTTTA ACTCCTTCAT CCTCCTCGCG TCGATGGTGT GGGGCGTCTA CTGTCTGGCG
ACGGGCCGCC TCGCCTTGAA TCTGTCGAGA TGCGACCGGC TGGTTGCCAT TTTCATGTCG
ATCTACCCGC TGGTGATGAT CGCCAGCATT TTCATCAATC CCCCTTACTC CGAAGTACCG
GACTGGATAT TCCGGCTGCT ACCTTTCTTT TCCATCTGGC TGATATTGCC GCGGATGCGC
CAATCTCCCG ATGGCCGTCT CGTGCCGCTC TTTATCCTCG GCGCAGGCAT CGGTATGATC
GTCACCTTTC TTTTCAGTCT CCTGCAGATC ATGTTTCTGA TGGAGAGAGC AGAGGCTGGG
ACTTCGAACG CTGCGCTCCT GGGCGTGATA GGGATCCTGT TCGGCGGCAT CGCGCTTCTC
AACGTTCAAT CTCCCAAAAG CGTGGAGCAA AGAATTGCAA TATTGGGATA TGCCGCCGGT
CTGGGCTGCG TTCTGCTTTC GGGAACGCGC TCGGCCTGGC TGGTCATTCC CATCCATATC
GTCATCTTCC TCTGGTATTT TCGCAAACAC AGCTTCCATC TAAGTTTGCG CAGCCTAGCT
ATAACCAGCT CCTTGCTGTT GGCGGGACTT ATCACCCTGG GCAGCGGCCA GATCCTCCAT
CGCATCCAGG CGCTTCAGGA AAATCTGACA TCGCTCGAAC GCACCGACGG CGAGATCACG
TCGCTGAGTG CCCGATTCGC CTTGTACAAA GGCGCACTAT CGGCAATTAG TAAGGACCCG
TTGACGGGTT ATGGCCCGCA AAACCGGATG AGTTCGGTTT TGGCAGAGCT GCCCGATAAC
TTAAGGCCGC AGCTACCTTA CTCGCATGTC CACAACGGCT TTCTGACCGC AGGAATCGAC
GCCGGCGTTT TCGGTATTGC GGCACTTTCG CTGATGCTAC TGACACCGGT GATAGGCGCG
TTGAGAAAAG AAGCGGGACC GGGCCGGGAT TTAGCGATTG CGCTCGCTCT TCTGCTCGTC
AGCAGCTACG TCATAACGGG CAGCTTTGGC ATCATGTTCA ATCAGAAAGC TTTGGATCCG
ATCTTCGCTT ACCTCGTCGC CCTTATTTGT GCGGATCGCG GCAGCACGCG CTTTGCCCCC
GTCGTCCGGA GCTGA
 
Protein sequence
MNYLSQLKMA VLYPEDDAGS ALDRNNRIAV FLFAILPGLS PNFNSFILLA SMVWGVYCLA 
TGRLALNLSR CDRLVAIFMS IYPLVMIASI FINPPYSEVP DWIFRLLPFF SIWLILPRMR
QSPDGRLVPL FILGAGIGMI VTFLFSLLQI MFLMERAEAG TSNAALLGVI GILFGGIALL
NVQSPKSVEQ RIAILGYAAG LGCVLLSGTR SAWLVIPIHI VIFLWYFRKH SFHLSLRSLA
ITSSLLLAGL ITLGSGQILH RIQALQENLT SLERTDGEIT SLSARFALYK GALSAISKDP
LTGYGPQNRM SSVLAELPDN LRPQLPYSHV HNGFLTAGID AGVFGIAALS LMLLTPVIGA
LRKEAGPGRD LAIALALLLV SSYVITGSFG IMFNQKALDP IFAYLVALIC ADRGSTRFAP
VVRS