Gene Rleg2_0414 details

Gene Information

Locus tag: Rleg2_0414
Symbol:
ID: 6979129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 425574
End bp: 426542
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 58%
IMG OID: 643395127
Product: putative glycosyltransferase spore coat polysaccharide biosynthesis protein
Protein accession: YP_002279939
Protein GI: 209548022
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase
TIGRFAM ID:
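
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the 969 bp include the stop codon; both assumptions are consistent with the reported values but are not stated by the record. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 425574
END_BP = 426542
GENE_LENGTH_BP = 969
PROTEIN_LENGTH_AA = 322


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


print(span_length(START_BP, END_BP))     # 969
print(coding_length_aa(GENE_LENGTH_BP))  # 322
```

Both results agree with the Gene Length and Protein Length fields.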


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.407624
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.45905
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGTCT TCTGCATAGA GAGTTCGCAT GCACGCGGGA TGGGGCATCT GTTTCGGTCG 
CTGACGCTCG CCACCGAACT GCGTTCGCGC GGTCATTCGG TCCGTTTCGC GGCGAATGAT
CATCCGAATT CGCTGAGGAT CATTCGGGAG CGCGGCTTTG ACGTTGCGCT TTACGATCTC
GCCGCCGTCA CTGGATGGGA GGAGGGTCTC GTCGATCCCA CTACCGTTCC GTCGCCGATC
TGGATCAACG ACCGCCTCGA TACGAGAAGA CCTCACAGCG AAACGATCAA GCGTTTGGGC
GCCAAACTCG TGACTTTTGA TGATCGCGGC GATGGCGCTG AACTTGCCGA CATGAATATC
TGCGCTCTTC TTTTCGAAAA GACGGAGGAT CTGAAGGGCG AAGATATCCG GCTGGGGGTG
GAGTACATGA TACTCAATCC TGAAATCGAG AGATATCGCA GAGTTCGGCA AAGCCTTGCA
TCGATACTCG TCACACTCGG GGGCGCCGAT ACCTACGGAG TGACGGTCCG CGTCGCCAAA
TGGCTGAGCA GCAAGCCTTT TCCTGTCACC ATCGTCACAG GCCCGAGCTT CCAGCATATG
GCGGAGCTTG AAGAGGTCGT CTCGACCGCA GAGCCGGATC GGTTCAAGCT GCTGAATCAG
GTGCCGTCGC TTGCGGCAGA GATGTACGGG CACGATCTGG CGATTACCGG CGGTGGCGTT
ACGCCCTTCG AAGCCTGTGC GGCCGGCCTG CCGTGCGTGG TGATCGCCAA CGAACCTTTC
GAAATCCCGG TCGGCCGTGC TCTTGAAGGA TTGGGGGCCG CGTTCTTTGC CGGACATCAC
TCTGAATTCG ATCTCGGCAT CCTGGAAAAG GCGATTCCGA TCAGGAGCAT GAGCGAGACT
GCCATGACCA AGGTCGACCT CGGTGGGGTC GGGCGTATTG CCGCTTTGCT GGAAAGATTG
GCTGCATGA
 
Protein sequence
MFVFCIESSH ARGMGHLFRS LTLATELRSR GHSVRFAAND HPNSLRIIRE RGFDVALYDL 
AAVTGWEEGL VDPTTVPSPI WINDRLDTRR PHSETIKRLG AKLVTFDDRG DGAELADMNI
CALLFEKTED LKGEDIRLGV EYMILNPEIE RYRRVRQSLA SILVTLGGAD TYGVTVRVAK
WLSSKPFPVT IVTGPSFQHM AELEEVVSTA EPDRFKLLNQ VPSLAAEMYG HDLAITGGGV
TPFEACAAGL PCVVIANEPF EIPVGRALEG LGAAFFAGHH SEFDLGILEK AIPIRSMSET
AMTKVDLGGV GRIAALLERL AA
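
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustrative example, not the tool that produced the record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons). The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTTCGTCT TCTGCATAGA GAGTTCGCAT"))  # MFVFCIESSH
# The last 18 bases end in the TGA stop codon.
print(translate("GAAAGATTG GCTGCATGA"))  # ERLAA
```

Both outputs match the corresponding ends of the protein sequence listed above.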