Gene Rleg2_0352 details


Gene Information

Locus tag: Rleg2_0352
Symbol:
ID: 6979066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 353905
End bp: 355710
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 68%
IMG OID: 643395064
Product: putative succinoglycan biosynthesis transport protein
Protein accession: YP_002279877
Protein GI: 209547960
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
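
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon (the record lists the gene as unspliced and not a pseudogene). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 353905
END_BP = 355710
GENE_LENGTH_BP = 1806
PROTEIN_LENGTH_AA = 601


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

If either comparison fails for another record, the usual suspects are a different coordinate convention or a partial CDS.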


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0570325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.193411
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGATA TCAGGGCCAG AAGCACCTTT CACGATCGCC CAGCCACCCG GCCGGCGGCG 
GCCGCACCGG CGCGCGCCCA CGACGATCGC TATGGACTGC CTGGGGGCGT GCCCCGGTCG
AACCGGCCCG ATATCGCGCC GCCCGAAGCC GAACTGCTGC GCGCCATCGG CCGCGCGCTG
GAAGAGCAGC GCGCCAAAGC GGCGCGCACG GCGCCGCTGG TCGATCGCAT CGAAACCATC
CTCGGCAATC ACCTGCGGGC GGCCAACGAC ATTGGCTATC CGCTTCCTCA GGAACAGCCG
CACGGTGACG AACCGGCTGC TCAGGCCGAT GCGCCGCTGA TCTCGCCGGA ACCGCTGGCC
GCGCTCGTTA AGGAACCGGC CGAGCCTCGC CCGGTAGCGC CGCCGCGGCG CGTCGCCGGC
AGCGCACTTG TGATGACGGT CGCCGCTGCG ACGATGATCG GCGCCTGCCT GCCGGTGCTG
ATGCCGGCTT CACCGGCCCT CTACCGTGCC GAAGCAACGC TTGCGGTGAA GACCGATGCC
GCAAGCCGGG CCGCCTTCAC CGAGGCTGCG GCGAAAGGGC TGATGTCGGC GCGGGTGGTT
GCTTCGACGG TTGCCGCCCT GAAACTCGAC CACGATCCCG AATTTGCCGG CCAGAGGGCC
AATGCGCTCG GCGTTGCGCT CGATCTGCTT TCGGCGACCG GTGCTGCCGC CGACCCGGCC
TCGCGGGCCG AGGCAACCTT GAAACATTCG GTCGAGATCC TGCCCGATGC CGCTGCCGGC
ACGATCCTCG TGCGGGTGAC GACCGGCGAC AGCGGAAAAT CCACGCGCAT CGCCGCAAGG
CTTGCCGAAG CGGTTTCCCC AGCGAACGGA ACCGGCGGCA ACACCGAAAG CGACGCCGCC
TTGCGCAAGG CCTATGACGA GGCGAAAGCA GAACTTGCAG CCTTCACGGC AAAGAGCGGC
GAGGGCAACG TCAAGGTGGC CGTCGATCTT CGCCGCCAGA TCGACCGGCT CGATGCCGAT
CTGAAACAGG CCGACCAGAC TATCCTGGAG GCCAAGGCGC AGGCCGACCG ATTGAAAGCC
GCAAAACTTG CCGGCGTGCT CGACGGTTCT CTCCCCTCCG ATATGCTCTC TCCGGCACTG
CAGGACTGGC GCGACAAATA TGCGGTCGCC AAGACAGCGC TTGCGCAGCT TTCGGCCGAA
CTCGGCCCGC GCCATCCGCG GCTGTTGCAG CAGCAGGCCG AAACCGATGG TCTGAAGGAG
AATATGGGCA AGGAACTTGC CCGTCTTGCT CAAGCCGCCA ACGCCGCCGC CAAGTTGGCC
GTCGATGCGC GCAAGGGCCT GAACGACCGG CGCAACACGC TGATTGCGCA AAGCCGGGAT
ACCGGCGTCG ATCTTGCCCG GCTGACCGAG CTCAGCGAGA AGGCGGCCGC CGCGCGTTCG
CGCCTCGACG ATACGGCCTC CGCTTTGGCG GGAACGGCCG GCGACGGCCA TATCACTCTG
ATGAAGCCGG CGTTGGCAAC CGCGGTATCG GCGCCCGACG GCCTGACTGG CCGCTCGCTG
GCCGGTGCTG CCGCGGGTCT CGCCGCCGGT CTTGCTGCGG CTTTCCTGCT GCGCCTGCGT
AAACCCTTGG CCGCAGCCGA AGAGGAAATG CCGCCGTCCC AAGCGCTGTC CCAACCGCTA
TCTTCGCCGG CACCGCAACC GGTCCCGGCC GAGCTCGACG AGATGGAGGC GCTGCGCTCC
GAAATCTCCG GCCTGCGCGA CCGGCTTCTT GTTCATGGCC TCGACGCGCG GCAGCCGCTG
CGCTGA
 
Protein sequence
MYDIRARSTF HDRPATRPAA AAPARAHDDR YGLPGGVPRS NRPDIAPPEA ELLRAIGRAL 
EEQRAKAART APLVDRIETI LGNHLRAAND IGYPLPQEQP HGDEPAAQAD APLISPEPLA
ALVKEPAEPR PVAPPRRVAG SALVMTVAAA TMIGACLPVL MPASPALYRA EATLAVKTDA
ASRAAFTEAA AKGLMSARVV ASTVAALKLD HDPEFAGQRA NALGVALDLL SATGAAADPA
SRAEATLKHS VEILPDAAAG TILVRVTTGD SGKSTRIAAR LAEAVSPANG TGGNTESDAA
LRKAYDEAKA ELAAFTAKSG EGNVKVAVDL RRQIDRLDAD LKQADQTILE AKAQADRLKA
AKLAGVLDGS LPSDMLSPAL QDWRDKYAVA KTALAQLSAE LGPRHPRLLQ QQAETDGLKE
NMGKELARLA QAANAAAKLA VDARKGLNDR RNTLIAQSRD TGVDLARLTE LSEKAAAARS
RLDDTASALA GTAGDGHITL MKPALATAVS APDGLTGRSL AGAAAGLAAG LAAAFLLRLR
KPLAAAEEEM PPSQALSQPL SSPAPQPVPA ELDEMEALRS EISGLRDRLL VHGLDARQPL
R
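
The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch, assuming only the Python standard library; `clean_sequence` and `gc_fraction` are hypothetical helper names. It uses just the first and last lines of the gene sequence, so the GC fraction it prints applies to that fragment only, not to the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTACGATA TCAGGGCCAG AAGCACCTTT CACGATCGCC CAGCCACCCG GCCGGCGGCG"
last_line = "CGCTGA"

fragment = clean_sequence(first_line)

if __name__ == "__main__":
    print(len(fragment))                              # bases on one full line
    print(fragment.startswith("ATG"))                 # start codon present?
    print(clean_sequence(last_line).endswith("TGA"))  # stop codon present?
    print(round(gc_fraction(fragment), 2))            # GC of this fragment only
```

To check the reported GC content for the whole gene, pass the full gene sequence text through `clean_sequence` and then `gc_fraction`.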