Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0352 |
Symbol | |
ID | 6979066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 353905 |
End bp | 355710 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643395064 |
Product | putative succinoglycan biosynthesis transport protein |
Protein accession | YP_002279877 |
Protein GI | 209547960 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0570325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.193411 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGATA TCAGGGCCAG AAGCACCTTT CACGATCGCC CAGCCACCCG GCCGGCGGCG GCCGCACCGG CGCGCGCCCA CGACGATCGC TATGGACTGC CTGGGGGCGT GCCCCGGTCG AACCGGCCCG ATATCGCGCC GCCCGAAGCC GAACTGCTGC GCGCCATCGG CCGCGCGCTG GAAGAGCAGC GCGCCAAAGC GGCGCGCACG GCGCCGCTGG TCGATCGCAT CGAAACCATC CTCGGCAATC ACCTGCGGGC GGCCAACGAC ATTGGCTATC CGCTTCCTCA GGAACAGCCG CACGGTGACG AACCGGCTGC TCAGGCCGAT GCGCCGCTGA TCTCGCCGGA ACCGCTGGCC GCGCTCGTTA AGGAACCGGC CGAGCCTCGC CCGGTAGCGC CGCCGCGGCG CGTCGCCGGC AGCGCACTTG TGATGACGGT CGCCGCTGCG ACGATGATCG GCGCCTGCCT GCCGGTGCTG ATGCCGGCTT CACCGGCCCT CTACCGTGCC GAAGCAACGC TTGCGGTGAA GACCGATGCC GCAAGCCGGG CCGCCTTCAC CGAGGCTGCG GCGAAAGGGC TGATGTCGGC GCGGGTGGTT GCTTCGACGG TTGCCGCCCT GAAACTCGAC CACGATCCCG AATTTGCCGG CCAGAGGGCC AATGCGCTCG GCGTTGCGCT CGATCTGCTT TCGGCGACCG GTGCTGCCGC CGACCCGGCC TCGCGGGCCG AGGCAACCTT GAAACATTCG GTCGAGATCC TGCCCGATGC CGCTGCCGGC ACGATCCTCG TGCGGGTGAC GACCGGCGAC AGCGGAAAAT CCACGCGCAT CGCCGCAAGG CTTGCCGAAG CGGTTTCCCC AGCGAACGGA ACCGGCGGCA ACACCGAAAG CGACGCCGCC TTGCGCAAGG CCTATGACGA GGCGAAAGCA GAACTTGCAG CCTTCACGGC AAAGAGCGGC GAGGGCAACG TCAAGGTGGC CGTCGATCTT CGCCGCCAGA TCGACCGGCT CGATGCCGAT CTGAAACAGG CCGACCAGAC TATCCTGGAG GCCAAGGCGC AGGCCGACCG ATTGAAAGCC GCAAAACTTG CCGGCGTGCT CGACGGTTCT CTCCCCTCCG ATATGCTCTC TCCGGCACTG CAGGACTGGC GCGACAAATA TGCGGTCGCC AAGACAGCGC TTGCGCAGCT TTCGGCCGAA CTCGGCCCGC GCCATCCGCG GCTGTTGCAG CAGCAGGCCG AAACCGATGG TCTGAAGGAG AATATGGGCA AGGAACTTGC CCGTCTTGCT CAAGCCGCCA ACGCCGCCGC CAAGTTGGCC GTCGATGCGC GCAAGGGCCT GAACGACCGG CGCAACACGC TGATTGCGCA AAGCCGGGAT ACCGGCGTCG ATCTTGCCCG GCTGACCGAG CTCAGCGAGA AGGCGGCCGC CGCGCGTTCG CGCCTCGACG ATACGGCCTC CGCTTTGGCG GGAACGGCCG GCGACGGCCA TATCACTCTG ATGAAGCCGG CGTTGGCAAC CGCGGTATCG GCGCCCGACG GCCTGACTGG CCGCTCGCTG GCCGGTGCTG CCGCGGGTCT CGCCGCCGGT CTTGCTGCGG CTTTCCTGCT GCGCCTGCGT AAACCCTTGG CCGCAGCCGA AGAGGAAATG CCGCCGTCCC AAGCGCTGTC CCAACCGCTA TCTTCGCCGG CACCGCAACC GGTCCCGGCC GAGCTCGACG AGATGGAGGC GCTGCGCTCC GAAATCTCCG GCCTGCGCGA CCGGCTTCTT GTTCATGGCC TCGACGCGCG GCAGCCGCTG CGCTGA
|
Protein sequence | MYDIRARSTF HDRPATRPAA AAPARAHDDR YGLPGGVPRS NRPDIAPPEA ELLRAIGRAL EEQRAKAART APLVDRIETI LGNHLRAAND IGYPLPQEQP HGDEPAAQAD APLISPEPLA ALVKEPAEPR PVAPPRRVAG SALVMTVAAA TMIGACLPVL MPASPALYRA EATLAVKTDA ASRAAFTEAA AKGLMSARVV ASTVAALKLD HDPEFAGQRA NALGVALDLL SATGAAADPA SRAEATLKHS VEILPDAAAG TILVRVTTGD SGKSTRIAAR LAEAVSPANG TGGNTESDAA LRKAYDEAKA ELAAFTAKSG EGNVKVAVDL RRQIDRLDAD LKQADQTILE AKAQADRLKA AKLAGVLDGS LPSDMLSPAL QDWRDKYAVA KTALAQLSAE LGPRHPRLLQ QQAETDGLKE NMGKELARLA QAANAAAKLA VDARKGLNDR RNTLIAQSRD TGVDLARLTE LSEKAAAARS RLDDTASALA GTAGDGHITL MKPALATAVS APDGLTGRSL AGAAAGLAAG LAAAFLLRLR KPLAAAEEEM PPSQALSQPL SSPAPQPVPA ELDEMEALRS EISGLRDRLL VHGLDARQPL R
|
| |