Gene Information |
Locus tag | Smed_4353 |
Symbol | |
ID | 5318202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 850833 |
End bp | 852446 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640776158 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001313091 |
Protein GI | 150376495 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.201059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.455905 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCGGC GCAGCGAGCC CGAGGGACAG GGATTACGCC ATGGAAGGCC TTCACAGCGC CTTCGCCCGG CTGGGGGTTC GTTGCTTGAC TTCGTGCCGG CCTCGGCGGA AGAGACTGCT CCTCCCGAAG AGGCGCGCAC TCTAGAGAAG GTTGGCGCCT CCGTCCGTGT CGTGGACCTG CCGGCTCCGA GCCATAACGC ACGCGCTCCT GAGGAGCCGG CGTCCGCCGC CGATTTCTTC CCGCAACTCC ACCTTTTAGG CAGCATCGGG ATCAACGACA TCATCGTCTG GCTTCGCGAC GGGATCCGCT GGATCGTCAT CGCGCTCCTC CTCTCCGGTG CCGCGGCTTT CGCTTATGCC GTGACGGCGA CGCCGCGATA CACCGTTTAC ACCGACCTCG TCGTGGACCC TTCCAATCTC AACGTCGTAA GCGACGACGT TTTCACGACC AATCCGCAGC GAGACGCGCA ATTGCTGGAG GTCGAAAGCA AACTCCGGAT CCTGACATCG CGCAATGTGC TCCAGCGCGT GATCGACGAG CTGCGGCTTT CCGAAGACCC GGAATTCGTC AAGCCGACCT TGCTGGACTG GCTGAAGGCG CTGCTTGCCC CGCGGGACGG TAAGACAGAC AAGGATCTGG CGGCCATGCG CGTCCTGTCG GAGAGGGTCG AAGCGCGCCG GGAGGAGCGC TCTTTCGTCG TGGTCCTGAA GGTCTGGAGC GAGGAGCCTG CCAAAGCCAT CGCCCTTTCG GACGCGATCG TCGAGGCGTT CGAGGCGGAA CTGTTCCAGT CGTCCGCCGA GAGTGCCGGC CGGGTGGCGC AGAACCTGAA CGCACGCCTC GATGAACTGC GCCGCAACGT CACCGAGGCG GAAAGGAGGG TTGAGGACTT CCGCCGCCAG AACGGTCTGC AATCCACCAA TGGCGAGCTC GTCAGCAATC AGCTTTCGAG CGAGCTCAAC ACGCAGGTTC TGGACGGTCA GCAGCGCTTC ATTCAGGCGG AAACGCGCTA CAGGCAGATG AGCTCCGCCG TTGCCGGGGG CCGGACCGCC AGCGCTTCGG TGTTCGAGTC GGCCAACATG ACCGATCTGC GCCAGCAGTA TAATGCCTTG CAGCAGCAGA TAGGATCAAT GCAGCTTACC TATGGAGAGC GGCATCCGCG GCTCGTCGCC GCCCGCTCCG AGCGTGCGAC GCTGGAAACC GCGATGAAGG ATGAAGCCCG CCGCATTCTG GAGCGCGCAA AGGCCGATTT CGATCGGGAG CGAAAGGCGC TCGCTACGCT GCGCGGCAAG GCGGACAACG AAAAATCGAA CGTCTTCACC GATAATCAAG CCCAGGTGCA GCTTCGCGAC CTCGAGCGCG ACGCGCGCAG CAAGGCGGCG ATCTACGAAA CGCACCTGGC ACGCGCACAG CAGATCACCG AGCGCCAGCA GATCGACACA ACCAATGTCC GCGTCATCTC GCGCGCCCTG CCGCCGAATG CGCGAAGCTG GCCGCCTCGT ACCCTGGTTC TCCTGATCGG CGGCGCCTTT CTGGGTCTCG CCCTCGGAAT TGCTACGGCC CTCGCCCTCG GCCTTTGGCG GTTCCTTCGC GGCAAAACGC GCGCGGCTGC CTGA
|
Protein sequence | MYRRSEPEGQ GLRHGRPSQR LRPAGGSLLD FVPASAEETA PPEEARTLEK VGASVRVVDL PAPSHNARAP EEPASAADFF PQLHLLGSIG INDIIVWLRD GIRWIVIALL LSGAAAFAYA VTATPRYTVY TDLVVDPSNL NVVSDDVFTT NPQRDAQLLE VESKLRILTS RNVLQRVIDE LRLSEDPEFV KPTLLDWLKA LLAPRDGKTD KDLAAMRVLS ERVEARREER SFVVVLKVWS EEPAKAIALS DAIVEAFEAE LFQSSAESAG RVAQNLNARL DELRRNVTEA ERRVEDFRRQ NGLQSTNGEL VSNQLSSELN TQVLDGQQRF IQAETRYRQM SSAVAGGRTA SASVFESANM TDLRQQYNAL QQQIGSMQLT YGERHPRLVA ARSERATLET AMKDEARRIL ERAKADFDRE RKALATLRGK ADNEKSNVFT DNQAQVQLRD LERDARSKAA IYETHLARAQ QITERQQIDT TNVRVISRAL PPNARSWPPR TLVLLIGGAF LGLALGIATA LALGLWRFLR GKTRAAA
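The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the record above.
START_BP = 850833
END_BP = 852446
GENE_LENGTH_BP = 1614
PROTEIN_LENGTH_AA = 537


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the gene sequence begins with ATG and ends with TGA, a stop codon in translation table 11, which is consistent with 538 codons yielding 537 residues.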