Gene Information |
Locus tag | Smed_4948 |
Symbol | |
ID | 5318159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1460367 |
End bp | 1461395 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776731 |
Product | glycosyl transferase family protein |
Protein accession | YP_001313663 |
Protein GI | 150377067 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
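The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span ends with one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    length = gene_length(1460367, 1461395)
    print(length, protein_length(length))
```

With the values listed for Smed_4948, the sketch reproduces the reported Gene Length and Protein Length.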
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.217559 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAG CCGTGCCGAC AAATGTCTGC ATAATCATCG CCGCGAAGAA CGCCGCCGAC ACCATTGCCC GTGCGGTCTC CTCGGCGCTT GCCGAGCCGG AGGCGGCGGA AGTGGTCGTC ATCGATGACG GCTCGACCGA TGACAGTGCT GCGGTCGCGC GCGCTGCAGA TGATGGCTCC GGCCGGCTGA ACGTCGTTCG CTTCGAGGAA AATCGCGGCC CGGCGGCGGC ACGTAATCAT GCAATCGAGA TTTCGAAGTC TCAGCTTCTT GGCGTGCTCG ACGCGGATGA TTTCCTCTTC CCCGGCCGGC TGCGCCAGTT GCTTTCGCAG GAGGGCTGGG ACTTCATCGC CGATAATATC GCCTTTATCG ATGCCGCGCA GGCGGCAAAC GCACAAGCCA GTATCGATCG CTTCGCCCCT GCTCCTCGGC TCATCGATCT TGTCGGCTTC ATCGAGGGAA ACATTTCGCG CCGCGGCGTG CGGCGCGGAG AAATCGGATT CCTGAAGCCG TTGATGCGGC GCGCCTTCCT CGACCAGCAT GGCCTGCGCT ATAACGAGAC TTTGCGCCTC GGCGAAGATT ACGACCTCTA CGCTCGCGCG CTGGCGAAGG GCGCACGCTA CAAGATCATC CACAGCTGCG GCTATGCCGC GGTCGTGCGC GGCGACTCGC TGAGCGGCAG TCATCGAACC ATCGACTTGA AGCGTCTCTA TGAGGCCGAT CGTGCAATTC TCGCCGGAAA CAAGCTGAGC AACGACGCCG AAGCAGCCCT ACGCCGACAC GAGCGACACA TCCGGGACCG CTACGAACTG CGCCATTTTC TCGATCTCAA GAACCAGCGG GGCTTCGCTC GTGCAATCAG CTATGCCCTG ACCCACCCAG CGGCTCTGCC GGCGATCCTG GGTGGCATCC TTGCGGATAA GACCGAGCGT TTTCGTCCGT CCCGCGCGCC GGCTCCCCTT GCCCTCGGCG GAACGGGCGA TGTCCGCTAT CTGCTCGAGG CCTTGGCCAT GGATCAGCCT CAAAAATAG
Protein sequence | MTAAVPTNVC IIIAAKNAAD TIARAVSSAL AEPEAAEVVV IDDGSTDDSA AVARAADDGS GRLNVVRFEE NRGPAAARNH AIEISKSQLL GVLDADDFLF PGRLRQLLSQ EGWDFIADNI AFIDAAQAAN AQASIDRFAP APRLIDLVGF IEGNISRRGV RRGEIGFLKP LMRRAFLDQH GLRYNETLRL GEDYDLYARA LAKGARYKII HSCGYAAVVR GDSLSGSHRT IDLKRLYEAD RAILAGNKLS NDAEAALRRH ERHIRDRYEL RHFLDLKNQR GFARAISYAL THPAALPAIL GGILADKTER FRPSRAPAPL ALGGTGDVRY LLEALAMDQP QK
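The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It builds the codon table from the standard NCBI amino-acid string, whose codon assignments are shared by translation table 11, and ignores the spaces that separate the 10-character blocks in the record. The function names are hypothetical, and the treatment of the first codon (translated like any other codon rather than forced to M) is a simplifying assumption.

```python
from itertools import product

# Codon assignments in NCBI order (T, C, A, G); identical for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence listed above.
    print(translate("ATGACCGCAG CCGTGCCGAC"))
```

Passing the full gene sequence to `translate` should yield the listed protein sequence without its block spacing, and `gc_fraction` gives the value behind the rounded GC content field.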