Gene Information |
Locus tag | Smed_4356 |
Symbol | |
ID | 5318205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 854551 |
End bp | 855780 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776161 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001313094 |
Protein GI | 150376498 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
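The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 854551, End bp 855780.
length = gene_length_bp(854551, 855780)
print(length, protein_length_aa(length))  # prints: 1230 409
```

Under these assumptions the result agrees with the listed Gene Length (1230 bp) and Protein Length (409 aa).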
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.784045 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACCCGAA AACGCCTCCT TGCGATCAAC AACTATTTCT ATCGCCGCGG CGGCGCGGAG GCAGTCTTCT TCGACCATAT GAGCATGTTC GGCGAGATCG GCTGGGACAT CGTCCCTTTC GCCATGCAGC ACGATCTCAA CGAGCCCTCG CCCTGGTCGG ACTACTTTGT CTCGGAGATA GAATATGGCC GTCGGACCGG TCTCCTGCGG AAAGCCGTGC AGGCGGCGAG CGTCATCTAT TCGCTGGAGG CGCAACGAAA TCTCGGCCGG CTGATCGAGC GCGCTCGTCC TGCGGTAGCG CACGCACACA ACGTCTACCA CCATCTGTCG CCCGCCATCT TCTCCACTTT GAAGGCAGCC GGCATTCCCG TGGTAATGAC GGTCCATGAC CTGAAGCTTG CCTGCCCCTC CTATAAGATG CTTCGCGACG GCAAGGTGTG CGAGGACTGT CGCGGCGGCA GGGTCTACAA CGTGCTGCGC CATCGCTGCG TGAAGGGATC GGCCCCGCTG AGCGCCGTCG TGCTCGCCGA AACGCTGCTG CACCGCTTGC TTGGACTCTA TCGGGACAAA GTGGACCGGT TGGTCGTACC CAGCCGGTTT TATCTGGAGA AGCTCGCCGA ATGGGGCTGG CCGCGCGAAA AGATGGTCCA CATCGCTAAT TTCGTCGACG TCACAAATCT TTCCGTTCAT CGGCAGGAGA GTGATTATTT CGCCTTTGCC GGGCGCCTGG CACCGGAAAA GGGCCTCACG ACCCTGATCA GGGCGGTTGC CCTGTCGAAA CAGCGGCTCG TCATTGCCGG CACTGGACCG GAGGAGCAGG CCCTTCGCGG GCTAGCAGCC GAACTTGGGG CGGATGTGAG CTTTGCCGGC TATCTCTCGG GGGAGAGGCT GCACAGGCTG ATCGGCGAGT CGCGCGCACT CGTCCTCCCG TCGGAATGGT ACGAAAACGC TCCCATAAGC GTGCTCGAAA CCTACGCGCT CGAGCGACCG GTGATAGGCG CAGCGATCGG TGGAATTCCC GAAATGGTGA AGGAGGGCGA AACCGGCCTG TTGGCCGCTC CGGGGAACGT CGAAGACCTT GCCAGAGCGC TACGCGAAAT GGCGGCACTC TCGCCCGCGG CACGCGCGCG CATGGGGACC GCTGGCCGAT CGTGGGTCGC GACCGAATTC TCCGCCGCCG CCTACCGGGA GAGGACACTC GATCTCTACG CGGGAATCGG TGTTGCCTGA
Protein sequence | MTRKRLLAIN NYFYRRGGAE AVFFDHMSMF GEIGWDIVPF AMQHDLNEPS PWSDYFVSEI EYGRRTGLLR KAVQAASVIY SLEAQRNLGR LIERARPAVA HAHNVYHHLS PAIFSTLKAA GIPVVMTVHD LKLACPSYKM LRDGKVCEDC RGGRVYNVLR HRCVKGSAPL SAVVLAETLL HRLLGLYRDK VDRLVVPSRF YLEKLAEWGW PREKMVHIAN FVDVTNLSVH RQESDYFAFA GRLAPEKGLT TLIRAVALSK QRLVIAGTGP EEQALRGLAA ELGADVSFAG YLSGERLHRL IGESRALVLP SEWYENAPIS VLETYALERP VIGAAIGGIP EMVKEGETGL LAAPGNVEDL ARALREMAAL SPAARARMGT AGRSWVATEF SAAAYRERTL DLYAGIGVA
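The relationship between the two sequences can be sketched in a few lines of standard-library Python. This is an illustrative sketch, not the pipeline that produced the record. It assumes the amino-acid assignments of NCBI translation table 11 (which match the standard code) and that table's alternative start codons, and it translates the initiator codon as M, which is why a gene beginning with GTG yields a protein beginning with M. The names `clean`, `gc_fraction`, and `translate_cds` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS (start codon through stop codon)."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    # The initiator codon is read as Met regardless of its usual meaning.
    protein = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    if protein[-1] != "*":
        raise ValueError("CDS does not end in a stop codon")
    return "".join(protein[:-1])


# First ten codons of the gene sequence above, followed by its TGA stop codon.
print(translate_cds("GTGACCCGAA AACGCCTCCT TGCGATCAAC TGA"))  # prints: MTRKRLLAIN
```

Passing the full Gene sequence field to `translate_cds` and comparing the result with the Protein sequence field (after `clean`) is one way to check the record for internal consistency; `gc_fraction` can be compared with the GC content field in the same manner.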