Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3889 |
Symbol | |
ID | 5318683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 346824 |
End bp | 347975 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640775701 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001312634 |
Protein GI | 150376038 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0066878 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCC ATGCCCTTTC CGGCTCAAGA CAAGAGGTCC GCCACGACGC CCCGAGCCCG TTGCCGCGGC GGATATTGAT GACCGTCGAT GCGGTGGGCG GTGTCTGGCG CTATGCGGTG GATCTCGCCG AAGCCATGCG CCGTTCCGGC GTCGAGACGC TGATCGTCGG CTTCGGCCCG GCCCCTTCCC CCGAGCAACG GCGCGAGGCG GAGAGCATCG GCCGGCTCGA ATGGTCGGAT CAGCCGCTCG ACTGGATGGT GGAAGATGAA AGTGAACTCT GCGGCGTTCC GGACCTCTTG TCCGAACTGG CGCTCAAGCA CTCCGTCGAT CTCATGCACC TCAACCTGCC TTCCCAGGCT TCGGGAATAA CGGCTGAGAT TCCCGTCGTC ACCGTTTCGC ATTCCTGCGT AGCCACCTGG TTCGAGGCGG TGCGCGGATC CGGCTTGCCA ACCGGCTGGT GCTGGCAGAA GCGATTGAAC CGAATTGGAT TCGACCGCGC CGACCTGGTG CTTGCCCCGA GCCGGAGTCA TGCCATGGCG GTGACGCGTT GTTACGGTCC GGTTCGCGAT CTCGCGGTCG TCTACAATGC GAGCCGGAAC GCATCCTCCA TCTCCGCCAA GGAGAACTTT GCTTTTGCGG CCGGGCGCTG GTGGGACGAA GGCAAGAATG GAGCCGTACT GGACAGGGCG GCCGCCGATA TAGGCTGGCC GGTCGTCATG GCCGGTGCCT GTGACGGCCC CAACGGGCAA CGCCTGGCGA TTGCGCATGC CGATCACAAG GGCGAACTCA GTCACGACAG GGCGATTTCG CTGATGTCGA GGGCGGCGAT CGTCGTTTCC CCTTCGGTCT ACGAACCATT CGGTCTGGTG GCGCTGGAGG CAGCACGCGC CGGTGCGGCA TTGGTTCTCG CCGACATCGA AACCTATCGC GAGCTCTGGG AGGGCAGCGC TCTTTTCGCC GATGCCGAGG ATCCCGCCGC CTTCGCCGGG GCCGTCAACC GGCTTGCGGA GGATGCGGGG TTCAGGGCCG AGCTCGGCCG GCGAGCGCAG TCGCGCGGTT CCGACTTCAC CCTTGAGGCC CAGCGCGAGG CGATATTCGC CGCCTATAGG CGCGTGATGC GCGGGGAAAA TCGACGGACG GCAGCGGAGT GA
|
Protein sequence | MSVHALSGSR QEVRHDAPSP LPRRILMTVD AVGGVWRYAV DLAEAMRRSG VETLIVGFGP APSPEQRREA ESIGRLEWSD QPLDWMVEDE SELCGVPDLL SELALKHSVD LMHLNLPSQA SGITAEIPVV TVSHSCVATW FEAVRGSGLP TGWCWQKRLN RIGFDRADLV LAPSRSHAMA VTRCYGPVRD LAVVYNASRN ASSISAKENF AFAAGRWWDE GKNGAVLDRA AADIGWPVVM AGACDGPNGQ RLAIAHADHK GELSHDRAIS LMSRAAIVVS PSVYEPFGLV ALEAARAGAA LVLADIETYR ELWEGSALFA DAEDPAAFAG AVNRLAEDAG FRAELGRRAQ SRGSDFTLEA QREAIFAAYR RVMRGENRRT AAE
|
| |