Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4715 |
Symbol | |
ID | 5318895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1235316 |
End bp | 1236563 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776513 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001313445 |
Protein GI | 150376849 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0188107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGTAG CATTCGTACA CCGCCGTGGT TTCGGCCAGT TCGCCGCCTT GGCGCGGCAC CTTGCCGAGG CGGGGAACGA GGTCACCCTC GTGACGGAGA CCGTGGATCA GCGGATACCA TCCGTTCGGG TCGTCCGGCA TCGGGCGGAG CCAGGTCCGC AGCCGAACCC GCATATTGCC CGTCACTTCG GGGTTCCCGA TCATCATGTG CGCATCGGGC ACAGGGTTGC GGAGACATTC GACGCCATGC GCCGCCTCGG GCAGGCTCCC GACGTCATTC TCGGCCATAT AGGTTGGGGC AGCATGATGT TCGTGAAGGA TGTCCTGCCG CGGGTGCCGG CACTGGGCTA TTGCGAGTTT TTCTATCGGG CTGAAGGAGC GGATGTCGGC TTTGCGCCCG ACGACCGTCC CGATTCCGAG ACGCGGAAAC GCTTGCGCCT TCGCAATGTG GCGCAGCTCC TTTCTCTCGA AGCGATGGAT GGCGGCATCA GCCCCACGAA CTGGCAGAGA AGCCTCTATC CAGGTGACGC GCAGCAAAGA ATTGCCGTTT GCCACGAAGG CATAGACACG CGCCGTTTTC GCTCCGATCC TGCAGCTTCG CTGAAGCTTC CGGACGGGCG TGTCCTGAAG GCTGGAGATC CGGTCGTCAC TTTCGTCGCG CGGGACCTTG AACCTTATCG CGGGTTCCCG CAAGCACTGG AGGCGGCGGC GAAGGTCGTC CGACGGCATC CGGACGCGCT GTTCGTTTTC GTCGGCGGCG ACGGGGTCAG CTACGGCGCG CCGCCCCCCG GCGGGGGATC GTGGAAGGAT CATCTGCTTG CGTCGCTGGA CGTTCCGCGT GAGAGACTTA TTTTCCCGGG CGTCGTGCCG CATTCGGTGT TGCGACAACT CTTTCAGATC TCCGCGGCGC ATCTCTACCT CACCTATCCC TTCGTGCTTT CCTGGTCGGT GCTTGAGGCA ATGGCCTGCG GCGCGCTTGT TATCGGATCG GATACGGCGC CGGTTCAGGA AGTCATCCGC TCGGGCCGCA ACGGGCTTCT CGTCCCGTTC TTTGACCCCG ACACGCTGGC CGGAACGATT CTGGACGTGT TGAAGGGTGC AGAGGGGGTC AGTGCAATGC GCCGCGCGGC GCGCAGAACG GTGGAGCAGA GGTTTCGGTT GGACGATTGC CTCGCGCGCC AGTTGAAGCT GGTAGACAAT CTCGCCCGGC GGAATGCTGC ACTAGCGAAG GAAACGCAGA TTACGTAG
|
Protein sequence | MHVAFVHRRG FGQFAALARH LAEAGNEVTL VTETVDQRIP SVRVVRHRAE PGPQPNPHIA RHFGVPDHHV RIGHRVAETF DAMRRLGQAP DVILGHIGWG SMMFVKDVLP RVPALGYCEF FYRAEGADVG FAPDDRPDSE TRKRLRLRNV AQLLSLEAMD GGISPTNWQR SLYPGDAQQR IAVCHEGIDT RRFRSDPAAS LKLPDGRVLK AGDPVVTFVA RDLEPYRGFP QALEAAAKVV RRHPDALFVF VGGDGVSYGA PPPGGGSWKD HLLASLDVPR ERLIFPGVVP HSVLRQLFQI SAAHLYLTYP FVLSWSVLEA MACGALVIGS DTAPVQEVIR SGRNGLLVPF FDPDTLAGTI LDVLKGAEGV SAMRRAARRT VEQRFRLDDC LARQLKLVDN LARRNAALAK ETQIT
|
| |