Gene Information |
Locus tag | Smed_4794 |
Symbol | |
ID | 5319023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1310194 |
End bp | 1311414 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640776588 |
Product | glycosyl transferase family protein |
Protein accession | YP_001313520 |
Protein GI | 150376924 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
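The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive (which is consistent with the 1221 bp reported here) and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the source database:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(1310194, 1311414)
print(length, protein_length_aa(length))  # prints "1221 406"
```

Both results match the Gene Length and Protein Length rows of this record.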
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGAAA CATTCTTGTG CTATGCAATC TTCGGGTTGT CAGCAGTCCT TTCAGTCCCG AGTGGAATTT ACGCGGTGGA GTGCTCTATC GGCAGTCTGC CGCTACAAAG GAGGCGAGTC GCCGAATTCG GCCTGAAGGC AAATGCTGCG GTAACGGCCG TTCTGGTACC TGCACATAAC GAGGAAAGCG GCATCGCCGA TACCCTTGCC AATATCGGCG CCCAGCTTTG CGATCAGGAT CGCTTGATCG TAGTGGCCGA TAACTGCTCC GACCGGACTG CGGCACTTGC CCAGGAGGCG GGGGCCGAGG TGATCGAGCG GTTCGACGTC GACCGCCGAG GCAAGGGCTA CGCCCTTGAT GCCGGAATTC GCCACCTGGA AAAAATGCCG CCGGAAATCG TGGTGTTGAT GGATGCGGAC TGCCGCCTTG GGGAAAAGGC GCTGGAGAGG CTTAGAGCGT CGGTTCTCGC GAGCGGCATG CCGGGCCAAT CGCGCAACCT GATGACAGCT CCTGATGGTG CCGCGCCTAA TCTTTTGGTC GCAGAATTTG CGTTCCTCGT TAAGAACTAT GTGCGTCCAC TCGGACTCGT CCGCATGAAC CTACCGTGTC ACGCCACCGG AACGGGTCTC GCAATCCCGT GGCATGCCTT ACGCGGCGCG GATGTCGCCC ATGCGCATCG TGTTGAGGAC ATGAAGCTTG GGCTGGATCT CGCGAGCGCC GGGTATGCGC CGCGGTTCTG CGAGGAGGCA CTTGTGACCA GCCAATTCCC CTGCTCCGGG GAAGGGGCAA ACGCGCAGCG CCGGCGTTGG GAGGGAGGGC ATCTGGAGAT GATCCGCTCG GAGCTACGCT CTCTGGTGAA CCCGTTGGTT CTCCGAAATC CTGCACGCCT CGCGCTGGCG CTGGATCTCA TGGTGCCGCC TTTGACACTG CTTGTGCTCC TGCTCGCATC TGTGATGCTG CTTTCCGTCA TGCTCGCCGC ATCCGGGCTA TCCAGCTGGC CATTGGTCAT TGCGACGGTG AATCTGGCCC TTGTTTTCGT CGCAACACTC GGTGCCTGGT ACGTCCACGG GCGCAAGGCA CTGCCTGCGG CGACGATAAG TAGAATTCCA CTCTATGTCC TATGGAAGTT GCGGCTGTAC CCGCGCGCCC TCTTGGGTGC GAATGAGGGC TGGGTCAGGA CTGATCGCGA CAAGGACGTT TCGGGAGAAA GCCGAACTTA G
|
Protein sequence | MLETFLCYAI FGLSAVLSVP SGIYAVECSI GSLPLQRRRV AEFGLKANAA VTAVLVPAHN EESGIADTLA NIGAQLCDQD RLIVVADNCS DRTAALAQEA GAEVIERFDV DRRGKGYALD AGIRHLEKMP PEIVVLMDAD CRLGEKALER LRASVLASGM PGQSRNLMTA PDGAAPNLLV AEFAFLVKNY VRPLGLVRMN LPCHATGTGL AIPWHALRGA DVAHAHRVED MKLGLDLASA GYAPRFCEEA LVTSQFPCSG EGANAQRRRW EGGHLEMIRS ELRSLVNPLV LRNPARLALA LDLMVPPLTL LVLLLASVML LSVMLAASGL SSWPLVIATV NLALVFVATL GAWYVHGRKA LPAATISRIP LYVLWKLRLY PRALLGANEG WVRTDRDKDV SGESRT
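The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the pipeline used to build this record. It assumes the sequence is pasted with the spaces between 10-base blocks, as shown above, and uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons; this gene starts with ATG). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCTGGAAA CATTCTTGTG CTATGCAATC"))  # prints "MLETFLCYAI"
```

Passing the full Gene sequence value to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields of this record.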