Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4717 |
Symbol | |
ID | 5318897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1238200 |
End bp | 1240881 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776515 |
Product | glycosyl transferase family protein |
Protein accession | YP_001313447 |
Protein GI | 150376851 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.337059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0164521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGCA ATAGGGAAGA TACCGACCTC ATAGTGGTGG CGATGCCGCT CTACGGCCAT GCAGCCCTCG TTCTGGAGGC GATCGAATCC GTTCTCGCCT CGAAGCTCTC CGGATGCAGT GTAGCGGTCG TCGTATCGGT CGATGGGGAT CCGCGACAGG AGACCTTCGA TCAGCTGCTT CTCTACGCGG CGGCCCATCC GGCGGTCCAT GTTCTCTTCG GCGCCAATGC CGGGCCGGGC GGGGCGCGCA ACCGTGCGAT CGAATACGTT CTCGCGAACC TGCCGGAAGC CGAAGCAGTC TATTTCCTCG ATGCCGACAA TCGCGTGCTG CCCGGGACCA TCGAGACCCT ATACAGGCAA CTGCGCACAA GCGGTTGCGG GTGGATTTAC ACCAATATCG ACACTTTCTC GGTGAGCTGG CGCGCCCATT ACGGCAATCG CTACTCGCGG CTCCTGCACT GCATAACGGA CAATATCTGC GACACCGGCT CCATGATCTC GCTCGATGTG TTTCGGGCGG GGGTTCGTTT CGACGCCGAC AGGCAGAACG GTTTCGAGGA CTGGGAATTC TGGCTGTCCT GCATCGAGCA CGGCTTCGTC GGAGAACCAT GCCACGACAC GACCTTCGAA TACCGGCTAA GGGCGGAGAG CCGGTTCAAG GAGGCGAACC GCGATCGCGC TGCCTCCGTG AGCTTCCTCC GAAAGCGCCA CAGGGCGCTG TTCCAGCGCC CCATGCTCGT CGATTTCGAG CATGAGGAAT GCCCGCGTTA TCTCTTTGCG CGGACGGAAG ACGCCGCCAT TTCTTACTTT ACCGACCCGA CAAAGGCACC TAAGCGGCTT CGCCTCGACG ATATCATTCC GGCGTTCTGG GCGAGCGTCG GTGAGCCGGA CAACGTGCAT TTCCCGCCCT TCCTGATCGC CGGAAGCGGC GCCACGCTCG ACCTGCTGCT GCGCTCCCGG ATGCTGCCAA ACGTGCTCTC CCACCTGGAG CGCCTGAGCG AGAAGGCCAA TGTGGTATTC GTCCAGCTCG GCAACGATGC GGCTCAGCGC AAGATCGAAC CGGTCTTCCT TGAGGCCGGC GCCCAGCATA ACGCTTCGCC GGACCTCATT TTTCTGTCCA CATCGCTCGT GCGCGACGTC ATCCAGAACA AGGCGCTGGA CTGGTTCGCG TCCATCGGCA ATCAGCAGGT CTGGCCGACA TCGGCTATCT TGAAAGTGCG TTTTCCCTTC CCGAGAAGCC TGCCGCGCCG TTTGCTCATC ACACCCCAGC AGGTGATGAT CAACTGCGTC AACGCCATCG CGACCAGCCC GCTGCGGCGA ACGGCCGGCA AACGCTGGAC CTGGCGCCCG GCACGGCTCG TTCCCTATTC CGATCTTCAC AAAGCCCTTC GCCAGGAGGT CGGAGGCTCG CCCATATTGC CGCTCGGCCA TAGCGAGGGG GGCAGAAAAA CGGCGGCCCT GCTCGTGCCC AATGCATCCT TCGGTGGCGC CGAGAAGGTC GTATACGCTG CTTCGCGCGA GCTGAAGGCT GCGGGCTACG AGACACATCT TTTCGTGCTC GGCACATCCA GAATGGATGT GATCGACGAG TTCGACCAGA GCTTCGATTA TATTCACTTC TGGGACCAGG GTATCCCCGC CTGGGGCGGC TCCGGCTCGT TCCTGGGGCA CGATTTCATT GCCGAAGGCC ATGACGTCGA TTGGGCCGCG CTCAAGGGTC AGCTTTCAGG CTTCGACCTC GTGATCAACA ATCACGTCAT GGCGGTGCAT CCGCTTATCG CGCGCCTGCG CTCGGAAGGA ACCCGAACAG CCTGCTACCT TCATGTGGTC GACAATACCG CCTTCAAAAG ACCCGCCGGC CAGCCTTTTG CGGCGATCGC CCACGAGCAT TGCTATGATG CGTTCCTGAC CTGCTCGGAA CAGCTCAAGC TTTATCTTCA CAGTTTCGGC GTGCCGCATG AAAAGATCTT TGCCGTACCC AACGGCGCGA GCTTCTCCGT GCCGCCCAAG GTCCTGTCTG AAGTGCTTTC GGTCAGACGG ATCGAACGGA AGGACGATCG GCTTCGGGTG CTCTATATGG GTCGCCTCGA CCAGCAAAAG GGGATCGATC GTCTGGCTGC GGCCATCGCC GAGCTGCGCG CGTCGCGCGT GCCTTTCGAT GCCCGGGCGG TTGGCGGTGA GATTCTTGCG GATGCCACGA TATCCTGGAC GGATCGCCTG AGGGACTTGG GCGTCGAAGT GCGCTCACCG GTCTTCGCCA GCAAGGATCT GATCAAGGCG CTCGGTTGGG CGGATGTCCT TCTGATGCCG TCCCGCTGGG AGGGGGCTCC CTTGATGATT GCCGAGGCAC AGCAGCTCGG CTGCGTACCG ATCGCGACGG CGGTTGGTGC CGTCGACGAG CTCATCACCG ACGGCGAGGA CGGCATTCTG ATCGAGGCCG CGGCCGATCC GCAGGTGGTG CGGGATATGG CAAAGGCCAT CGAGGAAGTG GCCCTCAATC GTAAGCAGCT GGCGCCCCTC ATGGAAGGCT GCCTCAGAAC AGCGGCACGC CGTTCATGGG GATCGTCCTT CTCCGAGTTC CTCGGCTGGT GCGATCGCTC TGTGCATAAT TCATCGCTTT CGCGCGCAAC GGTCATACGT GGCCGCGAGG CATCCAATCC CGGAGTAGCG GCTGTGGGCT GA
|
Protein sequence | MSGNREDTDL IVVAMPLYGH AALVLEAIES VLASKLSGCS VAVVVSVDGD PRQETFDQLL LYAAAHPAVH VLFGANAGPG GARNRAIEYV LANLPEAEAV YFLDADNRVL PGTIETLYRQ LRTSGCGWIY TNIDTFSVSW RAHYGNRYSR LLHCITDNIC DTGSMISLDV FRAGVRFDAD RQNGFEDWEF WLSCIEHGFV GEPCHDTTFE YRLRAESRFK EANRDRAASV SFLRKRHRAL FQRPMLVDFE HEECPRYLFA RTEDAAISYF TDPTKAPKRL RLDDIIPAFW ASVGEPDNVH FPPFLIAGSG ATLDLLLRSR MLPNVLSHLE RLSEKANVVF VQLGNDAAQR KIEPVFLEAG AQHNASPDLI FLSTSLVRDV IQNKALDWFA SIGNQQVWPT SAILKVRFPF PRSLPRRLLI TPQQVMINCV NAIATSPLRR TAGKRWTWRP ARLVPYSDLH KALRQEVGGS PILPLGHSEG GRKTAALLVP NASFGGAEKV VYAASRELKA AGYETHLFVL GTSRMDVIDE FDQSFDYIHF WDQGIPAWGG SGSFLGHDFI AEGHDVDWAA LKGQLSGFDL VINNHVMAVH PLIARLRSEG TRTACYLHVV DNTAFKRPAG QPFAAIAHEH CYDAFLTCSE QLKLYLHSFG VPHEKIFAVP NGASFSVPPK VLSEVLSVRR IERKDDRLRV LYMGRLDQQK GIDRLAAAIA ELRASRVPFD ARAVGGEILA DATISWTDRL RDLGVEVRSP VFASKDLIKA LGWADVLLMP SRWEGAPLMI AEAQQLGCVP IATAVGAVDE LITDGEDGIL IEAAADPQVV RDMAKAIEEV ALNRKQLAPL MEGCLRTAAR RSWGSSFSEF LGWCDRSVHN SSLSRATVIR GREASNPGVA AVG
|
| |